The second defined motif has now been implemented in Java.

[Figure: metagraph]

Example output from Java (cutoff dataset):

Protein-Protein-Similarity

The code was run on the updated dataset:

  • There were 70,866 instances of this motif definition in the dataset.
  • However, this doesn’t address the issue with the OMIM database, so many of these instances will not have true “Diseases”. This would probably be best addressed when scoring this motif type, if that is even possible.