<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Semantic Motif Hunting &#187; Ondex</title>
	<atom:link href="http://bio.adking.co.uk/category/ondex/feed/" rel="self" type="application/rss+xml" />
	<link>http://bio.adking.co.uk</link>
	<description>Research Blog</description>
	<lastBuildDate>Wed, 06 Oct 2010 22:04:10 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Stats</title>
		<link>http://bio.adking.co.uk/2010/07/01/stats/</link>
		<comments>http://bio.adking.co.uk/2010/07/01/stats/#comments</comments>
		<pubDate>Thu, 01 Jul 2010 14:06:53 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Paper]]></category>
		<category><![CDATA[p-value]]></category>
		<category><![CDATA[Stats]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=194</guid>
		<description><![CDATA[4842 &#8211; Drugs 3500 &#8211; Targets 10743 &#8211; bi_to 274607 &#8211; h_s_s With this information how likely/often will a motif appear in the dataset. ONDEXView hss = graph.getRelationsOfRelationType(graph.getMetaData().getRelationType("h_s_s")); int j = 0; for (ONDEXRelation r : hss) { ONDEXConcept to = r.getToConcept(); ONDEXConcept from = r.getFromConcept(); if (to.getOfType().getId().equals("Protein") &#38;&#38; from.getOfType().getId().equals("Protein")) { j++; } } System.out.println("hss]]></description>
			<content:encoded><![CDATA[<ul>
<li>4842 &#8211; Drugs</li>
<li>3500 &#8211; Targets</li>
<li>10743 &#8211; bi_to</li>
<li>274607 &#8211; h_s_s</li>
</ul>
<p>With this information how likely/often will a motif appear in the dataset.</p>
<p><code><br />
ONDEXView hss = graph.getRelationsOfRelationType(graph.getMetaData().getRelationType("h_s_s"));<br />
int j = 0;<br />
for (ONDEXRelation r : hss) {<br />
ONDEXConcept to = r.getToConcept();<br />
ONDEXConcept from = r.getFromConcept();<br />
if (to.getOfType().getId().equals("Protein") &amp;&amp; from.getOfType().getId().equals("Protein")) {<br />
j++;<br />
}<br />
}<br />
System.out.println("hss " + j);<br />
</code></p>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/07/01/stats/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>I&#8217;m a ropinirole star</title>
		<link>http://bio.adking.co.uk/2010/06/22/im-a-ropinirole-star/</link>
		<comments>http://bio.adking.co.uk/2010/06/22/im-a-ropinirole-star/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 11:50:34 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=137</guid>
		<description><![CDATA[Played around with Ondex suite to try and define more motifs. There were 16 instances of motifs(2) involving Ropinirole. Useful IDs: Chlorpromazine 120943 Trimeprazine 120836 Ropinrole 118054 Pramipexole 118131 Bupropion 118567 Sibutramine 118536 Lidocaine 120780 Celecoxib 121260 Finasteride 118603 Raloxifene 118164 Topiramate 116780]]></description>
			<content:encoded><![CDATA[<p>Played around with Ondex suite to try and define more motifs.</p>
<div id="attachment_139" class="wp-caption aligncenter" style="width: 310px"><a href="http://bio.adking.co.uk/wp-content/uploads/2010/06/hahaha.png"><img class="size-medium wp-image-139 " title="Ropinirole" src="http://bio.adking.co.uk/wp-content/uploads/2010/06/hahaha-300x143.png" alt="Ropinirole" width="300" height="143" /></a><p class="wp-caption-text">Ropinirole</p></div>
<p>There were 16 instances of motifs(2) involving Ropinirole.</p>
<p>Useful IDs:</p>
<ul>
<li>Chlorpromazine <strong>120943</strong></li>
<li>Trimeprazine<strong> 120836<br />
</strong></li>
<li>Ropinrole <strong>118054</strong></li>
<li>Pramipexole <strong>118131</strong></li>
<li>Bupropion <strong>118567</strong></li>
<li>Sibutramine <strong>118536</strong></li>
<li>Lidocaine <strong>120780</strong></li>
<li>Celecoxib <strong>121260</strong></li>
<li>Finasteride <strong>118603</strong></li>
<li>Raloxifene <strong>118164</strong></li>
<li>Topiramate <strong>116780</strong></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/06/22/im-a-ropinirole-star/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Motif Definition 2</title>
		<link>http://bio.adking.co.uk/2010/06/17/motif-definition-2/</link>
		<comments>http://bio.adking.co.uk/2010/06/17/motif-definition-2/#comments</comments>
		<pubDate>Thu, 17 Jun 2010 14:08:58 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Project]]></category>
		<category><![CDATA[Code]]></category>
		<category><![CDATA[Java]]></category>
		<category><![CDATA[motif]]></category>
		<category><![CDATA[protein]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=125</guid>
		<description><![CDATA[The second motif that was defined has been completed in Java. Example output from Java (cutoff dataset): Code was ran on the updated dataset: There were 70,866 instances of this motif definition in the dataset. However, this doesn&#8217;t address the issue of the OMIM database. So many of these will not have true &#8220;Diseases&#8221;. This]]></description>
			<content:encoded><![CDATA[<p>The second motif that was defined has been completed in Java.</p>
<p><a href="http://bio.adking.co.uk/wp-content/uploads/2010/06/p_p_similar.png"><img class="aligncenter size-full wp-image-82" title="p_p_similar" src="http://bio.adking.co.uk/wp-content/uploads/2010/06/p_p_similar.png" alt="metagraph" width="422" height="218" /></a>Example output from Java (cutoff dataset):</p>
<p><a href="http://bio.adking.co.uk/wp-content/uploads/2010/06/newAbstractBig.jpg"><img class="aligncenter size-full wp-image-126" title="proteintoprotein" src="http://bio.adking.co.uk/wp-content/uploads/2010/06/newAbstractBig.jpg" alt="Protein-Protein-Simularity" width="259" height="262" /></a></p>
<p>Code was ran on the updated dataset:</p>
<ul>
<li>There were <strong>70,866</strong> instances of this motif definition in the dataset.</li>
<li>However, this doesn&#8217;t address the issue of the OMIM database. So many of these will not have true &#8220;Diseases&#8221;. This would probably be best addressed when trying to score this type, if its even possible to do this.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/06/17/motif-definition-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Abstract Motifs</title>
		<link>http://bio.adking.co.uk/2010/06/14/abstract-motifs/</link>
		<comments>http://bio.adking.co.uk/2010/06/14/abstract-motifs/#comments</comments>
		<pubDate>Mon, 14 Jun 2010 15:00:34 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Project]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=90</guid>
		<description><![CDATA[Developed a java class to represent a abstractMotif as described in the paper (has fields for drug1, drug2 and thepotential target for drug1 (see previous post)). Created a TreeSet (Allows automated sorting of a Set, currently sorting by drug1.getId()) to store motifs found within the dataset, these are added in the method when a target]]></description>
			<content:encoded><![CDATA[<p>Developed a java class to represent a abstractMotif as described in the paper (has fields for drug1, drug2 and thepotential target for drug1 (see previous post)). Created a TreeSet (Allows automated sorting of a Set, currently sorting by <code>drug1.getId()</code>) to store motifs found within the dataset, these are added in the method when a target is discovered. These motifs can then be manipulated when trying to score this type of motif.</p>
<p>It would be a good idea to cross check this with Chlorpromazine, but I need to get Linux up and running properly in a dual-boot due to memory issues in Windows (also not being able to print concept names is a problem).</p>
<p>Wrote a toString() method for a nicer output for testing purposes:</p>
<p><code>Drug1 &lt;--&gt; Drug2 --&gt; Target</code></p>
<p>I wanted the Concepts name for example Chlorpromazine:</p>
<p><code>Chlorpromazine &lt;--&gt; Trimeprazine --&gt; Histamine H1 Receptor</code></p>
<p>However , the ondex API is not working for printing names? &#8211; so using getId()&#8217;s for now.</p>
<p><strong>Output</strong></p>
<p>Here is some current output of the basic semantic motif definition.</p>
<p><a href="http://bio.adking.co.uk/wp-content/uploads/2010/06/2.jpg"><img class="aligncenter size-medium wp-image-108" title="2" src="http://bio.adking.co.uk/wp-content/uploads/2010/06/2-300x271.jpg" alt="" width="300" height="271" /></a></p>
<p><strong>Differences</strong></p>
<p>This new implementation is different to the previously developed code by Cockell, S.J. for the 2010 paper. However, in the <code>cutoff_data.xml.gz</code> 157 motifs were identified, the same as the using the other method. However, there is a difference when using the code on the larger dataset <code>ib2010_data.xml.gz</code>. Previously there were 26,693 motifs identified compared with 29,633 using the new implementation, a difference of <strong>2,940</strong>.</p>
<p><strong>Note</strong></p>
<p>It should also be reminded that it was discovered in the proper dataset:</p>
<ul>
<li>Drugs seem to have the ConceptClass &#8220;<code>Comp:Compound</code>&#8220;.</li>
<li>Rather than the ConceptClass &#8220;<code>Compound</code>&#8220;.</li>
<li>There are 70 of these, none of which seem to bind to a target.</li>
<li>There are over 8k concepts of the former, so I will use this.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/06/14/abstract-motifs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dustaride</title>
		<link>http://bio.adking.co.uk/2010/06/13/dustaride/</link>
		<comments>http://bio.adking.co.uk/2010/06/13/dustaride/#comments</comments>
		<pubDate>Sun, 13 Jun 2010 09:01:51 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Test]]></category>
		<category><![CDATA[Output]]></category>
		<category><![CDATA[Testing]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=110</guid>
		<description><![CDATA[Motif&#8217;s identified by traversing the java output and finding the relevant nodes in the Ondex suite:]]></description>
			<content:encoded><![CDATA[<p>Motif&#8217;s identified by traversing the java output and finding the relevant nodes in the Ondex suite:</p>
<p style="text-align: center;"><a href="http://bio.adking.co.uk/wp-content/uploads/2010/06/dustaride-motif.png"><img class="aligncenter size-medium wp-image-113" title="dustaride motif" src="http://bio.adking.co.uk/wp-content/uploads/2010/06/dustaride-motif-300x135.png" alt="" width="300" height="135" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/06/13/dustaride/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Getting data from relations and concepts in Ondex</title>
		<link>http://bio.adking.co.uk/2010/06/01/getting-data-from-relations-and-concepts-in-ondex/</link>
		<comments>http://bio.adking.co.uk/2010/06/01/getting-data-from-relations-and-concepts-in-ondex/#comments</comments>
		<pubDate>Tue, 01 Jun 2010 11:05:39 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Project]]></category>
		<category><![CDATA[Attribute]]></category>
		<category><![CDATA[Output]]></category>
		<category><![CDATA[Tanimoto]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=54</guid>
		<description><![CDATA[Did not know how to get attributes from the ondex data, researched into it by looking at various classes such as GDS and AttributeName. This is my current method of getting this type of data from the dataset:- Example &#8211; TanimotoCoefficient of a &#8220;sim&#8221; relation: int i = 0; for (ONDEXRelation r : relationview) {]]></description>
			<content:encoded><![CDATA[<p>Did not know how to get attributes from the ondex data, researched into it by looking at various classes such as <code>GDS</code> and <code>AttributeName</code>. This is my current method of getting this type of data from the dataset:-</p>
<p><strong>Example &#8211; TanimotoCoefficient of a &#8220;sim&#8221; relation:</strong></p>
<p><code><br />
int i = 0;<br />
for (ONDEXRelation r : relationview) {<br />
// Get an attribute test;<br />
AttributeName an = graph.getMetaData().getAttributeName("TanimotoCoefficient");<br />
GDS gds = r.getGDS(an);<br />
System.out.println(i + "-" + gds.getValue());<br />
i++;<br />
}<br />
</code></p>
<p>General terminal output seems to be done like this in other classes :-<br />
<code>fireEventOccurred(new GeneralOutputEvent("Unknown Relation type set :" + rts + " ignoring", "[Filter - start]"));</code></p>
<p><strong>Thoughts :-</strong></p>
<p>Store the semantic motifs as ONDEXGraph&#8217;s in a Set.</p>
<p>Create these subgraphs using the <code>drug1</code>, <code>drug2</code>, <code>drug1Relations</code> and <code>drug2Relations</code>, used in the code to count motifs. Does <code>ONDEXViewFunctions</code> provide this? (.and, .andNot and .or) Do these automatically give the concepts when &#8220;and&#8217;ing&#8221; to ONDEXView&#8217;s of relations.</p>
<p>- But only if it is a &#8216;<abbr title="What defines a match?">match</abbr>&#8216;.</p>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/06/01/getting-data-from-relations-and-concepts-in-ondex/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Previous Code</title>
		<link>http://bio.adking.co.uk/2010/05/30/test/</link>
		<comments>http://bio.adking.co.uk/2010/05/30/test/#comments</comments>
		<pubDate>Sun, 30 May 2010 13:56:22 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Project]]></category>
		<category><![CDATA[Code]]></category>
		<category><![CDATA[Fixed]]></category>
		<category><![CDATA[Motifs]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=59</guid>
		<description><![CDATA[The previous code used to count the abstract semantic motif within the data has been modified and extended so now it correctly functions. (i.e. coded in the latest version of java, new ondex functions, ondex deprecated methods etc&#8230;). Testing gave :- 157 semantic motifs for the cutdown_set.xml.gz dataset. 26,693 sematnic motifs for the ib2010_data.xml.gz dataset.]]></description>
			<content:encoded><![CDATA[<p>The previous code used to count the abstract semantic motif within the data has been modified and extended so now it correctly functions. (i.e. coded in the latest version of java, new ondex functions, ondex deprecated methods etc&#8230;).</p>
<p>Testing gave :-</p>
<ul>
<li>157 semantic motifs for the cutdown_set.xml.gz dataset.</li>
<li>26,693 sematnic motifs for the ib2010_data.xml.gz dataset.</li>
</ul>
<p>Which is the same as what the paper stated, showing the newest code is working correctly.</p>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/05/30/test/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Monday</title>
		<link>http://bio.adking.co.uk/2010/05/24/monday/</link>
		<comments>http://bio.adking.co.uk/2010/05/24/monday/#comments</comments>
		<pubDate>Mon, 24 May 2010 08:37:43 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Project]]></category>
		<category><![CDATA[Java]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=45</guid>
		<description><![CDATA[Browsed other Ondex filter classes for solutions to the iterator problem: Exception in thread "main" java.lang.NoSuchMethodError: net.sourceforget.ondex.core.ONDEXView.iterator()Ljava/util/Iterator; Tried a number of methods but still getting the Iterator error no matter what. Trying a Set/HashSet approach.]]></description>
			<content:encoded><![CDATA[<p>Browsed other Ondex filter classes for solutions to the iterator problem:</p>
<blockquote><p><code>Exception in thread "main" java.lang.NoSuchMethodError: net.sourceforget.ondex.core.ONDEXView.iterator()Ljava/util/Iterator;</code></p></blockquote>
<p>Tried a number of methods but still getting the Iterator error no matter what.</p>
<p>Trying a Set/HashSet approach.</p>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/05/24/monday/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Ondex Dataset</title>
		<link>http://bio.adking.co.uk/2010/05/20/ondex-dataset/</link>
		<comments>http://bio.adking.co.uk/2010/05/20/ondex-dataset/#comments</comments>
		<pubDate>Thu, 20 May 2010 14:09:01 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Project]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=42</guid>
		<description><![CDATA[ConceptClasses Compound (Chemical Compound (used as drug)) Disease (Human Diseases) EC Protein Publication Target (Target of Compound concept) RelationTypes bi_to (binds to) cat_c (catalysing class) inv_in (involved in) h_s_s is_a it_wi (interacts with) ﻿pub_in (published in) r (is related to) sim Data Sources UC (Unclassified) UC:UNIPROTKB (Collapsed CV Set) UNIPROTKB Evidence Types ACC (Accession based]]></description>
			<content:encoded><![CDATA[<p><strong>ConceptClasses</strong></p>
<ul>
<li>Compound <em>(Chemical Compound (used as drug))</em></li>
<li>Disease <em>(Human Diseases)</em></li>
<li>EC</li>
<li>Protein</li>
<li>Publication</li>
<li>Target <em>(Target of Compound concept)</em></li>
</ul>
<p><strong>RelationTypes</strong></p>
<ul>
<li>bi_to (<em>binds to</em>)</li>
<li>cat_c (<em>catalysing class</em>)</li>
<li>inv_in (<em>involved in</em>)</li>
<li>h_s_s</li>
<li>is_a</li>
<li>it_wi (<em>interacts with</em>)</li>
<li>﻿pub_in (<em>published in</em>)</li>
<li>r (<em>is related to</em>)</li>
<li>sim</li>
</ul>
<p><strong>Data Sources</strong></p>
<ul>
<li>UC (<em>Unclassified</em>)</li>
<li>UC:UNIPROTKB (<em>Collapsed CV Set</em>)</li>
<li>UNIPROTKB<strong><br />
</strong></li>
</ul>
<p><strong>Evidence Types</strong></p>
<ul>
<li>ACC (<em>Accession based mapping</em>)</li>
<li>BLAST (<em>BLAST based mapping</em>)</li>
<li>IMPD (<em>Imported from database</em>)</li>
<li>IMPD_MANUALLY_CURATED</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/05/20/ondex-dataset/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ondex</title>
		<link>http://bio.adking.co.uk/2010/05/19/ondex/</link>
		<comments>http://bio.adking.co.uk/2010/05/19/ondex/#comments</comments>
		<pubDate>Wed, 19 May 2010 13:11:02 +0000</pubDate>
		<dc:creator>Andrew</dc:creator>
				<category><![CDATA[Ondex]]></category>
		<category><![CDATA[Project]]></category>

		<guid isPermaLink="false">http://bio.adking.co.uk/?p=10</guid>
		<description><![CDATA[Downloaded latest version of the Ondex Visualisation Suite (May 2010). Discovered the compound Dutasteride within the dataset. After using the neighbourhood filter, the results showed that it is linked with Finasteride and Azelaic Acid (hair growth stimulant) are linked to treatment of benign prostatic hyperplasia (BPH) and off-label treatment of male pattern baldness (MPB).]]></description>
			<content:encoded><![CDATA[<p>Downloaded latest version of the <a title="Ondex" href="http://www.ondex.org" target="_blank">Ondex</a> Visualisation Suite (May 2010).<br />
Discovered the compound Dutasteride within the dataset. After using the neighbourhood filter, the results showed that it is linked with Finasteride and Azelaic Acid (hair growth stimulant) are linked to treatment of benign prostatic hyperplasia (BPH) and off-label treatment of male pattern baldness (MPB).</p>
<div id="attachment_9" class="wp-caption aligncenter" style="width: 310px"><a href="http://bio.adking.co.uk/wp-content/uploads/2010/05/Capture_Dutasteride.png"><img class="size-medium wp-image-9" title="Dutasteride" src="http://bio.adking.co.uk/wp-content/uploads/2010/05/Capture_Dutasteride-300x151.png" alt="Dusteride" width="300" height="151" /></a><p class="wp-caption-text">Ondex Network Visualisation</p></div>
]]></content:encoded>
			<wfw:commentRss>http://bio.adking.co.uk/2010/05/19/ondex/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

