<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Freelancing science &#187; Software</title>
	<atom:link href="http://freelancingscience.com/category/software/feed/" rel="self" type="application/rss+xml" />
	<link>http://freelancingscience.com</link>
	<description>visualization, protein science, open science and freelancing science</description>
	<lastBuildDate>Thu, 08 Apr 2010 21:36:42 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='freelancingscience.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/ca6331e4ebe8b5e624ddfd24badb4473?s=96&#038;d=http://s2.wp.com/i/buttonw-com.png</url>
		<title>Freelancing science &#187; Software</title>
		<link>http://freelancingscience.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://freelancingscience.com/osd.xml" title="Freelancing science" />
	<atom:link rel='hub' href='http://freelancingscience.com/?pushpress=hub'/>
		<item>
		<title>Basket as a writing tool, SCAN as a collector</title>
		<link>http://freelancingscience.com/2009/11/03/basket-as-a-writing-tool-scan-as-a-collector/</link>
		<comments>http://freelancingscience.com/2009/11/03/basket-as-a-writing-tool-scan-as-a-collector/#comments</comments>
		<pubDate>Tue, 03 Nov 2009 16:25:11 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Basket]]></category>
		<category><![CDATA[Metadata]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[SCAN]]></category>
		<category><![CDATA[Workflow]]></category>

		<guid isPermaLink="false">http://freelancingscience.com/?p=501</guid>
		<description><![CDATA[Basket has been my favourite notetaking software for a long time, until I had switched to mindmaps. Quite recently I&#8217;ve discovered another use for it &#8211; a writing aid. Basket in one-column mode allows to rearrange your notes just by dragging them up or down (there&#8217;re keyboard shortcuts for that as well). When I&#8217;m writing [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=501&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p><a title="Basket" href="http://basket.kde.org/">Basket</a> has been my favourite notetaking software for a long time, until I had switched to mindmaps. Quite recently I&#8217;ve discovered another use for it &#8211; a writing aid. Basket in one-column mode allows to rearrange your notes just by dragging them up or down (there&#8217;re keyboard shortcuts for that as well). When I&#8217;m writing a longer piece, I don&#8217;t need to hold a structure of the article in my head. I just collect all the pieces (quotes, blog posts fragments, my own notes, links, tweets etc.)  and then rearrange it as much as it&#8217;s needed. When the flow of the thoughts is optimal, I start to connect these pieces by writing some text in between <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
<p><img class="aligncenter size-full wp-image-502" title="snapshot2" src="http://freesci.files.wordpress.com/2009/11/snapshot2.png?w=500&#038;h=507" alt="snapshot2" width="500" height="507" /></p>
<p>I don&#8217;t have <a class="zem_slink" title="DEVONthink" rel="homepage" href="http://www.devon-technologies.com/products/devonthink/">DevonThink</a> (I don&#8217;t have Mac) but for finding similar things in my archive I use <a title="About SCAN" href="http://scan.sourceforge.net/?page_id=2">SCAN</a>. SCAN can aggregate content from a number of sources (it has plugins to read PDFs, OpenOffice and MSOffice files or even RSS feeds), analyze it, automatically assign tags, extract metadata etc. It has <a class="zem_slink" title="Lucene" rel="wikipedia" href="http://en.wikipedia.org/wiki/Lucene">Lucene</a> engine built in and does quite a good job of finding related pieces in the archive. It&#8217;s quite buggy, doesn&#8217;t read all PDFs (such as encrypted), metadata extraction doesn&#8217;t work as expected but overall the tool has a potential (and there&#8217;s no similar program available on Linux platform anyway). Its development was recently restarted so there&#8217;s hope it&#8217;s going to be improved in a near future. Additionally, it has a nice eye-candy &#8211; a visual overview of relations between tags.</p>
<p><img class="aligncenter size-full wp-image-503" title="snapshot3" src="http://freesci.files.wordpress.com/2009/11/snapshot3.png?w=500&#038;h=192" alt="snapshot3" width="500" height="192" />This strategy is similar to the <a title="Stevem Johnson on writing workflow" href="http://www.stevenberlinjohnson.com/movabletype/archives/000230.html">workflow described by Steven Johnson</a>, but without DevonThink. So far I haven&#8217;t found anything better under Linux, but probably I need to check online apps &#8211; things do change every month.</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/e9e220cb-1b0a-4e0f-b538-69cdae02147f/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=e9e220cb-1b0a-4e0f-b538-69cdae02147f" alt="Reblog this post [with Zemanta]" /> </a></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/501/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/501/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/501/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/501/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/501/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/501/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/501/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/501/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/501/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/501/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=501&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2009/11/03/basket-as-a-writing-tool-scan-as-a-collector/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2009/11/snapshot2.png" medium="image">
			<media:title type="html">snapshot2</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2009/11/snapshot3.png" medium="image">
			<media:title type="html">snapshot3</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=e9e220cb-1b0a-4e0f-b538-69cdae02147f" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Dreaming about bio-spreadsheet</title>
		<link>http://freelancingscience.com/2009/05/19/dreaming-about-bio-spreadsheet/</link>
		<comments>http://freelancingscience.com/2009/05/19/dreaming-about-bio-spreadsheet/#comments</comments>
		<pubDate>Tue, 19 May 2009 06:10:02 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[bioinformatics]]></category>

		<guid isPermaLink="false">http://freelancingscience.com/?p=459</guid>
		<description><![CDATA[One of the often occuring task in my work is to present results of an analysis in some kind of table. I have used for such purpose quite a number of approaches, starting from generating simple HTML file, through fetching of SQL data into table stored in a wiki, up to using Rails. One of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=459&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>One of the often occuring task in my work is to present results of an analysis in some kind of table. I have used for such purpose quite a number of approaches, starting from generating simple HTML file, through fetching of SQL data into table stored in a wiki, up to using Rails. One of the dreams I have recently is a web-based spreadsheet that would allow me to apply some specific piece of code over every row/column and show resulting table.</p>
<p><a href="http://freesci.files.wordpress.com/2009/05/screenshot.png"><img class="aligncenter size-full wp-image-460" title="Screenshot" src="http://freesci.files.wordpress.com/2009/05/screenshot.png?w=600&#038;h=274" alt="Screenshot" width="600" height="274" /></a>A simple mockup is shown above. In this example, a code:</p>
<pre>print " &amp;lt;img src="http://www.pdb.org/pdb/images/#{column_1}_bio_r_250.jpg&amp;gt;"</pre>
<p>&#8230; iterated over first column containing PDB codes, would substitute these codes with an image of a protein from PDB server.</p>
<p>In other words I dream about simple (single file would be the best &#8211; I like the approach <a title="Sinatra Framework" href="http://www.sinatrarb.com/">Sinatra framework</a> is taking) web-based programmable spreadsheet. Something like <a title="Resolver One" href="http://www.resolversystems.com/products/">Resolver One</a>, but simpler. Is there anything like that available?</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/459/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/459/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/459/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/459/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/459/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/459/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/459/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/459/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/459/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/459/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=459&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2009/05/19/dreaming-about-bio-spreadsheet/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2009/05/screenshot.png" medium="image">
			<media:title type="html">Screenshot</media:title>
		</media:content>
	</item>
		<item>
		<title>HMMER3 testing notes &#8211; my skills are (finally) becoming obsolete</title>
		<link>http://freelancingscience.com/2009/04/22/hmmer3-testing-notes-my-skills-are-finally-becoming-obsolete/</link>
		<comments>http://freelancingscience.com/2009/04/22/hmmer3-testing-notes-my-skills-are-finally-becoming-obsolete/#comments</comments>
		<pubDate>Wed, 22 Apr 2009 06:29:36 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[biology]]></category>
		<category><![CDATA[PFAM]]></category>
		<category><![CDATA[HMM]]></category>
		<category><![CDATA[HMMER]]></category>

		<guid isPermaLink="false">http://freelancingscience.com/?p=399</guid>
		<description><![CDATA[Image via Wikipedia It&#8217;s already quite a while since I&#8217;ve started to extensively test performance of HMMER3. As many other people noticed before, speed of the search has improved dramatically &#8211; I&#8217;m really impressed how fast it is. However, it&#8217;s only part of the story. The smaller part actually. As some of readers may know, [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=399&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="display:block;margin:1em;">
<div>
<dl class="wp-caption alignright">
<dt class="wp-caption-dt"><a href="http://commons.wikipedia.org/wiki/Image:HiddenMarkovModel.png"><img title="Hidden Markov Model with Output" src="http://upload.wikimedia.org/wikipedia/commons/thumb/2/2e/HiddenMarkovModel.png/200px-HiddenMarkovModel.png" alt="Hidden Markov Model with Output" width="200" height="160" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution">Image via <a href="http://commons.wikipedia.org/wiki/Image:HiddenMarkovModel.png">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>It&#8217;s already quite a while since I&#8217;ve started to extensively test performance of <a title="HMMER" href="http://hmmer.janelia.org/">HMMER3</a>. As many other people noticed before, speed of the search has improved dramatically &#8211; I&#8217;m really impressed how fast it is. However, it&#8217;s only part of the story. The smaller part actually.</p>
<p>As some of readers may know, most of my projects so far were revolving around protein sequence analysis and sequence-structure relationships. Mainly I was doing analysis of sequences that had no clear similarity to anything known, without functional annotation. Usual task was to run sequence comparison software and look at the end of the hit list, trying to make sense from hits beyond any reasonable E-value thresholds (for example I often run BLAST at E-value of 100 or 1000). I use very limited number of tools, because it takes quite a while to understand on which specific patterns a particular software fails.</p>
<p>The high-end tool I use most often is HHpred &#8211; HMM-HMM comparison software. It&#8217;s slow but very sensitive &#8211; my personal benchmarks show that it is able to identify very subtle patterns in sequence formed slightly above level of similar secondary structures (in other words, from the set of equally dissimilar sequences with identical secondary structure order, it correctly identifies the ones with similar tertiary structure).</p>
<p>The most surprising thing about HMMER3 is that in my personal benchmarks it&#8217;s almost as sensitive as HHpred. I wasn&#8217;t expecting that HMM-sequence comparison can be as good as HMM-HMM.  This observation suggests that there&#8217;s still a room for improvement for the latter approach, however it has already big implications.</p>
<p>PFAM will soon migrate to HMMER3 (t<a title="PFAM - migrating to HMMER3" href="http://xfam.wordpress.com/2009/03/19/hmmer3-migration-resolving-overlaps/">he PFAM team is now resolving overlaps between families</a> that arose due to increased sensitivity) and <strong>the moment it is be available, it will make a huge number of publications obsolete, or simply wrong.</strong> There are thousands of articles that discuss in detail evolutionary history of some particular domain (many of these will become obsolete) or draw some conclusions from the observation that some domain <strong>is not present</strong> in analyzed sequence/system (many of these will need to be revised). It will also make my skills quite obsolete, but that is always to be expected, no matter in what branch of science one is working. I also imagine that systems biology people will be very happy to have much better functional annotation of proteins.</p>
<p>I don&#8217;t want to call development of HMMER3 a revolution, but it will definitely have similar impact on biology as BLAST and HMMER2 had. Not only because of its speed, but also because it will create a picture of similarities between <strong>all</strong> proteins comparable to the picture state-of-the-art methods could only calculate for their small subset.</p>
<h6 class="zemanta-related-title" style="font-size:1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://mndoci.com/blog/2009/02/06/the-curse-of-blast/">The curse of BLAST</a> (mndoci.com)</li>
</ul>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Reblog this post [with Zemanta]" href="http://reblog.zemanta.com/zemified/a1d1e004-af00-46db-b774-7a3dd2903257/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=a1d1e004-af00-46db-b774-7a3dd2903257" alt="Reblog this post [with Zemanta]" /></a></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/399/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/399/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/399/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=399&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2009/04/22/hmmer3-testing-notes-my-skills-are-finally-becoming-obsolete/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://upload.wikimedia.org/wikipedia/commons/thumb/2/2e/HiddenMarkovModel.png/200px-HiddenMarkovModel.png" medium="image">
			<media:title type="html">Hidden Markov Model with Output</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=a1d1e004-af00-46db-b774-7a3dd2903257" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Database query and ranked results</title>
		<link>http://freelancingscience.com/2009/01/22/database-query-and-fuzzy-answer/</link>
		<comments>http://freelancingscience.com/2009/01/22/database-query-and-fuzzy-answer/#comments</comments>
		<pubDate>Thu, 22 Jan 2009 19:21:27 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Data mining]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[Database]]></category>
		<category><![CDATA[PubMed]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Web search engine]]></category>

		<guid isPermaLink="false">http://freesci.wordpress.com/?p=67</guid>
		<description><![CDATA[Image via Wikipedia Already some time ago I&#8217;ve  read a piece by Marcelo Calbucci: Is it a database or a search engine?. While it deals with search information within a real estate database, I think his comments are applicable in the many areas of life sciences. In short, Marcelo points out that people miss a [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=67&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="float:right;display:block;margin:1em;">
<div>
<dl class="wp-caption">
<dt class="wp-caption-dt"><a href="http://en.wikipedia.org/wiki/Image:Autophagy500.jpg"><img title="The Autophagy network extracted from the recen..." src="http://upload.wikimedia.org/wikipedia/en/thumb/f/f1/Autophagy500.jpg/202px-Autophagy500.jpg" alt="The Autophagy network extracted from the recen..." width="202" height="138" /></a></dt>
<dd class="wp-caption-dd zemanta-img-attribution">Image via <a href="http://en.wikipedia.org/wiki/Image:Autophagy500.jpg">Wikipedia</a></dd>
</dl>
</div>
</div>
<p>Already some time ago I&#8217;ve  read a piece by Marcelo Calbucci:<a title="Is it a database or a search engine" href="http://marcelo.sampasite.com/marcelo-calbucci/brave-tech-world/redfin-dilemma-is-it-a-database.htm"> Is it a database or a search engine?</a>. While it deals with search information within a real estate database, I think his comments are applicable in the many areas of life sciences.</p>
<p>In short, Marcelo points out that people miss a lot of interesting entries while looking for a house, because of inflexibility of the query; number of bedrooms, price, distance from some point &#8211; these are all set. However, users are flexible and in such case need rather a search engine that gives them close enough answer or allows to specify weight to each filter.</p>
<p>In life sciences we do search for similarities and analogies all the time. Sometimes it&#8217;s direct comparison of sequences, on other occasion is high-level meta-comparison between two systems. And while we have various (statistical) metrics of similarities and they sometimes become a part of a database designs, interfaces of biological databases don&#8217;t allow to rank query results according to these metrics. For example I can easily find all human proteins related to disease X or disease Y or disease Z, although I cannot specify that I want proteins related to Z AND Y first on the list. Other example would be searching PubMed &#8211; I can look for articles related to &#8220;synthetic biology&#8221;, but I have no way to specify, that I want papers by <a class="zem_slink" title="James Collins (Boston University)" rel="wikipedia" href="http://en.wikipedia.org/wiki/James_Collins_%28Boston_University%29">James Collins</a> from <a class="zem_slink" title="Howard Hughes Medical Institute" rel="homepage" href="http://www.hhmi.org/">HHMI</a> AND articles related to these papers to be first on the list. I guess it is possible to obtain such results without going through the whole list, but I doubt the method will be very simple. Filtering still seems to be neglected aspect of database design in life sciences.</p>
<p>My dream biological search engine would have a series of sliders (or ideally, I would like to have a device with series of mechanical knobs attached to the computer) and would allow me to dynamically change weights of various aspects of the query and see immediately how it affects the results. It would be something resembling interactivity of <a title="Gapminder" href="http://www.gapminder.org/">Gapminder World</a>, but on dynamically generated data. Technology and proof of concept seems to be there, but I guess we need to wait quite a few years before this approach will be adopted within life sciences.</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/f13bfa4e-0535-4863-919e-33445b3b4ffa/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=f13bfa4e-0535-4863-919e-33445b3b4ffa" alt="Reblog this post [with Zemanta]" /></a></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/67/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/67/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/67/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=67&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2009/01/22/database-query-and-fuzzy-answer/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://upload.wikimedia.org/wikipedia/en/thumb/f/f1/Autophagy500.jpg/202px-Autophagy500.jpg" medium="image">
			<media:title type="html">The Autophagy network extracted from the recen...</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=f13bfa4e-0535-4863-919e-33445b3b4ffa" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Collanos Workplace and scientific collaboration</title>
		<link>http://freelancingscience.com/2009/01/15/collanos-workplace-and-scientific-collaboration/</link>
		<comments>http://freelancingscience.com/2009/01/15/collanos-workplace-and-scientific-collaboration/#comments</comments>
		<pubDate>Thu, 15 Jan 2009 16:25:19 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Research skills]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Google apps]]></category>
		<category><![CDATA[Office suite]]></category>
		<category><![CDATA[Peer-to-peer]]></category>
		<category><![CDATA[Project management]]></category>

		<guid isPermaLink="false">http://freelancingscience.com/?p=303</guid>
		<description><![CDATA[For some time already I was looking for a tool that would eliminate a need for sending files back and forth between people collaborating on a the same project. While I&#8217;m perfectly aware of various solutions such as wikis, version control systems or online office suites, I didn&#8217;t feel like I could convince my collaborators [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=303&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div id="attachment_304" class="wp-caption aligncenter" style="width: 460px"><img class="size-full wp-image-304" title="blog-0016" src="http://freesci.files.wordpress.com/2009/01/blog-0016.png?w=450&#038;h=263" alt="One of my woskpaces in Collanos" width="450" height="263" /><p class="wp-caption-text">One of my workspaces in Collanos</p></div>
<p>For some time already I was looking for a tool that would eliminate a need for sending files back and forth between people collaborating on a the same project. While I&#8217;m perfectly aware of various solutions such as wikis, version control systems or online <a class="zem_slink" title="Office suite" rel="wikipedia" href="http://en.wikipedia.org/wiki/Office_suite">office suites</a>, I didn&#8217;t feel like I could convince my collaborators to use any of these. One of the reasons is always a feeling of insecurity when using publicly hosted platform (BTW, this is not that uncommon among scientists &#8211; I know at least one scientific institution in Western Europe that explicitly forbids using Google apps, especially Gmail for work-related stuff, because of Google&#8217;s privacy policy). The other reason was that such solutions are not the best choice when working on binary files (most of my projects do not involve collaborative programming). When I stumbled across <a class="zem_slink" title="Collanos" rel="homepage" href="http://www.collanos.com/">Collanos Workplace</a>, which offers peer-to-peer synchronization (although without revision control), instead of a central-server based, I&#8217;ve decided to give it a try. For the last couple of weeks I&#8217;ve been using Collanos to collaborate on one relatively simple project and the experience was quite positive.</p>
<p>At first, I thought that Collanos may serve mainly as a tool for secure peer-to-peer files sharing with an information who changed what etc. It turned out that this is a capable project management application, that has a chat and discussion panel, one can post notes, links add tasks and assign them to team members. Files are stored is a separate directory &#8211; after one adds a file to Collanos, it should be opened from the application, not from original folder. This seemed a mistake in design at first, but I appreciated it very quickly. Synchronization of project directory would mean sharing all of its contents and that can be sometimes in the range of many GBs. From time to time some bug appeared here or there, but overall it worked as expected. Peer-to-peer sharing means that both people have to be online for synchronization, but so far situation that I switched computer off before a person could download my changes happened only once and it was during a weekend.</p>
<p>As a side note, it&#8217;s nice to see that Eclipse becomes an application platform for quite a number of programs. See for example this <a title="Eclipse-based software" href="http://en.wikipedia.org/wiki/List_of_Eclipse-based_software">list of Eclipse-based software</a>.</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/5015c7c3-e759-42ec-8ff1-e6ed13b98cad/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=5015c7c3-e759-42ec-8ff1-e6ed13b98cad" alt="Reblog this post [with Zemanta]" /></a></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/303/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/303/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/303/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/303/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/303/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/303/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/303/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/303/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/303/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/303/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=303&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2009/01/15/collanos-workplace-and-scientific-collaboration/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2009/01/blog-0016.png" medium="image">
			<media:title type="html">blog-0016</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=5015c7c3-e759-42ec-8ff1-e6ed13b98cad" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Bioinformatics is a visual analytics (sometimes)</title>
		<link>http://freelancingscience.com/2008/12/18/bioinformatics-is-a-visual-analytics-sometimes/</link>
		<comments>http://freelancingscience.com/2008/12/18/bioinformatics-is-a-visual-analytics-sometimes/#comments</comments>
		<pubDate>Thu, 18 Dec 2008 14:20:52 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Proteins]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[Visualization]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[biology]]></category>
		<category><![CDATA[Chimera]]></category>
		<category><![CDATA[Cytoscape]]></category>
		<category><![CDATA[Online Services]]></category>
		<category><![CDATA[protein]]></category>
		<category><![CDATA[Protein family]]></category>
		<category><![CDATA[Visual analytics]]></category>

		<guid isPermaLink="false">http://freelancingscience.com/?p=266</guid>
		<description><![CDATA[Short description of my research interest is &#8220;I do proteins&#8221; (I took this phrase from my friend Ana). I try to figure out what particular protein, protein family, or set of proteins does in the wider context. Usually I start where automated methods have ended &#8211; I have all kinds of annotation so I try [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=266&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p style="text-align:left;">Short description of my research interest is &#8220;I do proteins&#8221; (I took this phrase from <a title="Ana Rojas" href="http://idoproteins.com/">my friend Ana</a>). I try to figure out what particular protein, protein family, or set of proteins does in the wider context. Usually I start where automated methods have ended &#8211; I have all kinds of annotation so I try to put data together and form some hypothesis. I recently realized that the process is basically visualizing different kind of data &#8211; or rather looking at the same issue from many different perspectives.</p>
<p style="text-align:left;">It starts with alignments. Lots of alignments. And they all end up in different forms of visual representation. Sometimes it&#8217;s a conservation with secondary structure prediction (with <a title="Alignment Viewer" href="http://toolkit.tuebingen.mpg.de/alnviz">AlignmentViewer</a> or <a title="Jalview" href="http://www.jalview.org/">Jalview</a>):</p>
<p style="text-align:center;"><img class="size-full wp-image-272 aligncenter" title="blog-0005" src="http://freesci.files.wordpress.com/2008/12/blog-0001.png?w=450&#038;h=127" alt="blog-0005" width="450" height="127" /></p>
<p style="text-align:left;">Sometimes I look for transmembrane beta-barrels (with <a title="ProfTMB" href="http://cubic.bioc.columbia.edu/services/proftmb/">ProfTMB</a>):</p>
<p style="text-align:center;"><img class="size-full wp-image-272 aligncenter" title="blog-0005" src="http://freesci.files.wordpress.com/2008/12/blog-0002.png?w=450&#038;h=127" alt="blog-0005" width="450" height="127" /></p>
<p style="text-align:left;">Sometimes I try to find a pattern in hydrophobicity and side-chain size values across the alignment (<a title="Aln2Plot" href="http://toolkit.tuebingen.mpg.de/aln2plot">Aln2Plot</a>):</p>
<p style="text-align:center;"><img class="size-full wp-image-272 aligncenter" title="blog-0005" src="http://freesci.files.wordpress.com/2008/12/blog-0003.png?w=450&#038;h=127" alt="blog-0005" width="450" height="127" /></p>
<p style="text-align:left;">Afterwards I seek for patterns and interesting correlations in domain organization (<a title="PFAM" href="http://pfam.sanger.ac.uk/">PFAM</a>, <a title="SMART" href="http://smart.embl-heidelberg.de/">Smart</a>):</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-275" title="blog-0008" src="http://freesci.files.wordpress.com/2008/12/blog-0008.png?w=450&#038;h=137" alt="blog-0008" width="450" height="137" /></p>
<p style="text-align:left;">Sometimes I map all these findings onto a structure or a model that I make somewhere in the meantime based on found data (<a title="Pymol" href="http://pymol.sourceforge.net/">Pymol</a>, <a title="VMD" href="http://www.ks.uiuc.edu/Research/vmd/">VMD</a>, <a title="Chimera" href="http://www.cgl.ucsf.edu/chimera/">Chimera</a>):</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-273" title="blog-0006" src="http://freesci.files.wordpress.com/2008/12/blog-0006.png?w=450&#038;h=271" alt="blog-0006" width="450" height="271" /></p>
<p style="text-align:left;">I also try to make sense out of genomic context (works for eukaryotic organisms as well &#8211; <a title="The SEED" href="http://www.theseed.org/wiki/Main_Page">The SEED</a>):</p>
<p style="text-align:center;"><img class="size-full wp-image-272 aligncenter" title="blog-0005" src="http://freesci.files.wordpress.com/2008/12/blog-0004.png?w=450&#038;h=127" alt="blog-0005" width="450" height="127" /></p>
<p style="text-align:left;">I investigate how the proteins cluster together according to their similarity (<a title="CLANS - cluster proteins in Java" href="http://bioinfoserver.rsbs.anu.edu.au/programs/clans/">CLANS</a>):</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-288" title="blog-0013" src="http://freesci.files.wordpress.com/2008/12/blog-0013.png?w=450&#038;h=177" alt="blog-0013" width="450" height="177" /></p>
<p style="text-align:left;">And figure out how the protein or the system I&#8217;m studying fits into interaction or metabolic networks (<a class="zem_slink" title="Cytoscape" rel="homepage" href="http://www.cytoscape.org/">Cytoscape</a>, <a title="Medusa Network Viewer" href="http://www.bork.embl.de/medusa/">Medusa</a>, <a title="STRING" href="http://string.embl.de/">STRING</a>, <a title="STITCH" href="http://stitch.embl.de/">STITCH</a>):</p>
<p style="text-align:center;"><img class="aligncenter size-full wp-image-274" title="blog-0007" src="http://freesci.files.wordpress.com/2008/12/blog-0007.png?w=450&#038;h=190" alt="blog-0007" width="450" height="190" /></p>
<p style="text-align:left;">If there&#8217;s some additional numerical information I dump it into analysis software (<a title="R" href="http://www.r-project.org/">R</a>, for simpler things <a title="DiVisa" href="http://math.nist.gov/mcsd/savg/software/divisa/">DiVisa</a>):</p>
<p style="text-align:center;"><img class="size-full wp-image-272 aligncenter" title="blog-0005" src="http://freesci.files.wordpress.com/2008/12/blog-0000.png?w=450&#038;h=260" alt="blog-0005" width="450" height="260" /></p>
<p style="text-align:center;">
<p style="text-align:center;">
<p style="text-align:center;">
<p>And I make note along the process in the form of a mindmap (<a title="Freemind" href="http://freemind.sourceforge.net/wiki/index.php/Main_Page">Freemind</a>, recently switched to <a href="http://xmind.net">Xmind</a>, because it allows to store attachments and images in the mindmap file, not just link to them like Freemind does):<img class="aligncenter size-full wp-image-277" title="blog-0010" src="http://freesci.files.wordpress.com/2008/12/blog-0010.png?w=450&#038;h=265" alt="blog-0010" width="450" height="265" /></p>
<p>So it turns out that I mainly do visual analytics. I spend considerable amount of time on preparing various representations of biological data and then the rest of the time I look at the pictures. While that&#8217;s not something every bioinformatician does, many of my colleagues have their own workflows that also rely heavily on pictures. For some areas it&#8217;s more prominent, for others it&#8217;s not, but the fact is that pictures are everywhere.</p>
<p>There are two reasons I use manual workflow with lots looking at intermediate results: I work with weak signals (for example, sometimes I need to run BLAST at E-value of 1000) or I need to deeply understand the system I study. Making connections between two seemingly unrelated biological entities requires wrapping one&#8217;s brain around the problem and&#8230; lots of looking at it.</p>
<p>And here comes the frustration. I counted that I use more than twenty (!) different programs for visualization. And even if I&#8217;m enjoying monitor setup 4500 pixels wide which is almost enough to put all that data onto screen, the main issue is that the software isn&#8217;t connected. AlignmentViewer cannot adjust its display automatically based on the domain I&#8217;m looking at or a network node I&#8217;m investigating &#8211; I need to do it by myself. Of course I can couple alignments and structure in Jalview, Chimera or VMD but I don&#8217;t find such solution to be usable on the long run. To have the best of all worlds, I need to juggle all these applications.</p>
<p>I&#8217;ve been longing for some time already for a generic visualization platform that is able to show 2D and 3D data within the single environment, so I follow development of <a title="Second Life Data Visualization" href="http://sldataviz.pbwiki.com/">SecondLife visualization environment</a> and <a title="Open Croquet" href="http://www.opencroquet.org/index.php/Main_Page">Croquet/Cobalt</a> initiatives. While these don&#8217;t look very exciting right now, I hope they will provide a common platform for different visualization methods (and of course visual collaboration environment).</p>
<p>But to be realistic, visual analytics in biology is not going to become a mainstream. It&#8217;s far more efficient to improve algorithms for multidimensional data analysis than to spend more time looking at pictures. I had already few such situations when I could see some weak signal and in a year or two it became obvious. But I&#8217;m still going to enjoy scientific visualization. I came to science for aesthetic reasons after all. <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/19241e88-7cad-4095-b0ff-9e73770c85c2/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=19241e88-7cad-4095-b0ff-9e73770c85c2" alt="Reblog this post [with Zemanta]" /></a></div>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/266/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=266&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2008/12/18/bioinformatics-is-a-visual-analytics-sometimes/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0001.png" medium="image">
			<media:title type="html">blog-0005</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0002.png" medium="image">
			<media:title type="html">blog-0005</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0003.png" medium="image">
			<media:title type="html">blog-0005</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0008.png" medium="image">
			<media:title type="html">blog-0008</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0006.png" medium="image">
			<media:title type="html">blog-0006</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0004.png" medium="image">
			<media:title type="html">blog-0005</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0013.png" medium="image">
			<media:title type="html">blog-0013</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0007.png" medium="image">
			<media:title type="html">blog-0007</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0000.png" medium="image">
			<media:title type="html">blog-0005</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/12/blog-0010.png" medium="image">
			<media:title type="html">blog-0010</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=19241e88-7cad-4095-b0ff-9e73770c85c2" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Skyrails and STRING</title>
		<link>http://freelancingscience.com/2008/09/09/skyrails-and-string/</link>
		<comments>http://freelancingscience.com/2008/09/09/skyrails-and-string/#comments</comments>
		<pubDate>Tue, 09 Sep 2008 10:08:37 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[Visualization]]></category>
		<category><![CDATA[Graph]]></category>
		<category><![CDATA[Information Visualization]]></category>
		<category><![CDATA[Skyrails]]></category>
		<category><![CDATA[STRING]]></category>

		<guid isPermaLink="false">http://freesci.wordpress.com/?p=191</guid>
		<description><![CDATA[Of course I couldn&#8217;t resist not to play a little bit with Skyrails after I saw it at Flowing Data blog. Skyrails is a graph visualization system that was designed with expandability and awesome look in mind. All menus can be programmed in odd-looking, but quite easy to learn language, which helps in writing customized [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=191&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>Of course I couldn&#8217;t resist not to play a little bit with <a title="Skyrails" href="http://cgi.cse.unsw.edu.au/~wyos/skyrails/">Skyrails</a> after <a title="Flowing Data on Skyrails" href="http://flowingdata.com/2008/09/08/interactive-graph-visualization-system-skyrails/">I saw it at Flowing Data blog</a>. Skyrails is a graph visualization system that was designed with expandability and awesome look in mind. All menus can be programmed in odd-looking, but quite easy to learn language, which helps in writing customized interface to particular data.</p>
<p>My quick attempt was to take some sample data from <a title="STRING" href="http://string.embl.de/">STRING</a>, feed it into Skyrails and see if that makes any sense. My choice was #1 example from STRING main page, which was <a title="STRING - trpA protein from E.coli K12" href="http://string.embl.de/newstring_cgi/display_single_node.pl?taskId=pBlaWopXKqO8&amp;node=545425&amp;targetmode=proteins">trpA protein from E. coli K12</a>. The main graph on the trpA interactions page looks as follows:</p>
<p><img class="aligncenter size-full wp-image-192" title="TrpA STRING" src="http://freesci.files.wordpress.com/2008/09/string_trpa.png?w=448&#038;h=305" alt="" width="448" height="305" /></p>
<p>The same graph in Skyrails:</p>
<p><img class="aligncenter size-large wp-image-193" title="TrpA Skyrails" src="http://freesci.files.wordpress.com/2008/09/trpa_skyrails.png?w=510&#038;h=361" alt="" width="510" height="361" /></p>
<p>Of course Skyrails has a 3D representation, is fully interactive, with a little work one can filter some of the connections out, put images of structures instead of green dots, etc. etc. It doesn&#8217;t look as clear as STRING, because it wasn&#8217;t optimized for such use &#8211; in practice it&#8217;s much clearer. The video below shows the basic interactions with this dataset.</p>
<p><span style="text-align:center; display: block;"><a href="http://freelancingscience.com/2008/09/09/skyrails-and-string/"><img src="http://img.youtube.com/vi/YePljkW9i4I/2.jpg" alt="" /></a></span></p>
<p>Is it useful? At the moment, not really. It has already lots of features that more mature programs lack (completely programmable menus are great idea), but usage is still crude and in some cases the flashy effects are disturbing. However, it&#8217;s worth to keep an eye on Skyrails. First, development is pretty much guaranteed, as the author said he starts a PhD on this project. Second, the basic roadmap includes features that again aren&#8217;t present anywhere else, like client-server architecture (so you can talk to Skyrails system from external application &#8211; dynamic, time-aware visualization?). And third &#8211; it&#8217;s the most cool-looking visualization system I&#8217;ve found so far (will it make into a movie, <a title="Ben Fry Genome Valence in movie" href="http://benfry.com/genomevalence/hulk/">like Genome Valence from Ben Fry did?</a>).</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/579536bb-0c71-41af-b84a-2252feae78d7/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=579536bb-0c71-41af-b84a-2252feae78d7" alt="Reblog this post [with Zemanta]" /></a></div>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/freesci.wordpress.com/191/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/freesci.wordpress.com/191/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/191/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/191/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/191/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=191&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2008/09/09/skyrails-and-string/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/09/string_trpa.png" medium="image">
			<media:title type="html">TrpA STRING</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/09/trpa_skyrails.png?w=510" medium="image">
			<media:title type="html">TrpA Skyrails</media:title>
		</media:content>

		<media:content url="http://img.youtube.com/vi/YePljkW9i4I/2.jpg" medium="image" />

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=579536bb-0c71-41af-b84a-2252feae78d7" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Ubiquity &#8211; coding something useful in less than 20 minutes</title>
		<link>http://freelancingscience.com/2008/08/27/ubiquity-coding-something-useful-in-less-than-20-minutes/</link>
		<comments>http://freelancingscience.com/2008/08/27/ubiquity-coding-something-useful-in-less-than-20-minutes/#comments</comments>
		<pubDate>Wed, 27 Aug 2008 12:57:51 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[protein structure]]></category>
		<category><![CDATA[Firefox]]></category>
		<category><![CDATA[Mozilla]]></category>
		<category><![CDATA[Ubiquity]]></category>
		<category><![CDATA[Mozilla Labs]]></category>
		<category><![CDATA[Protein Data Bank]]></category>

		<guid isPermaLink="false">http://freesci.wordpress.com/?p=174</guid>
		<description><![CDATA[Ubiquity is the new experimental extension to Firefox that will (I&#8217;m sure it will) make enormous impact on the way we use the browser. It allows to remix various services and extend functionality of the browser in very easy way (if you don&#8217;t get the point of Ubiquity yet, I recommend watching the video that [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=174&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone size-full wp-image-176" src="http://freesci.files.wordpress.com/2008/08/ubiexample.png?w=450&#038;h=222" alt="" width="450" height="222" /></p>
<p>Ubiquity is the new experimental extension to Firefox that will (I&#8217;m sure it will) make enormous impact on the way we use the browser. It allows to remix various services and extend functionality of the browser in very easy way (if you don&#8217;t get the point of Ubiquity yet, I recommend watching <a title="Ubituity - Mozilla Labs" href="http://labs.mozilla.com/2008/08/introducing-ubiquity/">the video that came with official announcement</a>; I needed to see that &#8211; description didn&#8217;t tell me much about how powerful it can be).</p>
<p>I didn&#8217;t have much time to play with it yet, but in spare 20 minutes I attempted to code a command that would show me the image of a structure from PDB given its code and eventually take me to its homepage. Suprisingly it was very easy (and I&#8217;m not a JS coder). The source is pasted below.</p>
<pre>CmdUtils.CreateCommand({
  name: "pdb",
  description: "Goes to Protein Data Bank given PDB code.",
  icon: "http://www.rcsb.org/favicon.ico",
  help: "You can specify the PDB code and pressing enter will take you to particular structure's homepage." +
    " If you type pdb code and press arrow down, you should see an image from PDB site.",

  takes: {"PDB code": noun_arb_text},

  execute: function( directObj) {
    var pdbcode = directObj.text;
    Utils.openUrlInBrowser("http://www.rcsb.org/pdb/explore/explore.do?structureId="+pdbcode);
  },

  preview: function( pblock, directObj ) {
    var pdbcode = directObj.text;

    pblock.innerHTML = "Preview of the structure:&lt;br/&gt;";
    pblock.innerHTML += "&lt;img src=\"http://www.rcsb.org/pdb/images/" + pdbcode + "_bio_r_250.jpg\" /&gt;";

  }
})</pre>
<p>It of course could be improved by using also a selected text, or allowing to keyword search the PDB (or basically any other biological database), but its current functionality suits me just fine. Ubiquity is not yet such a stable platform as <a class="zem_slink" title="Greasemonkey" rel="homepage" href="http://www.greasespot.net/">Greasemonkey</a> (or Chickenfoot), but it&#8217;s worth to keep an eye on it. I&#8217;m sure we will read sooner or later an article in peer-reviewed journal describing Ubiquity commands for life sciences <img src='http://s.wordpress.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> .</p>
<h6 class="zemanta-related-title" style="font-size:1em;">Related articles by Zemanta</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://scobleizer.com/2008/08/26/mozilla_ubiquity/">Mozilla gives the passionates one with Ubiquity</a></li>
</ul>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/a2a1c986-c293-4690-9f3c-739d75203b3b/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=a2a1c986-c293-4690-9f3c-739d75203b3b" alt="Reblog this post [with Zemanta]" /></a></div>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/freesci.wordpress.com/174/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/freesci.wordpress.com/174/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/174/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/174/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/174/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=174&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2008/08/27/ubiquity-coding-something-useful-in-less-than-20-minutes/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://freesci.files.wordpress.com/2008/08/ubiexample.png" medium="image" />

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=a2a1c986-c293-4690-9f3c-739d75203b3b" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Configuring Torque and InterProScan</title>
		<link>http://freelancingscience.com/2008/07/10/configuring-torque-and-interproscan/</link>
		<comments>http://freelancingscience.com/2008/07/10/configuring-torque-and-interproscan/#comments</comments>
		<pubDate>Thu, 10 Jul 2008 16:29:35 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Software]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[InterProScan]]></category>
		<category><![CDATA[qsub]]></category>
		<category><![CDATA[queueing system]]></category>
		<category><![CDATA[torque]]></category>

		<guid isPermaLink="false">http://freesci.wordpress.com/?p=126</guid>
		<description><![CDATA[Image via Wikipedia If by the chance, you want to use InterProScan with Torque Resource Manager (queueing system based on PBS project) it doesn&#8217;t work by default (it&#8217;s tested with LSF, configuration files are supplied for original PBS and Sun Grid Engine). Fortunately there are two small changes needed in the InterProScan config files to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=126&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class="zemanta-img" style="float:right;display:block;margin:1em;"><span class="zemanta-img-attribution">Image via <a href="http://commons.wikipedia.org/wiki/Image:Bash_screenshot.png">Wikipedia</a></span></div>
<p>If by the chance, you want to use <a title="InterProScan README" href="ftp://ftp.ebi.ac.uk/pub/databases/interpro/iprscan/README.html">InterProScan</a> with <a class="zem_slink" title="Portable Batch System" rel="wikipedia" href="http://en.wikipedia.org/wiki/Portable_Batch_System">Torque Resource Manager</a> (queueing system based on PBS project) it doesn&#8217;t work by default (it&#8217;s tested with <a class="zem_slink" title="Platform LSF" rel="wikipedia" href="http://en.wikipedia.org/wiki/Platform_LSF">LSF</a>, configuration files are supplied for original PBS and <a class="zem_slink" title="Sun Grid Engine" rel="wikipedia" href="http://en.wikipedia.org/wiki/Sun_Grid_Engine">Sun Grid Engine</a>). Fortunately there are two small changes needed in the InterProScan config files to make it work. First, during iprscan configuration, choose PBS54 as your queueing system. Then, in the file pbs54.conf (IPRSCANHOME/conf) remove &#8220;-d&#8221; switch from following two lines:</p>
<pre>asyncsub=qsub [%optqueue][%optresource] -d -o /dev/null -e /dev/null "[%toolcmd]"
syncsub=qsub [%optqueue][%optresource] -d -o /dev/null -e /dev/null -I "[%toolcmd"]</pre>
<p>Assumming that Torque binaries are available in the global PATH (qsub, qdel etc., on my machine they sit under /usr/local/bin), change in default shell in the enviroment file pbs54env.sh &#8211; from #!/bin/sh to #!/bin/bash. Also, you can add another directories to the PATH in that file (I didn&#8217;t). Voilla. InterProScan jobs are now queued.</p>
<div class="zemanta-pixie" style="margin-top:10px;height:15px;"><a class="zemanta-pixie-a" title="Zemified by Zemanta" href="http://reblog.zemanta.com/zemified/d0484b25-8865-4fef-acf1-c4e19cfd20ef/"><img class="zemanta-pixie-img" style="border:medium none;float:right;" src="http://img.zemanta.com/reblog_e.png?x-id=d0484b25-8865-4fef-acf1-c4e19cfd20ef" alt="Zemanta Pixie" /></a></div>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/freesci.wordpress.com/126/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/freesci.wordpress.com/126/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/126/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/126/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/126/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/126/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/126/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/126/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/126/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/126/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/126/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/126/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=126&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2008/07/10/configuring-torque-and-interproscan/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=d0484b25-8865-4fef-acf1-c4e19cfd20ef" medium="image">
			<media:title type="html">Zemanta Pixie</media:title>
		</media:content>
	</item>
		<item>
		<title>Bug tracking systems in science</title>
		<link>http://freelancingscience.com/2008/04/18/bug-tracking-systems-in-science/</link>
		<comments>http://freelancingscience.com/2008/04/18/bug-tracking-systems-in-science/#comments</comments>
		<pubDate>Fri, 18 Apr 2008 10:21:11 +0000</pubDate>
		<dc:creator>Pawel Szczesny</dc:creator>
				<category><![CDATA[Comments]]></category>
		<category><![CDATA[Community]]></category>
		<category><![CDATA[Software]]></category>
		<category><![CDATA[bioinformatics]]></category>
		<category><![CDATA[bug tracking]]></category>
		<category><![CDATA[NPG]]></category>
		<category><![CDATA[science]]></category>

		<guid isPermaLink="false">http://freesci.wordpress.com/?p=109</guid>
		<description><![CDATA[I&#8217;m not going to describe painful process of correcting entries in biological databases or errors in publications when one is not the author &#8211; we all know how difficult and unrewarding it is. All major databases contain wrong entries &#8211; I see misannotated (or nonexistent) genes in Genbank, artificial domains in PFAM or poorly solved [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=109&subd=freesci&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m not going to describe painful process of correcting entries in biological databases or errors in publications when one is not the author &#8211; we all know how difficult and unrewarding it is. All major databases contain wrong entries &#8211; I see misannotated (or nonexistent) genes in Genbank, artificial domains in PFAM or poorly solved structures in PDB. It&#8217;s even worse in publications, where across the whole spectrum of journals I see errors which in theory shouldn&#8217;t slip through peer review (this includes such prominent publishers like NPG).</p>
<p>One of the best idea I heard that addressed this issue was to build a bug tracking system (I would like to give credit to the author, but I cannot find the source; wasn&#8217;t that one of biobloggers?). It&#8217;s simple and efficient. Something is wrong? Fill a bug report. It would be linking to the original entry, would be available for aggregation (for example to track report&#8217;s author activity), and possibly could be closed by somebody else than database maintainers or authors if it&#8217;s wrong. Because it would be external to all databases, maybe it could grow to provide &#8220;community corrected&#8221; versions of these databases?</p>
<p>What do you think? How useful such system could be?</p>
<br /><img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/freesci.wordpress.com/109/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/freesci.wordpress.com/109/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/freesci.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/freesci.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/freesci.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/freesci.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/freesci.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/freesci.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/freesci.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/freesci.wordpress.com/109/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/freesci.wordpress.com/109/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/freesci.wordpress.com/109/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=freelancingscience.com&blog=1482738&post=109&subd=freesci&ref=&feed=1" />]]></content:encoded>
			<wfw:commentRss>http://freelancingscience.com/2008/04/18/bug-tracking-systems-in-science/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/68883fb1792e3694835f60059aa0912e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">freesci</media:title>
		</media:content>
	</item>
	</channel>
</rss>