RSS

Category Archives: Research

Basket as a writing tool, SCAN as a collector

Basket has been my favourite notetaking software for a long time, until I had switched to mindmaps. Quite recently I’ve discovered another use for it – a writing aid. Basket in one-column mode allows to rearrange your notes just by dragging them up or down (there’re keyboard shortcuts for that as well). When I’m writing a longer piece, I don’t need to hold a structure of the article in my head. I just collect all the pieces (quotes, blog posts fragments, my own notes, links, tweets etc.) and then rearrange it as much as it’s needed. When the flow of the thoughts is optimal, I start to connect these pieces by writing some text in between :).

I don’t have DevonThink (I don’t have Mac) but for finding similar things in my archive I use SCAN. SCAN can aggregate content from a number of sources (it has plugins to read PDFs, OpenOffice and MSOffice files or even RSS feeds), analyze it, automatically assign tags, extract metadata etc. It has Lucene engine built in and does quite a good job of finding related pieces in the archive. It’s quite buggy, doesn’t read all PDFs (such as encrypted), metadata extraction doesn’t work as expected but overall the tool has a potential (and there’s no similar program available on Linux platform anyway). Its development was recently restarted so there’s hope it’s going to be improved in a near future. Additionally, it has a nice eye-candy – a visual overview of relations between tags.

This strategy is similar to the workflow described by Steven Johnson, but without DevonThink. So far I haven’t found anything better under Linux, but probably I need to check online apps – things do change every month.

1 Comment

Posted by Pawel Szczesny on November 3, 2009 in Research, Software

Tags: Basket, Metadata, RSS, SCAN, Workflow

Transitions, transitions

28 Oct

Quite a few things happened while I was away. If you’re interested, here’s not so short summary of my internet hiatus:

Research area

I think I’m done with bioinformatics. My current research area seems to be located somewhere between systems biology, theoretical biology and information/complex systems theory. I hope to build on Dawkins work, deal with emergence in biology and study subtle effects in biological systems. While I’m not sure if I will have anything interesting to show ever, I don’t have energy to do yet another project which involves programming/web interfaces/dealing with data/annotations/modelling etc. I’m done with analytics, time for synthesis :).

Carrer

Last year I wrote a post dreaming about small non-profit contract research organisation. This model of Research-as-a-Service has materialized in a virtual research institute which we have finally launched few days ago (materialized in something virtual, sign of times? 😉 ). The setup is quite simple – the institute gets a project (or applies for such) and then it searches for researchers/institutions/freelancers which are willing to subcontract parts of the project. We have outsourced not only research part, even money gathering (writing grants, etc.) is done by external company. The setup is quite flexible and pretty transparent – for example, we may represent somebody’s rights, but no intellectual property is owned by the institute. Why such institution? We become a single point of contact for a large and diverse group of scientists, which are willing to do some research for real money but don’t have time and energy to hunt for gigs by themselves. While I have an academic job, I’m in the middle of transition from being a freelancer, to being a jobs provider for freelance scientists. More on that in some other post.

Open science

I plan to spend way more time on advocating open science (all of its flavors), but… in Polish. This step is out of large frustration that even prominent figures in Polish science have no idea about changes in the science internet-aware researchers are watching and creating. Knowledge about even basic things like Open Access is dramatically low in Poland (a number of people here equals OA with low quality publications which have not been peer-reviewed). With few friends, we have a number of projects in the pipeline (for example, we hope to launch a nation-wide, created by professionals promotional campaign – bilboards, TV commercials etc. – for open science). If any of these actually works, I will let you know if we have any measureable success 😉 .

Labels, labels

Robert Anton Wilson tells a nice story in his book Prometheus Rising:

William James, father of American psychology, tells of meeting an old lady who told him the Earth rested on the back of a huge
turtle.

“But, my dear lady,” Professor James asked, as politely aspossible, “what holds up the turtle?”
“Ah,” she said, “that’s easy. He is standing on the back of another turtle.”
“Oh, I see,” said Professor James, still being polite. “But would you be so good as to tell me what holds up the second turtle?”
“It’s no use, Professor,” said the old lady, realizing he was trying to lead her into a logical trap. “It’s turtles-turtles-turtles, all the way!”

Another story is a comment from my advisor about putting my real research plans in some proposal (he supports these plans):

The most likely a reaction from reviewers will be something like this: “Nice start, some decent papers, PhD looks good. And then he got crazy.”

I feel like screaming “Labels, labels, labels, all the way!” when facing stiff schemas of what scientists “is” or what artists “is” etc. It’s a hard task by itself to integrate multiple passions and multiple interests into a coherent structure. I don’t need another set of issues because of labels people attach to seemingly creative professions. But limiting myself only to topics consistent with the image of an online scientist became even more frustrating. Therefore expect that this blog (or any other venue I choose to express myself) is going to become a lot more diverse in topics and form.

1 Comment

Posted by Pawel Szczesny on October 28, 2009 in Comments, Research, Science and Art

HMMER3 testing notes – my skills are (finally) becoming obsolete

22 Apr

: Image via Wikipedia

It’s already quite a while since I’ve started to extensively test performance of HMMER3. As many other people noticed before, speed of the search has improved dramatically – I’m really impressed how fast it is. However, it’s only part of the story. The smaller part actually.

As some of readers may know, most of my projects so far were revolving around protein sequence analysis and sequence-structure relationships. Mainly I was doing analysis of sequences that had no clear similarity to anything known, without functional annotation. Usual task was to run sequence comparison software and look at the end of the hit list, trying to make sense from hits beyond any reasonable E-value thresholds (for example I often run BLAST at E-value of 100 or 1000). I use very limited number of tools, because it takes quite a while to understand on which specific patterns a particular software fails.

The high-end tool I use most often is HHpred – HMM-HMM comparison software. It’s slow but very sensitive – my personal benchmarks show that it is able to identify very subtle patterns in sequence formed slightly above level of similar secondary structures (in other words, from the set of equally dissimilar sequences with identical secondary structure order, it correctly identifies the ones with similar tertiary structure).

The most surprising thing about HMMER3 is that in my personal benchmarks it’s almost as sensitive as HHpred. I wasn’t expecting that HMM-sequence comparison can be as good as HMM-HMM. This observation suggests that there’s still a room for improvement for the latter approach, however it has already big implications.

PFAM will soon migrate to HMMER3 (the PFAM team is now resolving overlaps between families that arose due to increased sensitivity) and the moment it is be available, it will make a huge number of publications obsolete, or simply wrong. There are thousands of articles that discuss in detail evolutionary history of some particular domain (many of these will become obsolete) or draw some conclusions from the observation that some domain is not present in analyzed sequence/system (many of these will need to be revised). It will also make my skills quite obsolete, but that is always to be expected, no matter in what branch of science one is working. I also imagine that systems biology people will be very happy to have much better functional annotation of proteins.

I don’t want to call development of HMMER3 a revolution, but it will definitely have similar impact on biology as BLAST and HMMER2 had. Not only because of its speed, but also because it will create a picture of similarities between all proteins comparable to the picture state-of-the-art methods could only calculate for their small subset.

The curse of BLAST (mndoci.com)

3 Comments

Posted by Pawel Szczesny on April 22, 2009 in bioinformatics, Research, Software

Tags: bioinformatics, biology, HMM, HMMER, PFAM

The Future of (Life) Scientists

26 Mar

This post is directly inspired by excellent essay by Michael Nielsen entitiled “The Future of Science“. While Michael writes about science itself (and how openness will be playing big role in scientific process) I wanted to write few words about how and where I see scientists in a near future (or rather how the research will be done – I’m not even touching the broad topic of alternative careers for scientists). While it sounds like a complementary essay to Michael’s work, I wouldn’t dare to call it so – think of it as a collection of loose notes gathered over months of learning from online science community. Also, please keep in mind that it’s written by a biologist and as such biased towards life sciences.

It’s no news that academic environment has changed so much that a joy of research spans only small fraction of day-to-day scientists’ life. “Publish or perish“, bureaucracy, money hunting, lack of tenure track positions, impact factor, ever-postdoc are only few of many issues within academic system. There’s quite a lot of interesting initiatives that aim at improving the system and some of them will certainly succeed by solving directly some of the issues above or more likely, by creating a niche within academia in which these issues will not apply. However, I think in the long run academia is not going to be the main environment where the research is being done and more importantly, there will be infinite gradation of research jobs, allowing people from many different fields with many different skills to contribute to scientific projects.

That said, I also believe that amount of data and knowledge produced will lead to enormous specialization of scientists. This does not contradict the previous statement: I don’t think that some teenager will design and develop in his spare time a new molecular dynamics algorithm, but finding new genetic associations or inventing another way to modify bacterial genome so it has better biodegradation features sounds to me like a reachable project for many people. Specialization will be one of many factors influencing creation of new types of scientists. And what are these types? Let me describe a few.

Mind, brain, intelligence amplification – future Nobel Prize winners

This category emerged pretty recently, after reading Deepak’s post on uniqueness (or lack of it) of someone’s contribution to science. I always had this notion that no matter what I did, it would be done in a near future by someone else, but this time I could put it into words: science is like sports – winner takes it all and there’s always a winner. Because prestige of an institution or fame of a scientist plays a big role in getting one’s research funded, competition for money will lead to development of procedures that will aim at producing Nobel Prize winners (or equivalents) analogous to sports training programs.

: Image via Wikipedia

Techniques like neurolinguistic programming, biofeedback or binaural bits (just to name a few) are surrounded by such a hype, that it’s hard to believe they are worth something. However I think there’s a solid field emerging from these inventions that aims at dealing with issues we create in our lives. Have you heard that Google had opened School of Personal Growth as a part of the Google University, teaching things like mental development, emotional development, holistic health, well-being and finally a Buddhist notion, “beyond the self”? I think it’s no mistake – it’s an attempt to help employees to consistently work at their optimal speed. And there’s story is published by Nature in April last year results of a poll of using brain-doping drugs among scientists. And there’s an inspiring talk by Juan Enriquez on arrival of Homo evolutis. I believe it’s just a matter of time big universities will launch (probably secretly) their own programs for training high profile scientists. And judging from the comments to the Nature’s poll I don’t think many people will object – science, unlike sports, doesn’t have to pretend it’s fair.

Getting research done – staff scientist

This type doesn’t require introduction. If one doesn’t have to waste time on advancing career and hunting for money, one becomes a very efficient scientist. Staff scientist positions are available in many countries and I wish it could be more of them in the future, especially in bioinformatics – where a single person can be trained to do everything from microarrays analysis to molecular dynamics in a relatively (!) short time (and become then a very important asset in the lab).

Experienced specialist – nomadic freelancer

: Image via Wikipedia

This is category I was aspiring to. Here you can read little details how I tried, and here when and why it failed. I still think it can be done, although not in every field and not all the time. My hope was that telecommuting is the future of freelance scientists, but Bora offered entirely different solution: co-researching spaces/science hostels:

A coworking space has three important components: the physical space, the technological infrastructure, and the people. A Science Hostel that accommodates people who need more than armchairs and wifi, would need to be topical – rooms designed as labs of a particular kind, common equipment that will be used by most people there, all the people being in roughly the same field who use roughly the same tools.

From what I’ve seen, people doing structural biology (especially NMR-related research) tend to enjoy similar to a freelancer status: they can do a crucial high tech task, which takes no more than several weeks to finish and often the task is needed so rarely that there’s no point in employing the specialist full time (or to do in-house training).

The main disadvantage of this mode is something called “consultant’s dilemma” (hat tip Harold Jarche): when you’re working you’re not generating new ideas or business, and vice versa.

In a failure of interdisciplinary approach – translator, integrator

I expect that lots of people will disagree with me on that, but I think on the long run interdisciplinary approaches are going to fail. The area where a reason for failure is most visible is genome sequencing. Deep knowledge about single simple organism such as bacteria is beyond capability of most (if not all) laboratories and teams and that’s why publishing a genome is just a starting point, not end to a process. It takes years of work of experts in their own small fields to extract all useful information from the single sequence.

Once this situation becomes more of an issue, scientific translators may emerge. Such person will track scientific literature in two (or three or four, such as language translators) small fields and will tell group of researchers from one field what important has been published in other field. Will similar service become part of libraries or such people will become independent consultants? I have no idea.

I don’t think that gaps in knowledge will be corrected by talking to colleagues or by review process. Here’s a perfect example (in used-to-be prestigious journal): neither authors nor reviewers have noticed that the structure containing so-called trimerization “octads” is a perfectly fine, quite regular, heptad-based coiled-coil (you guess it right, these “octads” were separated by six residues, giving together fourteen – two coiled-coil heptads). It was already visible in the sequence figure – but only if you knew that things like coiled-coils exist and were already studied by Francis Crick. After almost a year and a half correction wasn’t submitted which means the community does not care either.

Bioentrepreneur

As soon as we have our own Paul Graham and a clear, well-described path of how to make a startup in life sciences successful, we will have a bloom of bioentrepreneurs. Life science is a field comparable to high-tech, not software industry. It requires different skills and different approach, but no one has so far put it into words that we can follow. Also, we need more hardware providers in area of life sciences. If you want to build a mobile phone, it’s a matter of days to order its every single part. If you want to build your own sequencing machine, I wish you good luck, because it will take considerably longer (you need to wait until respective companies are built and offer their products).

Nevertheless, I’m sure it will happen. Streamlining life sciences is something that lots of people are talking about.

Clean data needed – biocurator

The more data the more errors. Recently, I’ve stumbled upon interesting functional annotation of a protein: will die slowly. Search on NCBI reveals few dozens of proteins with such annotation. This is a terse description of a phenotype, however I don’t think should be used as a protein name. Paul Davis suggested that this propagated from Drosophila, since fruit fly gene names have a long history of names blurb:

Early work refers to the gene as fruity, an apparent pun on both the common name of D. melanogaster, the fruit fly, as well as a slang word for homosexual. As social attitudes towards homosexuality changed, fruity came to be regarded as offensive, or at best, not politically correct. Thus, the gene was re-dubbed fruitless, alluding to the lack of offspring produced by flies with the mutation.

It’s nothing new that to reach holy grail of many fields (text mining, ontologies, automated discoveries, predictions), we need manual curation of biological data (even Wolfram Alpha is based on curated data). Similarly to staff scientists, biocurator jobs are already appearing in science job listing.

Science as creative hobby – “not even a scientist”

In the introduction I’ve mentioned a teenager inventing new genetic modification of an organism. While to some it may sound difficult, unquestionable success of iGEM competition shows that it doesn’t require 20 years of research experience to come up with such ideas. Lots of knowledge and lots of data create opportunity for people outside academia to jump in and make a valuable contribution. The necessary requirement in “openness” – as long as the data and publications are freely available, there’s a space for outsiders.

I expect (or I hope) amateur science to grow in the following years – especially in the less bureaucratic countries. If we don’t see many of such examples yet, it’s the education system to blame – kids don’t realize that remixing data and remixing video are very similar things that differ only by a target audience, but both can be cool :).

Knowing your position – “lighthouse” scientist

Lighthouse’s primary role is to assists in navigation – it helps you find your position on the map. Lighthouse is not a point of reference – as a point on the map is usually no more important than any other points. Lighthouse helps you understand where you are. Tech crowd has its own “lighthouse” people, for example Tim O’Reilly. Our small online science community has Bill Hooker. Neither of them seem to have outstanding resume (sorry to write that, I’ve seen better ones), but to understand where you are it’s worth to pay attention to what they say. They seem to understand particular part of our world much better than anybody else.

To put it in other words, a lighthouse scientist isn’t necessarily a person with the biggest achievements or a person who has a brilliant vision of the future – it’s a person who sees trends and movements, has a wider perspective and most importantly knows what’s important. In recent discussions on the blogosphere about bioinformatics as a field of science, Sean Eddy didn’t express his opinion – which I think is a very meaningful response.

Final thoughts

I’ve sketched this map to organize lots of thoughts and discussions around future directions of science. It is far from being complete and full of wishful thinking, but still helped me to wrap my mind around couple of issues in this area. Probably the most important thing I’ve realized is what was put into introduction: that the future may open lots of options for people willing to stay close to science. Those who realize this will benefit from them as first.

Update: there are interesting comments over at FriendFeed already.

3 Comments

Posted by Pawel Szczesny on March 26, 2009 in Comments, Community, Research

Tags: Academia, Future of Science, Google, Impact Factor, Openness, Publications, Science in Society, Scientific method

Structure prediction without structure – visual inspection of BLAST results

03 Feb

portschema My recent post on visual analytics in bioinformatics lacked a specific example, but I’m happy to finally provide one (happiness comes also from the fact that respective publication is finally in press). The image above shows a multiple pairwise alignment from BLAST of a putative inner membrane protein from Porphyromonas gingivalis. Image is small but it does not really matter – colour patches seem to be visible anyway.

Regions marked with ovals are clearly less conserved, than other part of the protein. There are five hydrophobic (green patches, underlined with blue lines) regions in this alignment (I ignore N-terminus, as this is likely the signal peptide), however the three inner ones appear to be of similar length, while the outer ones seem to be of the half as long as the inner ones. If we assume that the single unit is the short one, we can summarize the protein as follows: 8 beta structures, four long loops, for short loops. It looks like an eight-stranded outer membrane beta-barrel. Almost structure prediction, but without a structure.

I could end the story here, but the model didn’t fit previously published data. Its localization in the inner membrane was confirmed by an experiment, however pores in the inner membrane are considered very harmfull 😉 . Fortunately, one of my colleagues explained to me that particular localization technique is not 100% reliable, so I gathered more evidence, created detailed description of topology and the other group has designed experiments which confirmed my visual analysis.

Lessons learned? Maybe without this feedback on quality of that experimental technique, I would still claim that this is OM beta-barrel. Or maybe not. But I’ve learned that to safely ignore experimental results, one needs a more than a intuition. Also, it shows that sometimes looking at the results, is all one needs to make a reasonable prediction (I still have no idea what were E-values of these BLAST hits, but does it matter?).

7 Comments

Posted by Pawel Szczesny on February 3, 2009 in bioinformatics, Research, Visualization

Tags: bioinformatics, biology, Inner membrane, Membrane protein, Porphyromonas gingivalis, Visual analytics

Another collaborative environment: Project Wonderland

29 Dec

This is a short post on the Sun’s Project Wonderland. Citing from its home page

Project Wonderland is a 100% Java and open source toolkit for creating collaborative 3D virtual worlds. Within those worlds, users can communicate with high-fidelity, immersive audio, share live desktop applications and documents and conduct real business. Wonderland is completely extensible; developers and graphic artists can extend its functionality to create entire new worlds and new features in existing worlds.

In my recent post I’ve mentioned Second Life and Croquet: two platforms that can evolve into decent 3D visualization environments. Obviously I didn’t research the topic enough, as I’ve just found Project Wonderland. It seems to have the best of both worlds – professional team of developers, pretty flexible architecture and possibility of running your own instance of “virtual world”.

Have you spotted "Biogang" written on the whiteboard? 🙂

I didn’t play with it for a long time – current version is not very feature-rich (although it already contains video player with webcam support, PDF viewer, VNC viewer and a crude whiteboard), however the roadmap looks very interesting. I really liked extensive audio features – true stereo, sounds fade out with distance, special “cone of silence” (place where you can have a private conversation) – it proves that Sun is really trying to build an effective collaboration platform.

I haven’t seen yet much about data visualization in Wonderland – although below you can find interesting example of molecular simulation trajectory shown inside Wonderland.

Comments Off

Posted by Pawel Szczesny on December 29, 2008 in Education, Research, Visualization

Tags: collaboration, Online Services, Software, Visualization

Bioinformatics is a visual analytics (sometimes)

18 Dec

Short description of my research interest is “I do proteins” (I took this phrase from my friend Ana). I try to figure out what particular protein, protein family, or set of proteins does in the wider context. Usually I start where automated methods have ended – I have all kinds of annotation so I try to put data together and form some hypothesis. I recently realized that the process is basically visualizing different kind of data – or rather looking at the same issue from many different perspectives.

It starts with alignments. Lots of alignments. And they all end up in different forms of visual representation. Sometimes it’s a conservation with secondary structure prediction (with AlignmentViewer or Jalview):

blog-0005

Sometimes I look for transmembrane beta-barrels (with ProfTMB):

blog-0005

Sometimes I try to find a pattern in hydrophobicity and side-chain size values across the alignment (Aln2Plot):

blog-0005

Afterwards I seek for patterns and interesting correlations in domain organization (PFAM, Smart):

blog-0008

Sometimes I map all these findings onto a structure or a model that I make somewhere in the meantime based on found data (Pymol, VMD, Chimera):

blog-0006

I also try to make sense out of genomic context (works for eukaryotic organisms as well – The SEED):

blog-0005

I investigate how the proteins cluster together according to their similarity (CLANS):

blog-0013

And figure out how the protein or the system I’m studying fits into interaction or metabolic networks (Cytoscape, Medusa, STRING, STITCH):

blog-0007

If there’s some additional numerical information I dump it into analysis software (R, for simpler things DiVisa):

blog-0005

And I make note along the process in the form of a mindmap (Freemind, recently switched to Xmind, because it allows to store attachments and images in the mindmap file, not just link to them like Freemind does): blog-0010

So it turns out that I mainly do visual analytics. I spend considerable amount of time on preparing various representations of biological data and then the rest of the time I look at the pictures. While that’s not something every bioinformatician does, many of my colleagues have their own workflows that also rely heavily on pictures. For some areas it’s more prominent, for others it’s not, but the fact is that pictures are everywhere.

There are two reasons I use manual workflow with lots looking at intermediate results: I work with weak signals (for example, sometimes I need to run BLAST at E-value of 1000) or I need to deeply understand the system I study. Making connections between two seemingly unrelated biological entities requires wrapping one’s brain around the problem and… lots of looking at it.

And here comes the frustration. I counted that I use more than twenty (!) different programs for visualization. And even if I’m enjoying monitor setup 4500 pixels wide which is almost enough to put all that data onto screen, the main issue is that the software isn’t connected. AlignmentViewer cannot adjust its display automatically based on the domain I’m looking at or a network node I’m investigating – I need to do it by myself. Of course I can couple alignments and structure in Jalview, Chimera or VMD but I don’t find such solution to be usable on the long run. To have the best of all worlds, I need to juggle all these applications.

I’ve been longing for some time already for a generic visualization platform that is able to show 2D and 3D data within the single environment, so I follow development of SecondLife visualization environment and Croquet/Cobalt initiatives. While these don’t look very exciting right now, I hope they will provide a common platform for different visualization methods (and of course visual collaboration environment).

But to be realistic, visual analytics in biology is not going to become a mainstream. It’s far more efficient to improve algorithms for multidimensional data analysis than to spend more time looking at pictures. I had already few such situations when I could see some weak signal and in a year or two it became obvious. But I’m still going to enjoy scientific visualization. I came to science for aesthetic reasons after all. 🙂

6 Comments

Posted by Pawel Szczesny on December 18, 2008 in bioinformatics, Proteins, Research, Software, Visualization

Tags: bioinformatics, biology, Chimera, Cytoscape, Online Services, protein, Protein family, Visual analytics, Visualization

Photography is not a hobby. Updated CV and feedback request.

18 Nov

Yesterday I asked over at FriendFeed for the feedback on my early attempt of making visual CV (big thanks to all who commented). Here’s a revised version that hopefully looks much better. The key to read the image above (click to see larger version) is as follows: Y-axis represents time (with dotted line indicating more or less the present moment); areas of interest are along X-axis; color of the phrases indicates my confidence level; font size denotes amount of time I spent on the topic (so in this case I have spent lots of time using perl, but I still don’t feel very confident about it); placement of the phrases denotes which areas of interest particular project/phrase spans; area below the dotted line shows my approximate plans and hopes for the future.

The first version had “Photography” area instead of “Visualization”, but I needed to change that since it was confusing everybody and raised questions why I put a hobby on a professional CV. Photography (or visual arts) is not my hobby. My hobby is choir singing (which I do for over 14 years already, currently singing jazz and gospel). Visualization/Photography is there to indicate that I consider data visualization one of the most important elements of scientific method. What I’m trying to figure out is what kind of presentation can help us in understanding really complex systems, such as human (genetic, to make it easier) diseases. And when we understand them curing is going to be much easier. At least I hope it will.

Anyway, the true reason to post it is to ask my readers for feedback on missing elements of my plans. So far my ideas for the future research projects split into a few paths. First path is to work further on bacterial systems (or subsystems, such as secretion systems etc.). This work would translate later on into something I call Synthetic Biology Framework, which would be a tool helping in designing new biological systems, and maybe later would result in creating a programming language for a cell. My first ideas about the framework were to design engineered bacteria producing some important compounds, maybe drugs, but now I think the cooler use for the framework would be to design bionano machines. The second path is about modelling of human diseases, with important milestone which is analysis of human genome and metagenome (genobiome as I call it) – if the data will be available. Because I don’t think I could do better here than thousands of scientists if I were using the same information, here’s a moment where synthetic biology comes into play again – I hope that I could design nanomachines that would server as quick diagnostic tools or would be reporting the body state in some mostly non-invasive way (aiming at issue of “how is my cholesterol level building up”). The third path is mostly empty and concerns visualization methods. So far I have no clear idea how to build a system that would visually assist in understanding how cells work. I plan to experiment with 3D printing and 3D visualization of biological networks, but I have no clear idea where this will lead me.

So if you have some opinion, comment, idea how to connect some dots, how to jump from one area to another (for example I have no yet idea how to approach pharmacogenomics), or if you think that it doesn’t make sense at all feel free to comment.

3 Comments

Posted by Pawel Szczesny on November 18, 2008 in Career, Research

Tags: CV, Data visualization, FriendFeed, Human genome, Information Visualization, Photography, Resume, Scientific method, Scientific Visualization, Synthetic biology, Visualization

Thinking about RaaS: Research-as-a-Service

03 Nov

Image by Getty Images via Daylife

Instead of disclaimer: this is a bunch of loose thoughts on an element of a possible future of research. I’m only touching some issues here and still I don’t have coherent vision of the commercial side of research. So, feel free to show me I’m very wrong – you’ll save me lots of time of coming to your conclusions :).

According to Wikipedia, Everything as a Service is a:

concept of being able to call up re-usable, fine-grained software components across a network.

While the most common example is SaaS – Software as a Service, this concept can be applied to other functions such as communication, infrastructure or data (the last one sounds very interesting). It recently occurred to me, that investments (or maybe I should call these partnerships) of biotech and pharma companies in academic research institutions are good examples of RaaS, Research as a Service. I think every situation where the research is done after the agreement (buying or licensing patented innovation doesn’t qualify) can be called RaaS.

Why

To a company, there are obvious advantages of hiring scientists to get the research done, but I think there also would be plenty of good sides of such arrangement for us (of course I have no experience yet). Probably the biggest plus would be money and ability to get them in somehow predictable manner. I think it’s also important to be stretched intellectually from time to time (I assume that easy things aren’t worth outsourcing).

Many flavors of RaaS

Paying to an academic institution to come up with a new drug candidate is only one of many types of RaaS. There’s researching a given problem (something Innocentive or Nine Sigma are coordinating), coming up with an innovation (drug candidate example), providing expertise (consulting) or innovating and delivering (designing, building and implementing new machine, workflow or pipeline). We could find examples of all types happening everyday, but probably not in all scientific fields. Delivering something in biology is usually quite expensive and time consuming, while consulting gigs in quantum physics don’t appear all that often.

The point is that all these RaaS flavors can and are applied to academic institutions. In other words, many researchers provide commercial services using time and equipment paid from taxpayers money. And I think it’s not an issue – even more: it should be finally admitted and accepted (so we could get rid of the artificial division of researches into academic and all others; but that’s another story), and organized, so we could provide such services easier and more often.

Resources all over the place

The Health Commons project aims at building a framework that could help in sharing and organising research process aiming at developing new drugs. We seem to have lots of elements of such environment in place – we have many (or even too many 😉 ) scientists, some service providers, data centers and some work done on standards of operations and information exchange. If we forget about drug development, not much actually changes. We have workforce, some services aimed at researchers and lots of tools that help in communication in both directions.

Here’s example of research scenario: if I were to market a genetic test that identifies mutations resulting in oversensitivity or resistance to a drug (something which I believe will be the next hit after screening for disease markers), the whole research part wouldn’t require any significant involvement from my side. CRO (contract research organization) would take care of identifying patients with specific conditions, sequencing company would get me their genomes (currently $5000 each, but the price is dropping very fast) and as far as I know bioinformatics community, finding people to analyze the data wouldn’t be an issue at all. While such scenario is a bit too optimistic (I skipped lawyers in the process), we already have resources to make it happen.

Where is it going?

I imagine future RaaS provider as a small company (I’m not yet sure if a non-profit organization is a better fit for people interested in doing research; also, I don’t know how fast the issue of academic-commercial blur can be solved) made by a few scientists from different but closely related fields. The reason I see it small, is about mobility. And I don’t mean here physical mobility (which BTW may be required on some occasions) but mobility of focus – the main advantage of small organizations.

I imagine such company would be able to do consulting (and data analysis, maybe on the RedMonk model) and innovate at a software level. It would be able to do the work on site (small group again) and deliver the results quick (“bursty work”).

Pieces of this vision come from old Deepak’s posts and many FriendFeed discussions. I actually think about putting it into practice. What do you think I am missing here (other than marketing 😉 )?

6 Comments

Posted by Pawel Szczesny on November 3, 2008 in Comments, Research

Tags: Business, Cloud computing, Intellectual property, Knowledge Management, raas, Research, Saas, service, Software as a service, Venture capital

Open Access Day

14 Oct

Today is the world’s first Open Access Day . It aims at broadening awareness and understanding of OA. The approach is to make as many people as possible to blog today on the topic, possibly answering the following questions:

Why does Open Access matter to you?

In my case, where pretty soon I’ll have no support from a large institution, Open Access means ability to do research. OA is a vital help to small or underfunded research groups.

How did you first become aware of it?

Internal policy of my former employer required that all results should be published in OA journals. BTW, it didn’t change since then.

Why should scientific and medical research be an open-access resource for the world?

Ability to do research and to innovate shouldn’t be inhibited by access to knowledge and data produced by publicly funded research institutions.

What do you do to support Open Access, and what can others do?

I do publish in OA journals (four out of five publications I have so far are OA).

See more OA Day entries at FriendFeed Open Access Day room.

Comments Off

Posted by Pawel Szczesny on October 14, 2008 in Community, Research

Tags: Blog, Open Access, Publications, Research

What I use

Online grammar check
About this site
- "Freelancing science" is a blog about biology in silico, data visualization and open science. Written by Paweł Szczęsny.
- Contact: pawel at FreelancingScience dot com
- Original content of this site is licensed under a Creative Commons Attribution 3.0 Unported License, unless stated otherwise.
Other sites and projects
- New home site - Circle of complexity
- Sudden Infant Death Syndrome portal
Most popular posts
Twitter Updates
Tweets by freesci
bioblogs bioinformatics Biological engineering Career Clipped Comments Community Data mining Dump-all Education Fun Imaginary nanodevice Money open-science Papers Proteins PubMed Research Research skills Science and Art Secretion system Services Software Structural biology Structure prediction Synthetic biology Visualization
Shared items
- An error has occurred; the feed is probably down. Try again later.
GR starred items (not necessarily scientific)
- An error has occurred; the feed is probably down. Try again later.
My science-related images

More Photos
Archives
Archives

Category Archives: Research

Research area

Carrer

Open science

Labels, labels

Related articles by Zemanta

Mind, brain, intelligence amplification – future Nobel Prize winners

Getting research done – staff scientist

Experienced specialist – nomadic freelancer

In a failure of interdisciplinary approach – translator, integrator

Bioentrepreneur

Clean data needed – biocurator

Science as creative hobby – “not even a scientist”

Knowing your position – “lighthouse” scientist

Final thoughts

Why

Many flavors of RaaS

Resources all over the place

Where is it going?

What I use

About this site

Other sites and projects

Most popular posts

My science-related images

Archives