Title :
Taming complex bioinformatics workflows with weaver, makeflow, and starch
Author :
Thrasher, Andrew ; Carmichael, Rory ; Bui, Peter ; Yu, Li ; Thain, Douglas ; Emrich, Scott
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Notre Damem, Notre Damem, IN, USA
Abstract :
In this paper we discuss challenges of common bioinformatics applications when deployed outside their initial development environments. We propose a three-tiered approach to mitigate some of these issues by leveraging an encapsulation tool, a high-level workflow language, and a portable intermediary. As a case study, we apply this approach to refactor a custom EST analysis pipeline. The Starch tool encapsulates program dependencies to simplify task specification and deployment. The Weaver language provides abstractions for distributed computing and naturally encourages code modularity. The Makeflow workflow engine provides a batch system agnostic engine to execute compiled Weaver code. To illustrate the benefits of our framework, we compare implementations, show their performance, and discuss benefits derived from our new workflow approach relative to traditional bioinformatics development.
Keywords :
bioinformatics; programming languages; software tools; EST analysis pipeline; bioinformatics workflows; makeflow workflow engine; starch tool; weaver language; Bioinformatics; Encapsulation; Engines; Libraries; Pipelines; Runtime; Weaving;
Conference_Titel :
Workflows in Support of Large-Scale Science (WORKS), 2010 5th Workshop on
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-8989-3
Electronic_ISBN :
2151-1373
DOI :
10.1109/WORKS.2010.5671858