Title :
Designing Integrated Computational Biology Pipelines Visually
Author_Institution :
Dept. of Comput. Sci., Univ. of Idaho, Moscow, ID, USA
Abstract :
The long-term cost of developing and maintaining a computational pipeline that depends upon data integration and sophisticated workflow logic is too high to even contemplate "what if" or ad hoc queries. In this paper, we introduce VizBuilder, a novel application-building interface for computational biology research that leverages BioFlow, a recent query language for life sciences databases. Using VizBuilder, it is now possible to develop complex ad hoc computational biology applications at throwaway cost. The underlying query language supports data integration and workflow construction almost transparently and fully automatically, using a best-effort approach. Users express an application by drawing it with VizBuilder icons and connecting them in a meaningful way. Completed applications are compiled and translated into BioFlow queries for execution by the data management system LifeDB, for which VizBuilder serves as a front end. After briefly introducing BioFlow, we discuss VizBuilder's features and functionality in the context of a real-life application. The architecture and design principles of VizBuilder are also discussed. Finally, we outline future extensions of VizBuilder. To our knowledge, VizBuilder is a unique system that allows computational biology pipelines involving distributed and heterogeneous resources to be designed visually in an ad hoc manner.
Keywords :
ad hoc networks; biological techniques; biology computing; data integration; distributed databases; query languages; visual programming; BioFlow queries; LifeDB; VizBuilder icons; ad hoc complex computational biology applications; ad hoc type queries; computational biology research; data management system; distributed resources; front end; heterogeneous resources; integrated computational biology pipeline; life sciences databases; query language; real life application; what if queries; workflow construction; workflow logic; Computational biology; Databases; Object recognition; Pipelines; Programming; Visualization; ad hoc queries; computational pipelines; systems biology; workflow;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2013.69