Title :
Towards Composing Data Aware Systems Biology Workflows on Cloud Platforms: A MeDICi-Based Approach
Author :
Gorton, Ian ; Liu, Yan ; Jian, Yin ; Kulkarni, Anand ; Wynne, Adam
Author_Institution :
Pacific Northwest Nat. Lab., Richland, WA, USA
Abstract :
Cloud computing is being increasingly adopted for deploying systems biology scientific workflows. Scientists developing these workflows use a wide variety of fragmented and competing data sets and computational tools of all scales to support their research. To this end, the synergy of client side workflow tools with cloud platforms is a promising approach to share and reuse data and workflows. In such systems, the location of data and computation is essential consideration in terms of quality of service for composing a scientific workflow across remote cloud platforms. In this paper, we describe a cloud-based workflow for genome annotation processing that is underpinned by MeDICi -- a middleware designed for data intensive scientific applications. The workflow implementation incorporates an execution layer for exploiting data locality that routes the workflow requests to the processing steps that are colocated with the data. We demonstrate our approach by composing two workflows with the MeDICi pipelines.
Keywords :
biology computing; cloud computing; data analysis; genomics; middleware; quality of service; scientific information systems; client side workflow tools; cloud computing; computational tools; data aware systems; data reuse; data sets; data share; genome annotation processing; middleware; quality of service; systems biology scientific workflows; Bioinformatics; Cloud computing; Connectors; Genomics; Peptides; Pipelines; Systems biology; Scientific workflow; cloud computing; systems biology;
Conference_Titel :
Services (SERVICES), 2011 IEEE World Congress on
Conference_Location :
Washington, DC
Print_ISBN :
978-1-4577-0879-4
Electronic_ISBN :
978-0-7695-4461-8
DOI :
10.1109/SERVICES.2011.22