Title :
Enabling genomic analysis on federated clouds
Author :
Fan Jiang ; Shoffner, Michael ; Castillo, Claris ; Schmitt, C.
Author_Institution :
Renaissance Comput. Inst., Univ. of North Carolina at Chapel Hill, Chapel Hill, NC, USA
Abstract :
Genomic research involves a considerable amount of intensive computational and data management challenges including varying demand for computing resources, large data staging, management of complex workflows, and managing data and metadata across thousands of experiments and datasets. To address these challenges, researchers are typically forced to acquire and maintain experienced IT staff and informatics infrastructure. Increasingly researchers have explored cloud technologies, yet these lack key capabilities for data and workflow management. We introduce work to address these issues through use of federated cloud infrastructure coupled with data and workflow management technology. We present preliminary work toward the integration of three major technologies: ExoGENI, integrated Rule Oriented Data System (iRODS), and Pegasus/HTCondor, to develop a software infrastructure that better supports data-and workflow-centric genomic analysis.
Keywords :
cloud computing; genomics; knowledge based systems; meta data; workflow management software; ExoGENI; Pegasus/HTCondor; cloud technologies; complex workflows management; computing resources; data management; data-centric genomic analysis; federated clouds; genomic research; iRODS; integrated rule oriented data system; large data staging; meta data; software infrastructure; workflow-centric genomic analysis; Bioinformatics; Data transfer; Distributed databases; Genomics; Informatics; Internet; Runtime; ExoGENI; cloud computing; data management; genomic analysis;
Conference_Titel :
Big Data (Big Data), 2014 IEEE International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/BigData.2014.7004485