Title :
A Survey of Approaches and Frameworks to Carry Out Genomic Data Analysis on the Cloud
Author :
Church, Philip C. ; Goscinski, Andrzej
Author_Institution :
Sch. of Inf. Technol., Deakin Univ., Geelong, VIC, Australia
Abstract :
High Performance Computing (HPC) clouds have started to change the way how research in science, in particular medicine and genomics (bioinformatics) is being carried out. Researchers who have taken advantage of this technology can process larger amounts of data and speed up scientific discovery. However, most HPC clouds are provided at an Infrastructure as a Service (IaaS) level, users are presented with a set of virtual servers which need to be put together to form HPC environments via time consuming resource management and software configuration tasks, which make them practically unusable by discipline, non-computing specialists. In response, there is a new trend to expose cloud applications as services to simplify access and execution on clouds. This paper firstly examines commonly used cloud-based genomic analysis services (Tuxedo Suite, Galaxy and Cloud Bio Linux). As a follow up, we propose two new solutions (HPCaaS and Uncinus), which aim to automate aspects of the service development and deployment process. By comparing and contrasting these five solutions, we identify key mechanisms of service creation, execution and access that are required to support genomic research on the SaaS cloud, in particular by discipline specialists.
Keywords :
biology computing; cloud computing; genomics; parallel processing; resource allocation; Cloud Bio Linux; Galaxy; HPC clouds; HPC environments; HPCaaS; IaaS; SaaS cloud; Tuxedo Suite; Uncinus; cloud access; cloud applications; cloud execution; cloud-based genomic analysis services; genomic data analysis; genomic research; high performance computing; infrastructure as a service; resource management; scientific discovery; service creation; service deployment process; service development; service execution; software configuration tasks; virtual servers; Bioinformatics; Cloud computing; Genomics; Servers; Software as a service; Virtual machining; Bio-informatics; HPC; Software as a Service;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2014 14th IEEE/ACM International Symposium on
Conference_Location :
Chicago, IL
DOI :
10.1109/CCGrid.2014.127