Title :
A Cloud Infrastructure for Optimization of a Massive Parallel Sequencing Workflow
Author :
Terzo, Olivier ; Mossucca, Lorenzo ; Acquaviva, Andrea ; Abate, Francesco ; Provenzano, Rosalba
Author_Institution :
Ist. Superiore Mario Boella (ISMB), Torino, Italy
Abstract :
Massive Parallel Sequencing is a term used to describe several revolutionary approaches to DNA sequencing, the so-called Next Generation Sequencing technologies. These technologies generate millions of short sequence fragments in a single run and can be used to measure levels of gene expression and to identify novel splice variants of genes, allowing more accurate analysis. The proposed solution is novel in two respects. First, an optimization of the read mapping algorithm has been designed in order to parallelize processes. Second, an architecture has been implemented that consists of a Grid platform composed of physical nodes, a Virtual platform composed of virtual nodes set up on demand, and a scheduler that integrates the two platforms.
Keywords :
cloud computing; grid computing; parallel processing; DNA sequencing; cloud infrastructure; grid platform; massive parallel sequencing workflow; optimization; read mapping algorithm; next generation sequencing technology; virtual nodes; virtual platform; Bioinformatics; Computer architecture; DNA; Genomics; hybrid architecture; massive parallel sequencing; next generation sequencing; virtual environment;
Conference_Title :
Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on
Conference_Location :
Ottawa, ON
Print_ISBN :
978-1-4673-1395-7
DOI :
10.1109/CCGrid.2012.91