DocumentCode :
3090944
Title :
Enabling Large-Scale Bioinformatics Data Analysis with Cloud Computing
Author :
Karlsson, J. ; Torreno, O. ; Ramet, Daniel ; Klambauer, Günter ; Cano, M. ; Trelles, O.
Author_Institution :
Dept. of Comput. Archit., Malaga Univ., Malaga, Spain
fYear :
2012
fDate :
10-13 July 2012
Firstpage :
640
Lastpage :
645
Abstract :
The petabyte scale of the Big Data generation in bioinformatics requires the introduction of advanced computational techniques to enable efficient knowledge discovery from data. Many data analysis tools in bioinformatics have been developed but few have been adapted to take advantage of high performance computing (HPC) resources. For some of these tools, an attractive option is to employ a map/reduce strategy. On the other hand, Cloud Computing could be an important platform to run such tools in parallel because it provides on-demand, elastic computational resources. This paper presents a software suite for Microsoft Azure which supports legacy software (without modifications of the algorithm). We demonstrate the feasibility of the approach by benchmarking a typical bioinformatics tool, namely dotplot.
Keywords :
bioinformatics; cloud computing; data analysis; data mining; Microsoft Azure; big data generation; cloud computing; computational techniques; dotplot; high performance computing resources; knowledge discovery; large-scale bioinformatics data analysis; map-reduce strategy; software suite; Benchmark testing; Bioinformatics; Cloud computing; Computer architecture; Educational institutions; Schedules; Big Data; Bioinformatics; Map Reduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing with Applications (ISPA), 2012 IEEE 10th International Symposium on
Conference_Location :
Leganes
Print_ISBN :
978-1-4673-1631-6
Type :
conf
DOI :
10.1109/ISPA.2012.95
Filename :
6280355
Link To Document :
بازگشت