DocumentCode :
167061
Title :
Hadoop based enhanced cloud architecture for bioinformatic algorithms
Author :
Alshammari, Hamoud ; Bajwa, Hassan ; Jeongkyu Lee
Author_Institution :
Dept. of Comput. Sci., Univ. of Bridgeport, Bridgeport, CT, USA
fYear :
2014
fDate :
2-2 May 2014
Firstpage :
1
Lastpage :
5
Abstract :
Explosion of biological data due to large-scale genomic research and advances in high throughput data generation tools result in massive distributed datasets. Analysis of such large non-relational, heterogeneous, and distributed datasets is emerging challenge in data driven biomedical industries. Highly complex biological data require unconventional computational approaches and knowledge-based solutions. Distributed datasets need to be reduced to smaller datasets that can be efficiently queried. Since genomic and biological data is generated in large volume and is stored in geographically diverse locations, distributed computing on multiple clusters, our objective here is to assess the feasibility of using Cloud based platform to analyze genomic big data. In this paper we present an enhanced Hadoop architecture to reduce computation by utilizing “Common Features” before performing redundant computation. The enhanced Hadoop architecture allows the jobs to share these common features among them. The common features describe the contents of data in blocks and can be used to determine DataNodes that store the required data.
Keywords :
Big Data; bioinformatics; cloud computing; genomics; DataNodes determination; Hadoop architecture; bioinformatic algorithms; cloud architecture; common feature utilization; genomic big data analysis; Bioinformatics; Biological cells; Computer architecture; DNA; Distributed databases; File systems; Genomics; BigData; Hadoop; Hadoop Architecture; MapReduce;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Applications and Technology Conference (LISAT), 2014 IEEE Long Island
Conference_Location :
Farmingdale, NY
Type :
conf
DOI :
10.1109/LISAT.2014.6845204
Filename :
6845204
Link To Document :
بازگشت