Title :
Classification and clustering in metagenomics with unified data management and computational framework
Author :
Rasheed, Z. ; Rangwala, Huzefa ; Gillevet, P.
Author_Institution :
Dept. of Comput. Sci., George Mason Univ., Fairfax, VA, USA
Abstract :
The new generation of genomic technologies have allowed researchers to determine the collective DNA of organisms co-existing as communities across different environments. There is a need for the computational approaches to analyze and annotate the large volumes of available sequence data from such microbial communities (metagenomes). In this work, we develop an efficient and accurate metagenome classification and clustering approaches that reduce the computational complexity associated with comparing sequences. Our empirical results show the strength of the developed approaches in comparison to state-of-the-art algorithms with regards to computational efficiency and accuracy. We are also developing a web based metabiome portal that contains different computational tools and is used for the scalable, accurate, and easy analysis of large volumes of data available for the human metabiome.
Keywords :
DNA; biology computing; computational complexity; genomics; pattern classification; pattern clustering; portals; Web based metabiome portal; collective DNA; computational complexity; computational framework; human metabiome; metagenome classification approach; metagenome clustering approach; metagenomics; microbial communities; sequence data analysis; unified data management; Bioinformatics; Classification algorithms; Clustering algorithms; Communities; Genomics; Portals; Systems biology; classification; clustering; hashing; metagenomics; species diversity; system biology;
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2012 IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-1-4673-2746-6
Electronic_ISBN :
978-1-4673-2744-2
DOI :
10.1109/BIBMW.2012.6470285