Title :
Large-scale metagenomic clustering via quasi clique enumeration and read assignment ambiguity resolution
Author_Institution :
Electr. & Comput. Eng., Iowa State Univ., Ames, IA, USA
Abstract :
Clustering metagenomic reads from a set of sampled species at the resolution of a given taxonomic level is an important problem in metagenomics. The problem is challenging because of 1) the large scale of metagenomic sequence data and 2) the difficulty of distinguishing between homologous reads obtained from different species. In addition, classification across a hierarchy of taxonomic units is important, and it is difficult to judge what level of read similarity yields the best-quality clustering for any given taxonomic unit.
Keywords :
genomics; pattern clustering; large scale metagenomic clustering; quasi clique enumeration; read assignment ambiguity resolution; taxonomic level; Density measurement; Time measurement
Conference_Title :
Computational Advances in Bio and Medical Sciences (ICCABS), 2013 IEEE 3rd International Conference on
Conference_Location :
New Orleans, LA
DOI :
10.1109/ICCABS.2013.6629239