DocumentCode :
2413051
Title :
Identification and quantification of abundant species from pyrosequences of 16S rRNA by consensus alignment
Author :
Ye, Yuzhen
Author_Institution :
Sch. of Inf. & Comput., Indiana Univ., Bloomington, IN, USA
fYear :
2010
fDate :
18-21 Dec. 2010
Firstpage :
153
Lastpage :
157
Abstract :
16S rRNA gene profiling has recently been boosted by the development of pyrosequencing methods. A common analysis is to group pyrosequences into Operational Taxonomic Units (OTUs), such that reads in an OTU are likely sampled from the same species. However, species diversity estimated from error-prone 16S rRNA pyrosequences may be inflated because the reads sampled from the same 16S rRNA gene may appear different, and current OTU inference approaches typically involve time-consuming pairwise/multiple distance calculation and clustering. I propose a novel approach Abun-dantOTU based on a Consensus Alignment (CA) algorithm, which infers consensus sequences, each representing an OTU, taking advantage of the sequence redundancy for abundant species. Pyrosequencing reads can then be recruited to the consensus sequences to give quantitative information for the corresponding species. As tested on 16S rRNA pyrosequence datasets from mock communities with known species, Abun-dantOTU rapidly reported identified sequences of the source 16S rRNAs and the abundances of the corresponding species. AbundantOTU was also applied to 16S rRNA pyrosequence datasets derived from real microbial communities and the results are in general agreement with previous studies.
Keywords :
bioinformatics; biological techniques; genetics; microorganisms; molecular biophysics; molecular configurations; organic compounds; pattern matching; 16S rRNA gene profiling; 16S rRNA pyrosequences; AbundantOTU; abundant species identification; abundant species quantification; consensus alignment algorithm; consensus sequence inference; microbial communities; operational taxonomic units; pyrosequence grouping; pyrosequencing methods; sequence redundancy; species diversity estimation; Bioinformatics; Communities; Databases; Genomics; Heuristic algorithms; Inference algorithms; Skin; 16S rRNA gene; Operational Tax-onomic Unit (OTU); abundant species; pyrosequencing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-8306-8
Electronic_ISBN :
978-1-4244-8307-5
Type :
conf
DOI :
10.1109/BIBM.2010.5706555
Filename :
5706555
Link To Document :
بازگشت