DocumentCode :
1784756
Title :
Commet: Comparing and combining multiple metagenomic datasets
Author :
Maillet, Nicolas ; Collet, Guillaume ; Vannier, Thomas ; Lavenier, Dominique ; Peterlongo, Pierre
Author_Institution :
EPI GenScale, INRIA, Rennes, France
fYear :
2014
fDate :
2-5 Nov. 2014
Firstpage :
94
Lastpage :
98
Abstract :
Metagenomics offers a way to analyze biotopes at the genomic level and to reach functional and taxonomical conclusions. The bio-analyzes of large metagenomic projects face critical limitations: complex metagenomes cannot be assembled and the taxonomical or functional annotations are much smaller than the real biological diversity. This motivated the development of de novo metagenomic read comparison approaches to extract information contained in metagenomic datasets. However, these new approaches do not scale up large metagenomic projects, or generate an important number of large intermediate and result files. We introduce Commet (“COmpare Multiple METagenomes”), a method that provides similarity overview between all datasets of large metagenomic projects. Directly from non-assembled reads, all against all comparisons are performed through an efficient indexing strategy. Then, results are stored as bit vectors, a compressed representation of read files, that can be used to further combine read subsets by common logical operations. Finally, Commet computes a clusterization of metagenomic datasets, which is visualized by dendrogram and heatmaps.
Keywords :
bioinformatics; genomics; dendrogram visualization; efficient indexing strategy; functional annotations; heatmap visualization; multiple metagenomic dataset clusterization; taxonomical annotations; Bioinformatics; Genomics; Heating; Indexes; Soil; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
Conference_Location :
Belfast
Type :
conf
DOI :
10.1109/BIBM.2014.6999135
Filename :
6999135
Link To Document :
بازگشت