DocumentCode :
64881
Title :
Comparison of Dimensional Reduction Methods for Detecting and Visualizing Novel Patterns in Human and Marine Microbiome
Author :
Xingpeng Jiang ; Xiaohua Hu ; Weiwei Xu ; Tingting He ; Park, E.K.
Author_Institution :
Coll. of Inf. Sci. & Technol., Drexel Univ., Philadelphia, PA, USA
Volume :
12
Issue :
3
fYear :
2013
fDate :
Sept. 2013
Firstpage :
199
Lastpage :
205
Abstract :
Using metagenomics to detect the global structure of microbial community remains a significant challenge. The structure of a microbial community and its functions are complicated because of not only the complex interactions among microbes but also their interactions with confounding environmental factors. Recently dimension reduction methods have been employed extensively to investigate the complex structure embedded in metagenomic profiles which summarize the abundance of functional or taxonomic categorizations in metagenomic studies. However, metagenomic profiles are not necessary to meet the “Assumption of Linearity” behind these methods. Therefore it is worth to investigate whether nonlinear methods are appropriate methods which can be utilized in metagenomic analysis. In this paper, we compare the applications of several methods, including two linear methods (Principle component analysis and nonnegative matrix factorization) and a nonlinear manifold learning method-Isomap on visualizing and analyzing metagenomic profiles. These methods are applied and compared on a taxonomic profile from 33 human gut metagenomes and a large-scale Pfam profile which are derived from 45 metagenomes in Global Ocean Sampling expedition. We find that all three methods can discover interesting structures of the taxonomic profile from human gut. Furthermore, Isomap identified a novel nonlinear structure of protein families. The relationships among the identified nonlinear components and environmental factors of global ocean are explored. The results indicate that nonlinear methods could be a complementary technique to current linear methods in analyzing metagenomic dataset.
Keywords :
bioinformatics; data analysis; data structures; genetics; genomics; learning (artificial intelligence); matrix decomposition; microorganisms; molecular biophysics; molecular configurations; nonlinear dynamical systems; principal component analysis; proteins; Isomap; dimensional reduction method; environmental factor; global ocean sampling expedition; human gut metagenome; human microbiome pattern detection; large-scale Pfam profile; linearity assumption; marine microbiome pattern visualization; metagenomic dataset analysis; metagenomic profile analysis; metagenomic profile visualization; microbe interaction; microbial community function; microbial community structure detection; nonlinear manifold learning method; nonnegative matrix factorization; principle component analysis; protein nonlinear structure; taxonomic profile categorization; taxonomic profile structure; Isomap; metagenomic profile; non-negative matrix factorization; nonlinear dimension reduction; principle component analysis; Animals; Databases, Genetic; Feces; Humans; Metagenome; Metagenomics; Microbiota; Nonlinear Dynamics; Oceans and Seas; Principal Component Analysis; Proteins; Water Microbiology;
fLanguage :
English
Journal_Title :
NanoBioscience, IEEE Transactions on
Publisher :
ieee
ISSN :
1536-1241
Type :
jour
DOI :
10.1109/TNB.2013.2263287
Filename :
6516954
Link To Document :
بازگشت