DocumentCode :
3196148
Title :
Online semi-supervised learning: Algorithm and application in metagenomics
Author :
Imangaliyev, Sultan ; Keijser, Bart ; Crielaard, Wim ; Tsivtsivadze, Evgeni
Author_Institution :
Top Inst. Food & Nutrition, Wageningen, Netherlands
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
521
Lastpage :
525
Abstract :
As the amount of metagenomic data grows rapidly, online statistical learning algorithms are poised to play key role in metagenome analysis tasks. Frequently, data are only partially labeled, namely dataset contains partial information about the problem of interest. This work presents an algorithm and a learning framework that is naturally suitable for the analysis of large scale, partially labeled metagenome datasets. We propose an online multi-output algorithm that learns by sequentially co-regularizing prediction functions on unlabeled data points and provides improved performance in comparison to several supervised methods. We evaluate predictive performance of the proposed methods on NIH Human Microbiome Project dataset. In particular we address the task of predicting relative abundance of Porphyromonas species in the oral cavity. In our empirical evaluation the proposed method outperforms several supervised regression techniques as well as leads to notable computational benefits when training the predictive model.
Keywords :
Internet; bioinformatics; data analysis; genomics; learning (artificial intelligence); meta data; microorganisms; statistical analysis; NIH Human Microbiome Project dataset; Porphyromonas species; empirical evaluation; metagenomic data analysis; online multioutput algorithm; online semisupervised learning algorithms; online statistical learning algorithms; oral cavity; predictive model; sequentially coregularizing prediction functions; supervised regression techniques; Data models; Diseases; Hidden Markov models; Prediction algorithms; Predictive models; Support vector machines; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
Type :
conf
DOI :
10.1109/BIBM.2013.6732550
Filename :
6732550
Link To Document :
بازگشت