DocumentCode :
1988220
Title :
Automatic recognition of regions of intrinsically poor multiple alignment using machine learning
Author :
Shan, Yunfeng ; Milios, Evangelos E. ; Roger, Andrew J. ; Blouin, Christian ; Susko, Edward
Author_Institution :
Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
fYear :
2003
fDate :
11-14 Aug. 2003
Firstpage :
482
Lastpage :
483
Abstract :
Phylogenetic analysis requires alignment of gene or protein sequences. Some regions of genes evolve fast and suffer numerous insertion and deletion events and cannot be aligned reliably with automatic alignment algorithms. Such regions of intrinsically uncertain alignment are currently detected and deleted manually before performing phylogenetic analysis. We present the results of a machine learning approach to detect regions of poor alignment automatically. We compare the results obtained from Naive Bayes (NB), C4.5 decision tree (C4.5) and support vector machine (SVM) approaches.
Keywords :
Bayes methods; biology computing; decision trees; evolution (biological); genetics; learning (artificial intelligence); proteins; support vector machines; C4.5 decision tree; Naive Bayes; SVM; automatic alignment algorithm; automatic recognition; gene; intrinsically uncertain alignment; machine learning; phylogenetic analysis; protein sequence; support vector machine approach; Bioinformatics; Decision trees; Genomics; Machine learning; Niobium; Phylogeny; Sequences; Support vector machine classification; Support vector machines; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
Type :
conf
DOI :
10.1109/CSB.2003.1227381
Filename :
1227381
Link To Document :
بازگشت