DocumentCode
1988220
Title
Automatic recognition of regions of intrinsically poor multiple alignment using machine learning
Author
Shan, Yunfeng ; Milios, Evangelos E. ; Roger, Andrew J. ; Blouin, Christian ; Susko, Edward
Author_Institution
Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
fYear
2003
fDate
11-14 Aug. 2003
Firstpage
482
Lastpage
483
Abstract
Phylogenetic analysis requires alignment of gene or protein sequences. Some regions of genes evolve fast and suffer numerous insertion and deletion events and cannot be aligned reliably with automatic alignment algorithms. Such regions of intrinsically uncertain alignment are currently detected and deleted manually before performing phylogenetic analysis. We present the results of a machine learning approach to detect regions of poor alignment automatically. We compare the results obtained from Naive Bayes (NB), C4.5 decision tree (C4.5) and support vector machine (SVM) approaches.
Keywords
Bayes methods; biology computing; decision trees; evolution (biological); genetics; learning (artificial intelligence); proteins; support vector machines; C4.5 decision tree; Naive Bayes; SVM; automatic alignment algorithm; automatic recognition; gene; intrinsically uncertain alignment; machine learning; phylogenetic analysis; protein sequence; support vector machine approach; Bioinformatics; Decision trees; Genomics; Machine learning; Niobium; Phylogeny; Sequences; Support vector machine classification; Support vector machines; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN
0-7695-2000-6
Type
conf
DOI
10.1109/CSB.2003.1227381
Filename
1227381
Link To Document