• DocumentCode
    1988220
  • Title

    Automatic recognition of regions of intrinsically poor multiple alignment using machine learning

  • Author

    Shan, Yunfeng ; Milios, Evangelos E. ; Roger, Andrew J. ; Blouin, Christian ; Susko, Edward

  • Author_Institution
    Fac. of Comput. Sci., Dalhousie Univ., Halifax, NS, Canada
  • fYear
    2003
  • fDate
    11-14 Aug. 2003
  • Firstpage
    482
  • Lastpage
    483
  • Abstract
    Phylogenetic analysis requires alignment of gene or protein sequences. Some regions of genes evolve fast and suffer numerous insertion and deletion events and cannot be aligned reliably with automatic alignment algorithms. Such regions of intrinsically uncertain alignment are currently detected and deleted manually before performing phylogenetic analysis. We present the results of a machine learning approach to detect regions of poor alignment automatically. We compare the results obtained from Naive Bayes (NB), C4.5 decision tree (C4.5) and support vector machine (SVM) approaches.
  • Keywords
    Bayes methods; biology computing; decision trees; evolution (biological); genetics; learning (artificial intelligence); proteins; support vector machines; C4.5 decision tree; Naive Bayes; SVM; automatic alignment algorithm; automatic recognition; gene; intrinsically uncertain alignment; machine learning; phylogenetic analysis; protein sequence; support vector machine approach; Bioinformatics; Decision trees; Genomics; Machine learning; Niobium; Phylogeny; Sequences; Support vector machine classification; Support vector machines; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
  • Print_ISBN
    0-7695-2000-6
  • Type

    conf

  • DOI
    10.1109/CSB.2003.1227381
  • Filename
    1227381