• DocumentCode
    3106832
  • Title

    Searching Single Nucleotide Polymorphism Markers to Complex Diseases Using Genetic Algorithm Framework and a BoostMode Support Vector Machine

  • Author

    Anekboon, Khantharat ; Phimoltares, Suphakant ; Lursinsap, Chidchanok ; Tongsima, Sissades ; Fucharoen, Suthat

  • Author_Institution
    Dept. of Math., Chulalongkorn Univ., Bangkok, Thailand
  • fYear
    2010
  • fDate
    18-20 June 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    With the advent of large-scale high density single nucleotide polymorphism (SNP) arrays, case-control association studies have been performed to identify predisposing genetic factors that influence many common complex diseases. These genotyping platforms provide very dense SNP coverage per one chip. Much research has been focusing on multivariate genetic model to identify genes that can predict the disease status. However, increasing the number of SNPs generates large number of combined genetic outcomes to be tested. This work presents a new mathematical algorithm for SNP analysis called IFGA that uses a "BoostMode" support vector machine (SVM) to select the best set of SNP markers that can predict a state of complex diseases. The proposed algorithm has been applied to test for the association study in two diseases, namely Crohn\´s and severity spectrum of βo/Hb E Thalassemia diseases. The results revealed that our predicted SNPs can respectively best classify both diseases at 71.57% and 71.06% accuracy using 10-fold cross validation comparing with the optimum random forest (ORF) and classification and regression trees (CART) techniques.
  • Keywords
    bioinformatics; diseases; genetic algorithms; genetics; lab-on-a-chip; regression analysis; support vector machines; β0/Hb E Thalassemia; 10-fold cross validation; BoostMode support vector machine; CART; Crohn´s disease; IFGA; ORF; SNP analysis; SNP array; SNP markers; SVM; classification and regression trees; genetic algorithm; multivariate genetic model; optimum random forest; predisposing genetic factors; single nucleotide polymorphism; Bioinformatics; Biological cells; Classification tree analysis; Diseases; Encoding; Genetic algorithms; Genomics; Mathematics; Support vector machines; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
  • Conference_Location
    Chengdu
  • ISSN
    2151-7614
  • Print_ISBN
    978-1-4244-4712-1
  • Electronic_ISBN
    2151-7614
  • Type

    conf

  • DOI
    10.1109/ICBBE.2010.5515780
  • Filename
    5515780