• DocumentCode
    3074706
  • Title

    A Human DNA Methylation Site Predictor Based on SVM

  • Author

    Sun, Yi-Ming ; Liu, Baw-Jhiune ; Liao, Wei-Li ; Chang, Cheng-Wei ; Huang, Hsien-Da ; Horng, Jorng-Tzong ; Wu, Li-Ching

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Central Univ., Jhongli, Taiwan
  • fYear
    2009
  • fDate
    22-24 June 2009
  • Firstpage
    22
  • Lastpage
    29
  • Abstract
    During gene expression, transcription factors are unable to bind to a transcription binding site (TFBS) involved in regulation if DNA methylation has occurred at the TFBS. Methyl-CpG-binding proteins may also occupy the TFBS and prevent the functioning of a transcription factor. Thus, the methylation status of CpG sites is an important issue when trying to understand gene regulation and shows strong correlation with the TFBS involved. In addition, CpG islands would seem to undergo cell-specific and tissue-specific methylation. Such differential methylation is presented at numerous genetic loci that are essential for development. Current DNA methylation site prediction tools need to be improved so that they include TFBS features and have greater accuracy in terms of the DNA region that is involved in methylation. We developed models that compare the differences across these regions and tissues. The TFBSs, DNA properties and DNA distribution were used as features for this classification. From the results, we found some TFBSs that were able to discriminate whether a sequence was methylated or not. The sensitivity, specificity and accuracy estimated using 10-fold cross validation were 90.8%, 80.54%, and 86.07%, respectively. Thus, for these four regions and twelve tissues, the performance levels (ACC) were all greater than 80%. We propose that the differential features or methylations vary between the different regions because the features common to each DNA region made up only 50% of the top 70 features. An online predictor based on EpiMeP is available at http://140.115.51.41/EpiMeP/. Supplementary file is available at http://140.115.51.41/EpiMeP/supplementary.doc.
  • Keywords
    DNA; biochemistry; biological tissues; biology computing; cellular biophysics; genetics; molecular biophysics; proteins; support vector machines; SVM; cell-specific methylation; gene expression; human DNA methylation site predictor; methyl-CpG-binding protein; tissue-specific methylation; transcription binding site; Bioinformatics; Computer science; DNA; Gene expression; Genetics; Humans; Predictive models; Proteins; Support vector machines; Systems biology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-0-7695-3656-9
  • Type

    conf

  • DOI
    10.1109/BIBE.2009.22
  • Filename
    5211324