DocumentCode :
3074706
Title :
A Human DNA Methylation Site Predictor Based on SVM
Author :
Sun, Yi-Ming ; Liu, Baw-Jhiune ; Liao, Wei-Li ; Chang, Cheng-Wei ; Huang, Hsien-Da ; Horng, Jorng-Tzong ; Wu, Li-Ching
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Central Univ., Jhongli, Taiwan
fYear :
2009
fDate :
22-24 June 2009
Firstpage :
22
Lastpage :
29
Abstract :
During gene expression, transcription factors are unable to bind to a transcription binding site (TFBS) involved in regulation if DNA methylation has occurred at the TFBS. Methyl-CpG-binding proteins may also occupy the TFBS and prevent the functioning of a transcription factor. Thus, the methylation status of CpG sites is an important issue when trying to understand gene regulation and shows strong correlation with the TFBS involved. In addition, CpG islands would seem to undergo cell-specific and tissue-specific methylation. Such differential methylation is presented at numerous genetic loci that are essential for development. Current DNA methylation site prediction tools need to be improved so that they include TFBS features and have greater accuracy in terms of the DNA region that is involved in methylation. We developed models that compare the differences across these regions and tissues. The TFBSs, DNA properties and DNA distribution were used as features for this classification. From the results, we found some TFBSs that were able to discriminate whether a sequence was methylated or not. The sensitivity, specificity and accuracy estimated using 10-fold cross validation were 90.8%, 80.54%, and 86.07%, respectively. Thus, for these four regions and twelve tissues, the performance levels (ACC) were all greater than 80%. We propose that the differential features or methylations vary between the different regions because the features common to each DNA region made up only 50% of the top 70 features. An online predictor based on EpiMeP is available at http://140.115.51.41/EpiMeP/. Supplementary file is available at http://140.115.51.41/EpiMeP/supplementary.doc.
Keywords :
DNA; biochemistry; biological tissues; biology computing; cellular biophysics; genetics; molecular biophysics; proteins; support vector machines; SVM; cell-specific methylation; gene expression; human DNA methylation site predictor; methyl-CpG-binding protein; tissue-specific methylation; transcription binding site; Bioinformatics; Computer science; DNA; Gene expression; Genetics; Humans; Predictive models; Proteins; Support vector machines; Systems biology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and BioEngineering, 2009. BIBE '09. Ninth IEEE International Conference on
Conference_Location :
Taichung
Print_ISBN :
978-0-7695-3656-9
Type :
conf
DOI :
10.1109/BIBE.2009.22
Filename :
5211324
Link To Document :
بازگشت