DocumentCode :
3076838
Title :
Mining knowledge for the methylation status of CpG islands using alternating decision trees
Author :
Carson, Matthew B. ; Langlois, Robert ; Lu, Hui
Author_Institution :
Bioinformatics Program, Department of Bioengineering, University of Ilinois at Chicago, 60607 USA
fYear :
2008
fDate :
20-25 Aug. 2008
Firstpage :
3787
Lastpage :
3790
Abstract :
CpG island (CpGI) methylation is an epigenetic modification that occurs in eukaryotes and is based on the addition of a methyl group to the number 5 carbon of the pyrimidine ring of cytosine. When methylation of a CpGI occurs, the associated gene (if any) is not expressed [1]. Aberrant methylation is thought to be a causative agent in disease [2] and drug sensitivity [3], [4]. In this work, we have predicted the methylation status of CpGIs in human chromosome 21 using sequence patterns. These patterns showed a significantly different distribution between methylated and unmethylated islands in a previous work [5]. Using C4.5 with bagging and cost-sensitive learning, we achieved 85.6% accuracy, 82.8% sensitivity, and 86.4% specificity. We then constructed 1000 alternating decision trees using a bootstrapping method and analyzed the nodes that were conserved between the trees. This allowed us to find specific combinations of sequence patterns that distinguished between methylated and unmethylated CpGIs. Analysis of these characteristics offers certain insight into the conditions that permit or prevent methylation.
Keywords :
Bioinformatics; Biological cells; DNA; Decision trees; Diseases; Drugs; Genomics; Humans; Sequences; Support vector machines; Algorithms; Chromosomes, Human, Pair 21; CpG Islands; DNA Methylation; Decision Support Techniques; Gene Expression; Gene Silencing; Genome, Human; Humans; Models, Statistical; Models, Theoretical; Promoter Regions, Genetic; Reproducibility of Results;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, 2008. EMBS 2008. 30th Annual International Conference of the IEEE
Conference_Location :
Vancouver, BC
ISSN :
1557-170X
Print_ISBN :
978-1-4244-1814-5
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/IEMBS.2008.4650033
Filename :
4650033
Link To Document :
بازگشت