DocumentCode
2767293
Title
Identification of CpG islands in DNA sequences using supervised classification
Author
Raghavendra, B.S. ; Bopardikar, Ajit S.
Author_Institution
Samsung India Software Oper., Samsung Adv. Inst. of Technol. India, Bangalore, India
fYear
2011
fDate
12-15 Nov. 2011
Firstpage
958
Lastpage
960
Abstract
In this paper, we propose a supervised classification approach based on Euclidean distance for identifying CpG islands in DNA sequences. We first extract features from the training data set which is extracted from annotated DNA sequences. CpG island locations in a test data sequence are identified by calculating Euclidean distance in the feature space. A moving window method has been used to screen the input test sequence. The performance of the proposed method is verified experimentally on EMBL human DNA database. Proposed approach gives superior performance results over most of the available CpG island detectors and has potential application in annotating CpG islands in large human sequences.
Keywords
DNA; bioinformatics; biological techniques; feature extraction; genetics; genomics; image classification; learning (artificial intelligence); molecular biophysics; molecular configurations; CpG island detectors; DNA sequences; EMBL human DNA database; Euclidean distance; feature extraction; feature space; input test sequence; moving window method; supervised classification; Bioinformatics; Conferences; DNA; Databases; Feature extraction; Humans; Training; CpG islands; Epigenomics; Euclidean distance; supervised classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
Conference_Location
Atlanta, GA
Print_ISBN
978-1-4577-1612-6
Type
conf
DOI
10.1109/BIBMW.2011.6112519
Filename
6112519
Link To Document