DocumentCode :
2767293
Title :
Identification of CpG islands in DNA sequences using supervised classification
Author :
Raghavendra, B.S. ; Bopardikar, Ajit S.
Author_Institution :
Samsung India Software Oper., Samsung Adv. Inst. of Technol. India, Bangalore, India
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
958
Lastpage :
960
Abstract :
In this paper, we propose a supervised classification approach based on Euclidean distance for identifying CpG islands in DNA sequences. We first extract features from the training data set which is extracted from annotated DNA sequences. CpG island locations in a test data sequence are identified by calculating Euclidean distance in the feature space. A moving window method has been used to screen the input test sequence. The performance of the proposed method is verified experimentally on EMBL human DNA database. Proposed approach gives superior performance results over most of the available CpG island detectors and has potential application in annotating CpG islands in large human sequences.
Keywords :
DNA; bioinformatics; biological techniques; feature extraction; genetics; genomics; image classification; learning (artificial intelligence); molecular biophysics; molecular configurations; CpG island detectors; DNA sequences; EMBL human DNA database; Euclidean distance; feature extraction; feature space; input test sequence; moving window method; supervised classification; Bioinformatics; Conferences; DNA; Databases; Feature extraction; Humans; Training; CpG islands; Epigenomics; Euclidean distance; supervised classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1612-6
Type :
conf
DOI :
10.1109/BIBMW.2011.6112519
Filename :
6112519
Link To Document :
بازگشت