• DocumentCode
    2767293
  • Title

    Identification of CpG islands in DNA sequences using supervised classification

  • Author

    Raghavendra, B.S. ; Bopardikar, Ajit S.

  • Author_Institution
    Samsung India Software Oper., Samsung Adv. Inst. of Technol. India, Bangalore, India
  • fYear
    2011
  • fDate
    12-15 Nov. 2011
  • Firstpage
    958
  • Lastpage
    960
  • Abstract
    In this paper, we propose a supervised classification approach based on Euclidean distance for identifying CpG islands in DNA sequences. We first extract features from the training data set which is extracted from annotated DNA sequences. CpG island locations in a test data sequence are identified by calculating Euclidean distance in the feature space. A moving window method has been used to screen the input test sequence. The performance of the proposed method is verified experimentally on EMBL human DNA database. Proposed approach gives superior performance results over most of the available CpG island detectors and has potential application in annotating CpG islands in large human sequences.
  • Keywords
    DNA; bioinformatics; biological techniques; feature extraction; genetics; genomics; image classification; learning (artificial intelligence); molecular biophysics; molecular configurations; CpG island detectors; DNA sequences; EMBL human DNA database; Euclidean distance; feature extraction; feature space; input test sequence; moving window method; supervised classification; Bioinformatics; Conferences; DNA; Databases; Feature extraction; Humans; Training; CpG islands; Epigenomics; Euclidean distance; supervised classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • Print_ISBN
    978-1-4577-1612-6
  • Type

    conf

  • DOI
    10.1109/BIBMW.2011.6112519
  • Filename
    6112519