• DocumentCode
    3140040
  • Title

    An effective computational method for human splice sites identification

  • Author

    Jiuqiang Han ; Ying Cui ; Jun Liu ; Xinman Zhang

  • Author_Institution
    Minist. of Educ. Key Lab. for Intell. Network, Xi´an Jiaotong Univ., Xi´an, China
  • fYear
    2013
  • fDate
    23-26 June 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Owing to the vast amount of DNA sequence data, the prediction of the complete structure of genes from the genomic DNA sequence becomes an important issue. For the eukaryotes, especially for the human genome, the splice sites identification plays a crucial role in gene structure prediction. A hybrid feature extraction approach which combing the position weight matrix (PWM) with the increment of diversity (ID) was proposed. Based on the extracted features, the support vector machine (SVM) was applied to classify authentic and false splice sites. The new algorithm was shown to be effective and simple. By the proposed algorithm, 92.98% of donor sites and 90.46% of acceptor sites were correctly classified. It is anticipated that the novel computational method is promising for the identification of splice sites in human genome.
  • Keywords
    DNA; biology computing; feature extraction; genetics; genomics; matrix algebra; support vector machines; DNA sequence data; PWM; SVM; acceptor sites; authentic splice sites; computational method; donor sites; eukaryotes; false splice sites; gene structure prediction; genes; genomic DNA sequence; human genome; human splice sites identification; hybrid feature extraction approach; increment of diversity; position weight matrix; support vector machine; Bioinformatics; Feature extraction; Genomics; Pulse width modulation; Support vector machines; Testing; Training; increment of diversity; position weight matrix; splice sites; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Control Conference (ASCC), 2013 9th Asian
  • Conference_Location
    Istanbul
  • Print_ISBN
    978-1-4673-5767-8
  • Type

    conf

  • DOI
    10.1109/ASCC.2013.6606395
  • Filename
    6606395