DocumentCode :
3140040
Title :
An effective computational method for human splice sites identification
Author :
Jiuqiang Han ; Ying Cui ; Jun Liu ; Xinman Zhang
Author_Institution :
Minist. of Educ. Key Lab. for Intell. Network, Xi´an Jiaotong Univ., Xi´an, China
fYear :
2013
fDate :
23-26 June 2013
Firstpage :
1
Lastpage :
4
Abstract :
Owing to the vast amount of DNA sequence data, the prediction of the complete structure of genes from the genomic DNA sequence becomes an important issue. For the eukaryotes, especially for the human genome, the splice sites identification plays a crucial role in gene structure prediction. A hybrid feature extraction approach which combing the position weight matrix (PWM) with the increment of diversity (ID) was proposed. Based on the extracted features, the support vector machine (SVM) was applied to classify authentic and false splice sites. The new algorithm was shown to be effective and simple. By the proposed algorithm, 92.98% of donor sites and 90.46% of acceptor sites were correctly classified. It is anticipated that the novel computational method is promising for the identification of splice sites in human genome.
Keywords :
DNA; biology computing; feature extraction; genetics; genomics; matrix algebra; support vector machines; DNA sequence data; PWM; SVM; acceptor sites; authentic splice sites; computational method; donor sites; eukaryotes; false splice sites; gene structure prediction; genes; genomic DNA sequence; human genome; human splice sites identification; hybrid feature extraction approach; increment of diversity; position weight matrix; support vector machine; Bioinformatics; Feature extraction; Genomics; Pulse width modulation; Support vector machines; Testing; Training; increment of diversity; position weight matrix; splice sites; support vector machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (ASCC), 2013 9th Asian
Conference_Location :
Istanbul
Print_ISBN :
978-1-4673-5767-8
Type :
conf
DOI :
10.1109/ASCC.2013.6606395
Filename :
6606395
Link To Document :
بازگشت