DocumentCode
3140040
Title
An effective computational method for human splice sites identification
Author
Jiuqiang Han ; Ying Cui ; Jun Liu ; Xinman Zhang
Author_Institution
Minist. of Educ. Key Lab. for Intell. Network, Xi´an Jiaotong Univ., Xi´an, China
fYear
2013
fDate
23-26 June 2013
Firstpage
1
Lastpage
4
Abstract
Owing to the vast amount of DNA sequence data, the prediction of the complete structure of genes from the genomic DNA sequence becomes an important issue. For the eukaryotes, especially for the human genome, the splice sites identification plays a crucial role in gene structure prediction. A hybrid feature extraction approach which combing the position weight matrix (PWM) with the increment of diversity (ID) was proposed. Based on the extracted features, the support vector machine (SVM) was applied to classify authentic and false splice sites. The new algorithm was shown to be effective and simple. By the proposed algorithm, 92.98% of donor sites and 90.46% of acceptor sites were correctly classified. It is anticipated that the novel computational method is promising for the identification of splice sites in human genome.
Keywords
DNA; biology computing; feature extraction; genetics; genomics; matrix algebra; support vector machines; DNA sequence data; PWM; SVM; acceptor sites; authentic splice sites; computational method; donor sites; eukaryotes; false splice sites; gene structure prediction; genes; genomic DNA sequence; human genome; human splice sites identification; hybrid feature extraction approach; increment of diversity; position weight matrix; support vector machine; Bioinformatics; Feature extraction; Genomics; Pulse width modulation; Support vector machines; Testing; Training; increment of diversity; position weight matrix; splice sites; support vector machine;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Conference (ASCC), 2013 9th Asian
Conference_Location
Istanbul
Print_ISBN
978-1-4673-5767-8
Type
conf
DOI
10.1109/ASCC.2013.6606395
Filename
6606395
Link To Document