DocumentCode
3089750
Title
A Novel Method for Splice Sites Recognition Using Comprehensive Information
Author
Wang, Kejun ; Lv, Junjie ; Feng, Weixing ; Wang, Xin
Author_Institution
Coll. of Autom., Harbin Eng. Univ., Harbin, China
fYear
2010
fDate
17-19 Sept. 2010
Firstpage
986
Lastpage
989
Abstract
To identify splice sites more accurately and efficiently, a method for the recognition of splice sites based on comprehensive information is proposed. By analyzing the splicing signals, splicing sequences, secondary structures of flank sequence, different splicing factor mechanism of action and other characteristics of donor sites and acceptor sites, donor sites identification signal model, acceptor sites identification signal model, donor sites identification sequence model, acceptor sites identification sequence model were built respectively. Then the Mfold package in Vienna soft was used to predict the most stable secondary structure of flank sequences. The traditional four-letter alphabet was converted into eight-letter alphabet sequence. The sequence-structure combination strings were used for training signal models, sequence models, then recognized splice sites by the well trained models. Our results show that the accuracy of splice site recognition is greater than 95%, suggesting that the method has great potential to achieve a good performance for splice sites identification.
Keywords
biology computing; macromolecules; molecular biophysics; pattern recognition; sequences; Mfold package; Vienna soft; comprehensive information; flank sequences; sequence-structure combination strings; splice sites recognition; Accuracy; Artificial neural networks; Biological system modeling; Hidden Markov models; Predictive models; Splicing; Tin; alternative splice; secondary structure; splice sites; splicing sequences; splicing signals;
fLanguage
English
Publisher
ieee
Conference_Titel
Pervasive Computing Signal Processing and Applications (PCSPA), 2010 First International Conference on
Conference_Location
Harbin
Print_ISBN
978-1-4244-8043-2
Electronic_ISBN
978-0-7695-4180-8
Type
conf
DOI
10.1109/PCSPA.2010.243
Filename
5635959
Link To Document