DocumentCode :
3673173
Title :
Classification of RNA sequences with pseudoknots using features based on partial sequences
Author :
Kwok-Kit Tong;Kwan-Yau Cheung;Kin-Hong Lee;Kwong-Sak Leung
Author_Institution :
Department of Computer Science and Engineering, The Chinese University of Hong Kong, Shatin N.T. Hong Kong
fYear :
2015
Firstpage :
1
Lastpage :
8
Abstract :
Classification on pseudoknots existence is a challenging and meaningful problem in Bioinformatics. As predicting RNA secondary structures with pseudoknots is NP-complete problem while predicting pseudoknot-free structures can be done in O(n3) time, if a preliminary pseudoknots existence classification of RNA sequence can be done before the prediction, the classification result can enhance the efficiency of RNA secondary structure prediction. In this paper, a classification of the existence of pseudoknots in an RNA sequence is presented. A set of features have been chosen by partial sequence content and thousands of RNA sequences with validated structures are used to train the classifier. Using a validated testing dataset, this classification method is shown to achieve a very good performance that the best result get 87% accuracy in 10-fold cross validation and around 75% accuracy in testing data. Moreover it may reveal how partial sequence content can affect the formation of pseudoknots.
Keywords :
"RNA","Partitioning algorithms","Training","Periodic structures","Testing","Measurement","Accuracy"
Publisher :
ieee
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2015 IEEE Conference on
Type :
conf
DOI :
10.1109/CIBCB.2015.7300277
Filename :
7300277
Link To Document :
بازگشت