DocumentCode
3312176
Title
The TF-IDF and Neural Networks Approach for Translation Initiation Site Prediction
Author
Kongmanee, Tarintorn ; Vanichayobon, Sirirut ; Wettayaprasit, Wiphada
Author_Institution
Comput. Sci. Dept., Prince of Songkla Univ., Songkla, Thailand
fYear
2009
fDate
8-11 Aug. 2009
Firstpage
318
Lastpage
322
Abstract
The precise prediction of translation initiation site is an important task for the analysis of genomic sequence. This study aims to increase the accuracy for the prediction of translation initiation site using a TF-IDF-NN-TIS model (TF-IDF and neural networks approach for translation initiation site prediction). This study creates feature using 1-gram and 2-gram techniques for both upstream and downstream. Determining feature value uses TF-IDF approach and feature selection by correlation-based feature selection method. Evaluation prediction results use 10-fold cross validation. This study performed experiments on three different datasets that are Vertebrate, Arabidopsis thaliana, and TIS+50. The results of the study indicate that the proposed model gives highest accuracy with less processing time.
Keywords
biology computing; correlation methods; neural nets; TF-IDF; TF-IDF-NN-TIS model; correlation-based feature selection method; genomic sequence analysis; neural networks; translation initiation site prediction; Accuracy; Artificial intelligence; Artificial neural networks; Bioinformatics; Computer science; DNA; Genomics; Laboratories; Neural networks; Sequences; TF-IDF; correlation-based feature selection; neural networks; translation initation sites;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-4519-6
Electronic_ISBN
978-1-4244-4520-2
Type
conf
DOI
10.1109/ICCSIT.2009.5234582
Filename
5234582
Link To Document