• DocumentCode
    3312176
  • Title

    The TF-IDF and Neural Networks Approach for Translation Initiation Site Prediction

  • Author

    Kongmanee, Tarintorn ; Vanichayobon, Sirirut ; Wettayaprasit, Wiphada

  • Author_Institution
    Comput. Sci. Dept., Prince of Songkla Univ., Songkla, Thailand
  • fYear
    2009
  • fDate
    8-11 Aug. 2009
  • Firstpage
    318
  • Lastpage
    322
  • Abstract
    The precise prediction of translation initiation site is an important task for the analysis of genomic sequence. This study aims to increase the accuracy for the prediction of translation initiation site using a TF-IDF-NN-TIS model (TF-IDF and neural networks approach for translation initiation site prediction). This study creates feature using 1-gram and 2-gram techniques for both upstream and downstream. Determining feature value uses TF-IDF approach and feature selection by correlation-based feature selection method. Evaluation prediction results use 10-fold cross validation. This study performed experiments on three different datasets that are Vertebrate, Arabidopsis thaliana, and TIS+50. The results of the study indicate that the proposed model gives highest accuracy with less processing time.
  • Keywords
    biology computing; correlation methods; neural nets; TF-IDF; TF-IDF-NN-TIS model; correlation-based feature selection method; genomic sequence analysis; neural networks; translation initiation site prediction; Accuracy; Artificial intelligence; Artificial neural networks; Bioinformatics; Computer science; DNA; Genomics; Laboratories; Neural networks; Sequences; TF-IDF; correlation-based feature selection; neural networks; translation initation sites;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology, 2009. ICCSIT 2009. 2nd IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-4519-6
  • Electronic_ISBN
    978-1-4244-4520-2
  • Type

    conf

  • DOI
    10.1109/ICCSIT.2009.5234582
  • Filename
    5234582