• DocumentCode
    3692944
  • Title

    A methodology for the prediction of Embryophyta protein functions using mismatch kernels

  • Author

    A. F. Cardona-Escobar;J. C. Pineda-Iral;N. Guarnizo-Cutiva;J. A. Jaramillo-Garzón

  • Author_Institution
    Instituto Tecnoló
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    This work implements a type of string kernel called Mismatch kernel, together with a methodology involving Support Vector Machines (SVM) for solving 14 molecular function classification problems of land plants (Embryophyta). The implemented methodology uses metaheuristic bio-inspired algorithms for finding optimal hyperparameters of the SVM, to solve the problem of imbalanced data class weights are also taken as hyperparameters in order to avoid sampling methods. The results were compared with the RBF (radial basis function) kernel over the same methodology. Geometric mean between specificity and sensitivity was used as the performance measure, showing that string kernels are the most suitable choice for the problem at hand.
  • Keywords
    "Kernel","Support vector machines","Proteins","Sensitivity","Bioinformatics","Training data"
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing, Images and Computer Vision (STSIVA), 2015 20th Symposium on
  • Type

    conf

  • DOI
    10.1109/STSIVA.2015.7330400
  • Filename
    7330400