DocumentCode
3692944
Title
A methodology for the prediction of Embryophyta protein functions using mismatch kernels
Author
A. F. Cardona-Escobar;J. C. Pineda-Iral;N. Guarnizo-Cutiva;J. A. Jaramillo-Garzón
Author_Institution
Instituto Tecnoló
fYear
2015
Firstpage
1
Lastpage
7
Abstract
This work implements a type of string kernel called Mismatch kernel, together with a methodology involving Support Vector Machines (SVM) for solving 14 molecular function classification problems of land plants (Embryophyta). The implemented methodology uses metaheuristic bio-inspired algorithms for finding optimal hyperparameters of the SVM, to solve the problem of imbalanced data class weights are also taken as hyperparameters in order to avoid sampling methods. The results were compared with the RBF (radial basis function) kernel over the same methodology. Geometric mean between specificity and sensitivity was used as the performance measure, showing that string kernels are the most suitable choice for the problem at hand.
Keywords
"Kernel","Support vector machines","Proteins","Sensitivity","Bioinformatics","Training data"
Publisher
ieee
Conference_Titel
Signal Processing, Images and Computer Vision (STSIVA), 2015 20th Symposium on
Type
conf
DOI
10.1109/STSIVA.2015.7330400
Filename
7330400
Link To Document