DocumentCode
3692962
Title
A comparison of class-balance strategies for SVM in the problem of protein function prediction
Author
Luis Roberto Mercado-Díaz;Julián Navarro-García;Jorge Alberto Jaramillo-Garzón
Author_Institution
Grupo de Automá
fYear
2015
Firstpage
1
Lastpage
5
Abstract
This paper presents a comparison of three strategies for managing the imbalance problem: undersampling, SMOTE and Weighted SVM. Undersampling is a strategy where the samples of the majority class are discarded; SMOTE (Synthetic Minority Over-sampling Technique) is a method in which synthetic samples of the minority class are added to the dataset; Weighted SVM keeps the number of samples of each class but assigns weights in the training of the SVM, so it can improve its performance for the minority class. Results show that Weighted SVM and SMOTE achieved comparable results, although Weighted SVM requires less computational effort. Undersampling, on the other hand, achieved lower performance results presumably due to the loss of information produced when discarding data.
Keywords
"Support vector machines","Sensitivity","Yttrium","Bioinformatics"
Publisher
ieee
Conference_Titel
Signal Processing, Images and Computer Vision (STSIVA), 2015 20th Symposium on
Type
conf
DOI
10.1109/STSIVA.2015.7330418
Filename
7330418
Link To Document