DocumentCode :
3758610
Title :
Classification of peptides using ensembles by applying different strategies that deal with imbalanced data and combination rules
Author :
C. Lastre- Dom?nguez;P. Rond?n-Villarreal;D. A. Sierra
Author_Institution :
Escuela de Ingenier?as El?ctrica, Electr?nica y de Telecomunicaciones, Universidad Industrial de Santander, Bucaramanga, Colombia
fYear :
2015
Firstpage :
1
Lastpage :
6
Abstract :
The discovery and synthesis of peptides with antimicrobial properties is a promising alternative to fight against multi-resistant bacteria. There are multiple studies that deal with the classification of peptides according with their probability to possess antimicrobial activity. One of the challenges in these classification processes is related with the amount of available data. For the case of antibacterial peptides classifiers, the size of the positive class is much bigger than the negative class. In this work, we propose two strategies to deal with the imbalance situation of the data by using ensembles. The first one is based on algorithm modifications and the second one with data management. For each strategy we used five combination rules. The performance of the ensembles was calculated using the area under the ROC curve (AUC). Our results suggest that care must be taken with ensembles and that individual classifiers must be studied in-depth.
Keywords :
"Support vector machines","Media","Silicon compounds","Peptides","Classification algorithms","Electronic mail","Microorganisms"
Publisher :
ieee
Conference_Titel :
Central American and Panama Convention (CONCAPAN XXXV), 2015 IEEE Thirty Fifth
Type :
conf
DOI :
10.1109/CONCAPAN.2015.7428467
Filename :
7428467
Link To Document :
بازگشت