Title of article :
Using OVA modeling to improve classification performance for large datasets
Author/Authors :
Lutu، نويسنده , , Patricia E.N. and Engelbrecht، نويسنده , , Andries P. Engelbrecht، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2012
Pages :
19
From page :
4358
To page :
4376
Abstract :
One-Versus-All (OVA) classification is a classifier construction method where a k-class prediction task is decomposed into k 2-class sub-problems. One base model is constructed for each sub-problem and the base models are then combined into one model. Aggregate model implementation is the process of constructing several base models which are then combined into a single model for prediction. In essence, OVA classification is a method of aggregate modeling. This paper reports studies that were conducted to establish whether OVA classification can provide predictive performance gains when large volumes of data are available for modeling as is commonly the case in data mining. It is demonstrated in this paper that firstly, OVA modeling can be used to increase the amount of training data while at the same time using base model training sets whose size is much smaller than the total amount of available training data. Secondly, OVA models created from large datasets provide a higher level of predictive performance compared to single k-class models. Thirdly, the use of boosted OVA base models can provide higher predictive performance compared to un-boosted OVA base models. Fourthly, when the combination algorithm for base model predictions is able to resolve tied predictions, the resulting aggregate models provide a higher level of predictive performance.
Keywords :
Dataset sampling , ROC analysis , Model aggregation , OVA classification , Boosting , Dataset selection , Dataset partitioning , Ensemble classification
Journal title :
Expert Systems with Applications
Serial Year :
2012
Journal title :
Expert Systems with Applications
Record number :
2351459
Link To Document :
بازگشت