Title of article :
Combining instance selection methods based on data characterization: An approach to increase their effectiveness
Author/Authors :
Yoel Caises، نويسنده , , Antonio Gonz?lez، نويسنده , , Enrique Leyva، نويسنده , , Ra?l Pérez، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2011
Pages :
19
From page :
4780
To page :
4798
Abstract :
Although there are several proposals in the instance selection field, none of them consistently outperforms the others over a wide range of domains. In recent years many authors have come to the conclusion that data must be characterized in order to apply the most suitable selection criterion in each case. In light of this hypothesis, herein we propose a set of measures to characterize databases. These measures were used in decision rules which, given their values for a database, select from some pre-selected methods, the method, or combination of methods, that is expected to produce the best results. The rules were extracted based on an empirical analysis of the behaviors of several methods on several data sets, then integrated into an algorithm which was experimentally evaluated over 20 databases and with six different learning paradigms. The results were compared with those of five well-known state-of-the-art methods.
Keywords :
Prototype selection , Instance selection , data reduction , Machine Learning
Journal title :
Information Sciences
Serial Year :
2011
Journal title :
Information Sciences
Record number :
1214704
Link To Document :
بازگشت