Title :
A study on different backward feature selection criteria over high-dimensional databases
Author :
Bermejo, Pablo ; De La Ossa, Luis ; Gamez, Jose A. ; Puerta, Jose M.
Author_Institution :
Computings Syst. Dept., Univ. of Castilla-La Mancha, Albacete, Spain
Abstract :
Feature subset selection has become an expensive process due to the relatively recent appearance of high-dimensional databases. Thus, not only the need has arisen for reducing the dimensionality of these datasets, but also for doing it in an efficient way. We propose a new backward search, where attributes are removed given several smart criteria found in the literature and, besides, it is guided using a heuristic which reduces the cost and needed number of evaluations commonly expected from a backward search. Besides, we do not only propose the design of a new forward-backward algorithm but we also provide an experimental study of different criteria to decide the removal of attributes. The result is a very competitive algorithm which does not exceed the in-practice linear complexity while obtaining selected subsets of features with lower cardinality than other state-of-the-art algorithms.
Keywords :
classification; computational complexity; database management systems; information retrieval; backward feature selection criteria; backward search; cardinality; competitive algorithm; dimensionality; feature subset selection; forward-backward algorithm; high-dimensional databases; in-practice linear complexity; state-of-the-art algorithms; Algorithm design and analysis; Complexity theory; Databases; Frequency selective surfaces; Measurement; Proposals; Signal processing algorithms; feature; high-dimensional; hybrid; selection; sequential;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2011 11th International Conference on
Conference_Location :
Cordoba
Print_ISBN :
978-1-4577-1676-8
DOI :
10.1109/ISDA.2011.6121839