Title :
Fast dependency-aware feature selection in very-high-dimensional pattern recognition
Author :
Somol, Petr ; Grim, JiYí ; Pudil, Pavel
Author_Institution :
Dept. of Pattern Recognition, Inst. of Inf. Theor. & Autom., Prague, Czech Republic
Abstract :
The paper addresses the problem of making dependency-aware feature selection feasible in pattern recognition problems of very high dimensionality. The idea of individually best ranking is generalized to evaluate the contextual quality of each feature in a series of randomly generated feature subsets. Each random subset is evaluated by a criterion function of arbitrary choice (permitting functions of high complexity). Eventually, the novel dependency-aware feature rank is computed, expressing the average benefit of including a feature into feature subsets. The method is efficient and generalizes well especially in very-high-dimensional problems, where traditional context-aware feature selection methods fail due to prohibitive computational complexity or to over-fitting. The method is shown well capable of over-performing the commonly applied individual ranking which ignores important contextual information contained in data.
Keywords :
computational complexity; feature extraction; learning (artificial intelligence); set theory; ubiquitous computing; context-aware feature selection method; dependency-aware feature rank; dependency-aware feature selection; prohibitive computational complexity; randomly generated feature subset; very high dimensionality; very-high-dimensional pattern recognition; Accuracy; Computational complexity; Context; Optimization; Pattern recognition; Probes; classification; feature selection; generalization; high dimensionality; machine learning; over-fitting; pattern recognition; ranking; stability;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4577-0652-3
DOI :
10.1109/ICSMC.2011.6083733