Title :
A New Measure of Feature Selection Algorithms´ Stability
Author :
Novovicova, J. ; Somol, Petr ; Pudil, Pavel
Author_Institution :
Dept. of Pattern Recognition, Acad. of Sci. of the Czech Republic, Prague, Czech Republic
Abstract :
Stability or robustness of feature selection methods is a topic of recent interest. A new stability measure based on the Shannon entropy is proposed in this paper to evaluate the overall occurrence of individual features in selected subsets of possibly varying cardinality. We compare the new measure to stability measures proposed recently by Somol et al. The new measure is computationally very efficient and adds another type of insight into the stability problem. All considered measures have been used to compare the stability of several feature selection methods (individually best ranking, sequential forward selection, sequential forward floating selection and dynamic oscillating search) on a set of examples.
Keywords :
data mining; entropy; search problems; Shannon entropy; dynamic oscillating search; feature selection algorithm; sequential forward floating selection; sequential forward selection; stability measure; Area measurement; Automation; Computational complexity; Conferences; Data mining; Data preprocessing; Entropy; Information theory; Pattern recognition; Robust stability;
Conference_Titel :
Data Mining Workshops, 2009. ICDMW '09. IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5384-9
Electronic_ISBN :
978-0-7695-3902-7
DOI :
10.1109/ICDMW.2009.32