Title :
A Dilemma in Assessing Stability of Feature Selection Algorithms
Author :
Alelyani, Salem ; Zhao, Zheng ; Liu, Huan
Author_Institution :
Dept. of Comput. Sci. & Eng., Arizona State Univ., Tempe, AZ, USA
Abstract :
In realm, feature selection is an effective means for handling high-dimensional data that becomes increasingly abundant. The stability of a feature selection algorithm is becoming crucial for determining the fitness of the algorithm. Below, we review existing methods of stability assessment and analyse how they assess the stability of a feature selection algorithm. A common approach is to evaluate the similarity between the selected subsets of features produced by that algorithm over different training samples or over distributed datasets. We point out challenges facing the existing evaluation methods and suggest how to improve stability assessment of feature selection algorithms.
Keywords :
data handling; distributed datasets; feature selection algorithm; high-dimensional data handling; stability assessment; Algorithm design and analysis; Classification algorithms; Current measurement; Indexes; Stability criteria; Training; Jaccard Index; algorithm´s stability; distributed datasets; feature selection; stability assessment;
Conference_Titel :
High Performance Computing and Communications (HPCC), 2011 IEEE 13th International Conference on
Conference_Location :
Banff, AB
Print_ISBN :
978-1-4577-1564-8
Electronic_ISBN :
978-0-7695-4538-7
DOI :
10.1109/HPCC.2011.99