Title :
Irrelevant Features, Class Separability, and Complexity of Classification Problems
Author_Institution :
Dept. of Comput. Sci. & Inf. Syst., Univ. of Jyvaskyla, Jyvaskyla, Finland
Abstract :
In this paper, analysis of class separability measures is performed in attempt to relate their descriptive abilities to geometrical properties of classification problems in presence of irrelevant features. The study is performed on synthetic and benchmark data with known irrelevant features and other characteristics of interest, such as class boundaries, shapes, margins between classes, and density. The results have shown that some measures are individually informative, while others are less reliable and only can provide complimentary information. Classification problem complexity measurements on selected data sets are made to gain additional insights on the obtained results.
Keywords :
computational complexity; data mining; pattern classification; benchmark data; class separability measures; classification problem complexity measurements; feature relevance; synthetic data; Complexity theory; Covariance matrix; Density measurement; Error analysis; Feature extraction; Measurement uncertainty; Shape; class separability; feature relevance; geometrical complexity of classification problems; local feature weighting;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4577-2068-0
Electronic_ISBN :
1082-3409
DOI :
10.1109/ICTAI.2011.171