Title :
Unsupervised Feature Selection: Minimize Information Redundancy of Features
Author :
Yen, Chun-Chao ; Chen, Liang-Chieh ; Lin, Shou-De
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
This paper proposes an unsupervised feature selection method that removes redundant features from datasets. The major contributions are twofold. First, we propose an eigen-decomposition method to rank the hyperplanes (which describe the relations between features) by their linear dependency, and then design an efficient Gaussian-elimination method to sequentially remove the feature that is best represented by the remaining features. Second, we prove that our method is similar to removing the features that contribute the most to the principal components with the smallest eigenvalues, while accounting for the effect of each feature removal, at a complexity of about max(O(nm), O(n^2)) instead of O(n^3), where n is the number of features and m is the number of observations. We perform experiments on an artificial dataset and on real-world datasets. The results show that our method almost perfectly removes the dependent features without losing any independent dimension in the artificial dataset, and outperforms two competitive algorithms on the real-world datasets.
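The idea in the abstract of removing the features that contribute most to the smallest-eigenvalue principal component can be illustrated with a minimal sketch. This is not the paper's efficient Gaussian-elimination procedure: it naively recomputes a full eigen-decomposition after every removal, which costs O(n^3) per step. The function name `remove_redundant_features` and its parameters are hypothetical, chosen only for this illustration.

```python
import numpy as np


def remove_redundant_features(X, n_remove):
    """Naive illustration (assumed, not the authors' algorithm).

    Repeatedly drops the feature with the largest absolute loading on the
    principal component that has the smallest eigenvalue, recomputing the
    decomposition after each removal.
    """
    X = np.asarray(X, dtype=float)
    remaining = list(range(X.shape[1]))  # original column indices still kept
    removed = []
    for _ in range(n_remove):
        sub = X[:, remaining]
        centered = sub - sub.mean(axis=0)
        cov = centered.T @ centered / (centered.shape[0] - 1)
        # eigh returns eigenvalues in ascending order, so column 0 of the
        # eigenvector matrix is the direction with the least variance.
        _, eigvecs = np.linalg.eigh(cov)
        weakest = eigvecs[:, 0]
        drop = int(np.argmax(np.abs(weakest)))
        removed.append(remaining.pop(drop))
    return remaining, removed


# Toy data: three independent columns plus a fourth that is their partial average.
rng = np.random.default_rng(0)
base = rng.normal(size=(200, 3))
dependent = 0.5 * base[:, 0] + 0.5 * base[:, 1]
data = np.column_stack([base, dependent])
kept, dropped = remove_redundant_features(data, n_remove=1)
```

In this toy setup the fourth column is an exact linear combination of the first two, so it carries the largest loading on the near-zero-variance direction and is the one the sketch drops.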
Keywords :
Gaussian processes; computational complexity; eigenvalues and eigenfunctions; feature extraction; principal component analysis; unsupervised learning; Gaussian elimination method; eigendecomposition method; eigenvalue; information redundancy minimization; linear dependency characteristic; redundant features removal; unsupervised feature selection; Gaussian-elimination; PCA; eigendecomposition
Conference_Title :
Technologies and Applications of Artificial Intelligence (TAAI), 2010 International Conference on
Conference_Location :
Hsinchu City
Print_ISBN :
978-1-4244-8668-7
Electronic_ISBN :
978-0-7695-4253-9
DOI :
10.1109/TAAI.2010.49