Title :
Estimating optimal feature subsets using efficient estimation of high-dimensional mutual information
Author :
Chow, Tommy W S ; Huang, D.
Author_Institution :
City Univ. of Hong Kong, China
Abstract :
A novel feature selection method using the concept of mutual information (MI) is proposed in this paper. In all MI based feature selection methods, effective and efficient estimation of high-dimensional MI is crucial. In this paper, a pruned Parzen window estimator and the quadratic mutual information (QMI) are combined to address this problem. The results show that the proposed approach can estimate the MI in an effective and efficient way. With this contribution, a novel feature selection method is developed to identify the salient features one by one. Also, the appropriate feature subsets for classification can be reliably estimated. The proposed methodology is thoroughly tested in four different classification applications in which the number of features ranged from less than 10 to over 15000. The presented results are very promising and corroborate the contribution of the proposed feature selection methodology.
Keywords :
feature extraction; pattern classification; Parzen window estimator; feature selection; optimal feature subset estimation; quadratic mutual information; Data compression; Distributed computing; Filters; Higher order statistics; Histograms; Mutual information; Scalability; Statistical distributions; Testing; Two dimensional displays; Feature selection; Parzen window estimator; quadratic mutual information (QMI); supervised data compression; Algorithms; Artificial Intelligence; Cluster Analysis; Computing Methodologies; Information Storage and Retrieval; Pattern Recognition, Automated;
Journal_Title :
Neural Networks, IEEE Transactions on
DOI :
10.1109/TNN.2004.841414