DocumentCode :
1558995
Title :
Input feature selection for classification problems
Author :
Kwak, Nojun ; Choi, Chong-Ho
Author_Institution :
Sch. of Electr. Eng., Seoul Nat. Univ., South Korea
Volume :
13
Issue :
1
fYear :
2002
fDate :
1/1/2002 12:00:00 AM
Firstpage :
143
Lastpage :
159
Abstract :
Feature selection plays an important role in classifying systems such as neural networks (NNs). We use a set of attributes which are relevant, irrelevant or redundant and from the viewpoint of managing a dataset which can be huge, reducing the number of attributes by selecting only the relevant ones is desirable. In doing so, higher performances with lower computational effort is expected. In this paper, we propose two feature selection algorithms. The limitation of mutual information feature selector (MIFS) is analyzed and a method to overcome this limitation is studied. One of the proposed algorithms makes more considered use of mutual information between input attributes and output classes than the MIFS. What is demonstrated is that the proposed method can provide the performance of the ideal greedy selection algorithm when information is distributed uniformly. The computational load for this algorithm is nearly the same as that of MIFS. In addition, another feature selection algorithm using the Taguchi method is proposed. This is advanced as a solution to the question as to how to identify good features with as few experiments as possible. The proposed algorithms are applied to several classification problems and compared with MIFS. These two algorithms can be combined to complement each other´s limitations. The combined algorithm performed well in several experiments and should prove to be a useful method in selecting features for classification problems
Keywords :
neural nets; pattern classification; Taguchi method; classifying systems; feature selection; mutual information feature selector; neural networks; Biological neural networks; Data mining; Decision trees; Educational programs; Educational technology; High performance computing; Information analysis; Mutual information; Neural networks; Principal component analysis;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/72.977291
Filename :
977291
Link To Document :
بازگشت