DocumentCode :
797062
Title :
Multiclass MTS for Simultaneous Feature Selection and Classification
Author :
Su, Chao-Ton ; Hsiao, Yu-Hsiang
Author_Institution :
Nat. Tsing Hua Univ., Hsinchu
Volume :
21
Issue :
2
fYear :
2009
Firstpage :
192
Lastpage :
205
Abstract :
Multiclass Mahalanobis-Taguchi system (MMTS), the extension of MTS, is developed for simultaneous multiclass classification and feature selection. In MMTS, the multiclass measurement scale is constructed by establishing an individual Mahalanobis space for each class. To increase the validity of the measurement scale, the Gram-Schmidt process is performed to mutually orthogonalize the features and eliminate the multicollinearity. The important features are identified using the orthogonal arrays and the signal-to-noise ratio, and are then used to construct a reduced model measurement scale. The contribution of each important feature to classification is also derived according to the effect gain to develop a weighted Mahalanobis distance which is finally used as the distance metric for the classification of MMTS. Using the reduced model measurement scale, an unknown example will be classified into the class with minimum weighted Mahalanobis distance considering only the important features. For evaluating the effectiveness of MMTS, a numerical experiment is implemented, and the results show that MMTS outperforms other well-known algorithms not only on classification accuracy but also on feature selection efficiency. Finally, a real case about gestational diabetes mellitus is studied, and the results indicate the practicality of MMTS in real-world applications.
Keywords :
Taguchi methods; data analysis; data mining; Gram-Schmidt process; Mahalanobis space; gestational diabetes mellitus; multiclass Mahalanobis-Taguchi system; orthogonal arrays; signal-to-noise ratio; simultaneous feature selection; weighted Mahalanobis distance; Classification; Clustering; Data mining; Gram-Schmidt orthogonalization process; Mahalanobis-Taguchi system (MTS); Mining methods and algorithms; and association rules; classification; feature selection; gestational diabetes mellitus.; multiclass problem; weighted Mahalanobis distance;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2008.128
Filename :
4564454
Link To Document :
بازگشت