Title :
Classification improvement based on feature combination and topic vector model
Author :
Yeh, Jian-hua ; Lin, Chen ; Chang, Yuan-ling
Author_Institution :
Dept. of Comp. Sci. & Inf. Eng., Aletheia Univ., Taipei, Taiwan
Abstract :
In this paper, we demonstrate a feature processing procedure which emphasizes on the combination of original features with redundancy trimming steps. This procedure shows better classification result than traditional classification models. In our experiment, several key feature processing steps were proposed according to the type of the feature. These steps contains numerical to categorical feature value conversion, feature combination, feature redundancy discrimination, and latent structure discovery based on the concatenation of original features and extended feature set. The UCI machine learning repository is chosen as our demonstration to show the effect of our approach. In our preliminary result, it shows that the classification accuracy outperforms the traditional SVM classifier(SVM-only) while the ROC benchmark equals to the SVM-only scenario. This result is believed to be a promising one on the feature processing procedure research.
Keywords :
pattern classification; UCI machine learning repository; categorical feature value conversion; classification accuracy; classification improvement; classification model; feature combination; feature concatenation; feature processing procedure; feature redundancy discrimination; feature set; latent structure discovery; numerical feature value conversion; topic vector model; Accuracy; Computational modeling; Data models; Feature extraction; Machine learning; Redundancy; Support vector machine classification; clustering; feature combination; feature redundancy discrminination; latent topics;
Conference_Titel :
Systems and Informatics (ICSAI), 2012 International Conference on
Conference_Location :
Yantai
Print_ISBN :
978-1-4673-0198-5
DOI :
10.1109/ICSAI.2012.6223526