DocumentCode
624131
Title
Classification model with subspace data-dependent balls
Author
Klakhaeng, Nattapon ; Kangkachit, Thanapat ; Rakthanmanon, Thanawin ; Waiyamai, Kitsana
Author_Institution
Dept. of Comput. Eng., Kasetsart Univ., Bangkok, Thailand
fYear
2013
fDate
29-31 May 2013
Firstpage
211
Lastpage
216
Abstract
Data-Dependent Ball (DDB) is a pre-processing algorithm that transforms quantitative into binary data by mapping them into a set of balls. In datasets with large number of dimensions, data-dependent balls are less significant due to the distance calculation in the mapping process. To reduce number of ball dimensions, this paper proposes a method for subspace data-dependent balls (SDDB) generation. SDDB starts by ranking features using information gain, and then eliminating input features based on ratio r. Subspace data-dependent balls are then created and filtered out with respect to their size and purity. Finally, a C4.5 decision tree classification model is constructed using subspace data-dependent balls as features. Experimental results from 8 TICI datasets show that the accuracy from a combination of SDDB and C4.5 is better than the combination of DDB and C4.5 in terms of accuracy.
Keywords
decision trees; distance measurement; pattern classification; C4.5 decision tree classification model; DDB; SDDB generation; TICI datasets; ball dimension reduction; binary data; distance calculation; feature elimination; information gain; pre-processing algorithm; quantitative data; subspace data-dependent ball generation; Accuracy; Classification algorithms; Decision trees; Glass; Ionosphere; Training; Transforms; classification; feature selections; subspace data-dependent balls;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Software Engineering (JCSSE), 2013 10th International Joint Conference on
Conference_Location
Maha Sarakham
Print_ISBN
978-1-4799-0805-9
Type
conf
DOI
10.1109/JCSSE.2013.6567347
Filename
6567347
Link To Document