DocumentCode :
3126818
Title :
Classifying Categorical Data by Rule-Based Neighbors
Author :
Wang, Jiabing ; Zhang, Pei ; Wen, Guihua ; Wei, Jia
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
fYear :
2011
fDate :
11-14 Dec. 2011
Firstpage :
1248
Lastpage :
1253
Abstract :
A new learning algorithm for categorical data, named CRN (Classification by Rule-based Neighbors) is proposed in this paper. CRN is a nonmetric and parameter-free classifier, and can be regarded as a hybrid of rule induction and instance-based learning. Based on a new measure of attributes quality and the separate-and-conquer strategy, CRN learns a collection of feature sets such that for each pair of instances belonging to different classes, there is a feature set on which the two instances disagree. For an unlabeled instance I and a labeled instance I´, I´ is a neighbor of I if and only if they agree on all attributes of a feature set. Then, CRN classifies an unlabeled instance I based on I´s neighbors on those learned feature sets. To validate the performance of CRN, CRN is compared with six state-of-the-art classifiers on twenty-four datasets. Experimental results demonstrate that although the underlying idea of CRN is simple, the predictive accuracy of CRN is comparable to or better than that of the state-of-the-art classifiers on most datasets.
Keywords :
data handling; knowledge based systems; learning (artificial intelligence); pattern classification; categorical data classification; feature sets; instance-based learning; learning algorithm; nonmetric classifier; parameter-free classifier; rule induction; rule-based neighbors; separate-and-conquer strategy; Accuracy; Classification algorithms; Entropy; Impurities; Measurement; Training; categorical data; classification; feature selection; rule-based neighbors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
ISSN :
1550-4786
Print_ISBN :
978-1-4577-2075-8
Type :
conf
DOI :
10.1109/ICDM.2011.34
Filename :
6137346
Link To Document :
بازگشت