DocumentCode
2597601
Title
The Generalized Condensed Nearest Neighbor Rule as A Data Reduction Method
Author
Chou, Chien-Hsing ; Kuo, Bo-Han ; Chang, Fu
Author_Institution
Inst. of Inf. Sci., Acad. Sinica, Taipei
Volume
2
fYear
0
fDate
0-0 0
Firstpage
556
Lastpage
559
Abstract
In this paper, we propose a new data reduction algorithm that iteratively selects some samples and ignores others that can be absorbed, or represented, by those selected. This algorithm differs from the condensed nearest neighbor (CNN) rule in its employment of a strong absorption criterion, in contrast to the weak criterion employed by CNN; hence, it is called the generalized CNN (GCNN) algorithm. The new criterion allows GCNN to incorporate CNN as a special case, and can achieve consistency, or asymptotic Bayes-risk efficiency, under certain conditions. GCNN, moreover, can yield significantly better accuracy than other instance-based data reduction methods. We demonstrate the last claim through experiments on five datasets, some of which contain a very large number of samples
Keywords
Bayes methods; data reduction; asymptotic Bayes-risk efficiency; data reduction; generalized condensed nearest neighbor algorithm; generalized condensed nearest neighbor rule; Absorption; Cellular neural networks; Clustering algorithms; Employment; Information science; Iterative algorithms; Learning systems; Nearest neighbor searches; Prototypes; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location
Hong Kong
ISSN
1051-4651
Print_ISBN
0-7695-2521-0
Type
conf
DOI
10.1109/ICPR.2006.1119
Filename
1699266
Link To Document