DocumentCode :
2909599
Title :
Data condensation in large databases by incremental learning with support vector machines
Author :
Mitra, Pabitra ; Murthy, C.A. ; Pal, Sankar K.
Author_Institution :
Machine Intelligence Unit, Indian Stat. Inst., Calcutta, India
Volume :
2
fYear :
2000
fDate :
2000
Firstpage :
708
Abstract :
An algorithm for data condensation using support vector machines (SVM) is presented. The algorithm extracts data points lying close to the class boundaries, which form a much reduced but critical set for classification. The problem of large memory requirements for training SVM in batch mode is circumvented by adopting an active incremental learning algorithm. The learning strategy is motivated from the condensed nearest neighbor classification technique. Experimental results presented show that such active incremental learning enjoy superiority in terms of computation time and condensation ratio, over related methods
Keywords :
computational complexity; data warehouses; learning (artificial intelligence); learning automata; pattern classification; SVM; active incremental learning algorithm; batch mode training; class boundaries; computation time; condensed nearest neighbor classification technique; data condensation; data point extraction; incremental learning; large databases; large memory requirements; pattern classification; support vector machines; Data mining; Databases; Machine intelligence; Machine learning; Machine learning algorithms; Nearest neighbor searches; Quadratic programming; Sampling methods; Support vector machine classification; Support vector machines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location :
Barcelona
ISSN :
1051-4651
Print_ISBN :
0-7695-0750-6
Type :
conf
DOI :
10.1109/ICPR.2000.906173
Filename :
906173
Link To Document :
بازگشت