Title :
Adaptive boosting with leader based learners for classification of large handwritten data
Author :
Babu, T. Ravindra ; Murty, M. Narasimha ; Agrawal, V.K.
Author_Institution :
Dept. of Comput. Sci. & Autom., Indian Inst. of Sci., Bangalore, India
Abstract :
Boosting is a general method for improving the accuracy of a learning algorithm. AdaBoost, short form for adaptive boosting method, consists of repeated use of a weak or a base learning algorithm to find corresponding weak hypothesis by adapting to the error rates of the individual weak hypotheses. A large, complex handwritten data is under study. A repeated use of weak learner on the huge data results in large amount of processing time. In view of this, instead of using the entire training data for learning, we propose to use only prototypes. Further, in the current work, the base learner consists of a nearest neighbour classifier that employs prototypes generated using "leader" clustering algorithm. The leader algorithm is a single pass algorithm and is linear in terms of time as well as computation complexity. The prototype set alone is used as training data. In the process of developing an algorithm, domain knowledge of the Handwritten data, which is under study, is made use of. With the fusion of clustering, prototype selection, AdaBoost and Nearest Neighbour classifier, a very high classification accuracy, which is better than reported earlier on the considered data, is obtained in less number of iterations. The procedure integrates clustering outcome in terms of prototypes with boosting.
Keywords :
computational complexity; handwriting recognition; learning (artificial intelligence); pattern classification; pattern clustering; very large databases; AdaBoost; Nearest Neighbour classifier; adaptive boosting method; computation complexity; large handwritten data classification; learning algorithm; Boosting; Clustering algorithms; Error analysis; Error correction; Machine learning; Machine learning algorithms; Prototypes; Satellites; Testing; Training data;
Conference_Titel :
Hybrid Intelligent Systems, 2004. HIS '04. Fourth International Conference on
Print_ISBN :
0-7695-2291-2
DOI :
10.1109/ICHIS.2004.15