DocumentCode
2484216
Title
Document-zone classification using partial least squares and hybrid classifiers
Author
Abd-Almageed, Wael ; Agrawal, Mudit ; Seo, Wontaek ; Doermann, David
Author_Institution
Inst. for Adv. Studies, Univ. of Maryland, College Park, MD
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
This paper introduces a novel document-zone classification algorithm. Low level image features are first extracted from document zones and partial least squares is used on pairs of classes to compute discriminating pairwise features. Rather than using the popular one-against-all and one-against-one voting schemes, we introduce a novel hybrid method which combines the benefits of the two schemes. The algorithm is applied on the University of Washington dataset and 97.3% classification accuracy is obtained.
Keywords
document image processing; feature extraction; image classification; least squares approximations; support vector machines; SVM classifier; document-zone classification algorithm; image feature extraction; one-against-all voting scheme; one-against-one voting scheme; partial least squares; Classification algorithms; Data mining; Educational institutions; Feature extraction; Image segmentation; Least squares methods; Optical character recognition software; Support vector machine classification; Support vector machines; Voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761553
Filename
4761553
Link To Document