DocumentCode
1636046
Title
A Hierarchical Classification Model for Document Categorization
Author
Xu, Jian-Wu ; Singh, Vartika ; Govindaraju, Venu ; Neogi, Depankar
Author_Institution
Copanion Inc., Andover, MA, USA
fYear
2009
Firstpage
486
Lastpage
490
Abstract
We propose a novel hierarchical classification method for documents categorization in this paper. The approach consists of multiple levels of classification for different hierarchies. Regularized Least Square (RLS)binary classifiers are applied in the middle levels of the hierarchy to classify documents into smaller set of categories and K-nearest-neighbor (KNN) multi-class classifiers are used at the bottom to classify documents into final classes. Experiments on large-scale real world tax documents show that the proposed hierarchical approach outperforms traditional flat classification method.
Keywords
document image processing; image classification; least squares approximations; document categorization; hierarchical classification model; k-nearest-neighbor multiclass classifier; regularized least square binary classifier; Large-scale systems; Least squares methods; Classifier Combination; Document Classification; Hierarchical Classification; Multiple-classifiers;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location
Barcelona
ISSN
1520-5363
Print_ISBN
978-1-4244-4500-4
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2009.187
Filename
5277620
Link To Document