DocumentCode
384090
Title
A method for document zone content classification
Author
Wang, Yalin ; Phillips, Ihsin T. ; Haralick, Robert M.
Author_Institution
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
Volume
3
fYear
2002
fDate
2002
Firstpage
196
Abstract
This paper describes an algorithm that classifies each given document zone into one of nine classes and provides a protocol for evaluating its performance. The classification scheme uses an optimized binary decision tree and the Viterbi algorithm for a hidden Markov model (HMM) to find the optimal solution. The algorithm was trained and tested on a total of 24,177 zones in the 1,600 images of the UWCDROM III database. Its accuracy rate is 98.45%, with a mean false alarm rate of 0.50%.
Keywords
binary decision diagrams; decision trees; document image processing; hidden Markov models; image classification; image segmentation; performance evaluation; visual databases; HMM; UWCDROM III database; Viterbi algorithm; document zone content classification; false alarm rate; hidden Markov model; optimized binary decision tree; visual database; Classification tree analysis; Context modeling; Educational institutions; Image databases; Optimization methods; Spatial databases; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Pattern Recognition, 2002. Proceedings. 16th International Conference on
ISSN
1051-4651
Print_ISBN
0-7695-1695-X
Type
conf
DOI
10.1109/ICPR.2002.1047828
Filename
1047828