DocumentCode :
1582226
Title :
Zone content classification and its performance evaluation
Author :
Wang, Yalin ; Haralick, Robert ; Phillips, Ihsin T.
Author_Institution :
Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
fYear :
2001
fDate :
2001
Firstpage :
540
Lastpage :
544
Abstract :
We present an improved zone content classification method and its performance evaluation. We added two new features to the feature vector of a previously published method (Sivaramakrishnan et al., 1995). We assumed different independence relationships in two zone sets. We used an optimized binary decision tree to estimate the maximum zone content class probability in one set, and the Viterbi algorithm to find the optimal solution for a zone sequence in the other set. The training, pruning, and testing data sets for the algorithm comprise 1,600 images drawn from the UWCDROM III document image database. The classifier assigns each given scientific and technical document zone to one of nine classes: two text classes (font sizes 4-18 pt and 19-32 pt), math, table, halftone, map/drawing, ruling, logo, and others. Compared with our previous work (Wang et al., 2000), the method raised the accuracy rate from 97.53% to 98.52% and reduced the mean false alarm rate from 1.26% to 0.53%.
Keywords :
decision trees; document image processing; image classification; performance evaluation; probability; visual databases; UWCDROM III database; Viterbi algorithm; document classification; document image database; feature vector; font size; independence relationship; maximum zone content class probability; optimized binary decision tree; zone content classification; Classification tree analysis; Computer science; Educational institutions; Equations; Hidden Markov models; Image databases; Optical character recognition software; Testing
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953847
Filename :
953847