• DocumentCode
    384090
  • Title
    A method for document zone content classification
  • Author
    Wang, Yalin ; Phillips, Ihsin T. ; Haralick, Robert M.
  • Author_Institution
    Dept. of Electr. Eng., Washington Univ., Seattle, WA, USA
  • Volume
    3
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    196
  • Abstract
    This paper describes an algorithm that classifies each given document zone into one of nine classes, and provides a protocol for evaluating its performance. The classification scheme uses an optimized binary decision tree and the Viterbi algorithm for a hidden Markov model (HMM) to find the optimal solution. The algorithm was trained and tested on a total of 24,177 zones from the 1,600 images in the UWCDROM III database. Its accuracy rate is 98.45%, with a mean false alarm rate of 0.50%.
  • Keywords
    binary decision diagrams; decision trees; document image processing; hidden Markov models; image classification; image segmentation; performance evaluation; visual databases; HMM; UWCDROM III database; Viterbi algorithm; document zone content classification; false alarm rate; hidden Markov model; optimized binary decision tree; performance evaluation; visual database; Classification tree analysis; Context modeling; Decision trees; Educational institutions; Hidden Markov models; Image databases; Optimization methods; Spatial databases; Testing; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Pattern Recognition, 2002. Proceedings. 16th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-1695-X
  • Type
    conf
  • DOI
    10.1109/ICPR.2002.1047828
  • Filename
    1047828
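
The abstract names the Viterbi algorithm for an HMM but does not say how it is combined with the decision tree. As a rough illustration only, the sketch below shows generic Viterbi decoding over a sequence of zones. It is not the authors' implementation: the function name `viterbi`, the two-label setup, and all probabilities are hypothetical, and the per-zone scores are assumed to come from some zone classifier.

```python
import math


def viterbi(log_init, log_trans, log_emit):
    """Return the most likely label sequence for an ordered list of zones.

    log_init[s]     : log prior of label s for the first zone
    log_trans[p][s] : log probability of label s following label p
    log_emit[t][s]  : log score of label s for zone t (assumed to come
                      from a per-zone classifier; hypothetical input)
    Assumes at least one zone.
    """
    n_states = len(log_init)
    # Best log score of any path ending in each label at the first zone.
    score = [log_init[s] + log_emit[0][s] for s in range(n_states)]
    back = []  # back[t-1][s] = best previous label for label s at zone t

    for t in range(1, len(log_emit)):
        new_score, pointers = [], []
        for s in range(n_states):
            best_prev = max(range(n_states),
                            key=lambda p: score[p] + log_trans[p][s])
            new_score.append(score[best_prev] + log_trans[best_prev][s]
                             + log_emit[t][s])
            pointers.append(best_prev)
        score = new_score
        back.append(pointers)

    # Trace back from the best final label.
    path = [max(range(n_states), key=lambda s: score[s])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Hypothetical example: label 0 = "text", label 1 = "table".
log = math.log
log_init = [log(0.5), log(0.5)]
log_trans = [[log(0.9), log(0.1)],
             [log(0.1), log(0.9)]]   # neighbouring zones tend to share a label
log_emit = [[log(0.9), log(0.1)],
            [log(0.4), log(0.6)],    # weakly favours "table" in isolation
            [log(0.9), log(0.1)]]
labels = viterbi(log_init, log_trans, log_emit)
```

In this made-up example the middle zone slightly favours label 1 on its own, but the transition model makes the all-label-0 sequence more probable overall, which is the kind of contextual correction an HMM can add on top of a per-zone classifier.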