• DocumentCode
    2031120
  • Title

    Document image decoding using Markov source models

  • Author

    Kopec, Gary E. ; Chou, Phil A.

  • Author_Institution
    Xerox Palo Alto Res. Center, CA, USA
  • Volume
    5
  • fYear
    1993
  • fDate
    27-30 April 1993
  • Firstpage
    85
  • Abstract
    The authors describe a communication theory approach to document image reconstruction, patterned after the use of hidden Markov models in speech recognition. A document recognition problem is viewed as consisting of three elements-an image generator, a noisy channel, and an image decoder. A document image generator is a Markov source which combines a message source with an imager. The message source produces a string of symbols which contains the information to be transmitted. The imager is modeled as a finite-state transducer, which converts the message into an ideal bitmap. The channel transforms the ideal image into a noisy observed image. The decoder estimates the message from the observed image by finding the a posteriori most probable path through the combined source and channel models using a Viterbi-like algorithm. Application of the proposed method to decoding telephone yellow pages is described.<>
  • Keywords
    decoding; document image processing; finite state machines; hidden Markov models; image recognition; Markov source models; Viterbi-like algorithm; document image generator; document image reconstruction; finite-state transducer; ideal bitmap; image decoder; noisy channel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
  • Conference_Location
    Minneapolis, MN, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1993.319753
  • Filename
    319753