• DocumentCode
    1115125
  • Title

    Document image decoding using Markov source models

  • Author

    Kopec, Gary E. ; Chou, Philip A.

  • Author_Institution
    Inf. Sci. & Technol. Lab., Xerox Palo Alto Res. Center, CA, USA
  • Volume
    16
  • Issue
    6
  • fYear
    1994
  • fDate
    6/1/1994 12:00:00 AM
  • Firstpage
    602
  • Lastpage
    617
  • Abstract
    Document image decoding (DID) is a communication theory approach to document image recognition. In DID, a document recognition problem is viewed as consisting of three elements: an image generator, a noisy channel and an image decoder. A document image generator is a Markov source (stochastic finite-state automaton) that combines a message source with an imager. The message source produces a string of symbols, or text, that contains the information to be transmitted. The imager is modeled as a finite-state transducer that converts the 1D message string into an ideal 2D bitmap. The channel transforms the ideal image into a noisy observed image. The decoder estimates the message, given the observed image, by finding the a posteriori most probable path through the combined source and channel models using a Viterbi-like dynamic programming algorithm. The proposed approach is illustrated on the problem of decoding scanned telephone yellow pages to extract names and numbers from the listings. A finite-state model for yellow page columns was constructed and used to decode a database of scanned column images containing about 1100 individual listings
  • Keywords
    document image processing; dynamic programming; hidden Markov models; image coding; 1D message string; 2D bitmap; Markov source models; Viterbi-like dynamic programming; channel models; communication theory; decoder; document image decoding; document image recognition; finite state model; message source; stochastic finite state automaton; Automata; Decoding; Dynamic programming; Heuristic algorithms; Image converters; Image generation; Image recognition; Noise generators; Stochastic processes; Transducers;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.295905
  • Filename
    295905