• DocumentCode
    419654
  • Title
    Decoder banks: versatility, automation, and high accuracy without supervised training
  • Author
    Sarkar, Prateek ; Baird, Henry S.
  • Author_Institution
    Palo Alto Res. Center, Palo Alto, CA, USA
  • Volume
    2
  • fYear
    2004
  • fDate
    23-26 Aug. 2004
  • Firstpage
    646
  • Abstract
    A methodology using decoder banks is proposed for high-accuracy, fully automatic recognition of machine-printed text across a wide range of challenging image qualities, without requiring manual intervention or supervised training. This approach is made possible by two crucial properties of document image decoding (DID) technology: (1) it is trainable for high accuracy across a wide range of explicitly parameterized image degradations; and (2) decoders for arbitrary parameter settings can be generated automatically. We report the results of large-scale experiments on synthetic images which demonstrate that, when many pretrained decoders are applied in parallel to an input image with unknown parameters, the decoder that yields the highest accuracy is often the one that exhibits the highest DID posterior 'Viterbi score'. When implemented naively, in a brute-force manner, decoder banks are computationally intensive, but we suggest ways that this cost may be reduced with no loss of versatility, automation, or accuracy.
  • Keywords
    Viterbi decoding; character recognition; Viterbi score; decoder banks; document image decoding technology; fully automatic machine printed text recognition; Automation; Costs; Decoding; Degradation; Error analysis; Focusing; Image quality; Image recognition; Testing; Text recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2128-2
  • Type
    conf
  • DOI
    10.1109/ICPR.2004.1334331
  • Filename
    1334331
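
The selection rule described in the abstract (apply many pretrained decoders to one image and keep the output of the decoder with the highest Viterbi score) can be illustrated with a minimal sketch. This is not the authors' implementation: the decoder interface, the names `decode_with_bank` and `make_toy_decoder`, and the scores are all hypothetical stand-ins, and the loop is the naive brute-force form that the abstract notes is computationally intensive.

```python
from typing import Callable, Dict, Tuple

# Hypothetical interface: a decoder maps an image to (decoded_text, viterbi_score).
# A real DID decoder would compute the score from its degradation model.
Decoder = Callable[[object], Tuple[str, float]]


def make_toy_decoder(text: str, score: float) -> Decoder:
    """Build a stand-in decoder that ignores the image and returns fixed output."""
    def decoder(image: object) -> Tuple[str, float]:
        return text, score
    return decoder


def decode_with_bank(image: object, bank: Dict[str, Decoder]) -> Tuple[str, str, float]:
    """Run every decoder in the bank and return (name, text, score) of the top scorer."""
    if not bank:
        raise ValueError("decoder bank is empty")
    best_name, best_text, best_score = "", "", float("-inf")
    for name, decoder in bank.items():
        text, score = decoder(image)
        # Keep the decoding whose score is highest so far.
        if score > best_score:
            best_name, best_text, best_score = name, text, score
    return best_name, best_text, best_score


if __name__ == "__main__":
    # Each key labels an assumed degradation-parameter setting.
    bank = {
        "blur=0.5": make_toy_decoder("hel1o", -42.0),
        "blur=1.0": make_toy_decoder("hello", -17.5),
    }
    print(decode_with_bank(None, bank))
```

Because the decoders in this sketch are independent of one another, the loop could be run in parallel; the abstract reports only that the top-scoring decoder is often, not always, the most accurate one.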