• DocumentCode
    3174254
  • Title

    A family of European page readers

  • Author

    Baird, Henry S. ; Ilbert, Derrickg ; Ittner, Davidj

  • Author_Institution
    AT&T Bell Labs., Murray Hill, NJ, USA
  • Volume
    2
  • fYear
    1994
  • fDate
    9-13 Oct 1994
  • Firstpage
    540
  • Abstract
    We have demonstrated a high degree of automation in the engineering of complex machine vision systems, by building ten printed-text page readers, each specialized to a European language, at the pace of one language per week. The page readers provide these functions: page layout analysis, polyfont symbol recognition, typographical morphology, lexicon-driven contextual analysis, and Unicode output encoding. The accuracy and speed of the resulting readers are usably high, and can be easily improved if required by comparatively routine enhancements of subsystems. This exercise illustrates the advantages of a research strategy that emphasizes versatility before, but not at the expense of, accuracy and speed
  • Keywords
    document image processing; European language; European page readers; Unicode output encoding; complex machine vision systems; lexicon-driven contextual analysis; page layout analysis; polyfont symbol recognition; printed-text page readers; typographical morphology; Automation; Computer architecture; Design engineering; Encoding; Machine vision; Morphology; Natural languages; Prototypes; Runtime; System software;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 1994. Vol. 2 - Conference B: Computer Vision & Image Processing., Proceedings of the 12th IAPR International. Conference on
  • Conference_Location
    Jerusalem
  • Print_ISBN
    0-8186-6270-0
  • Type

    conf

  • DOI
    10.1109/ICPR.1994.577014
  • Filename
    577014