• DocumentCode
    344192
  • Title
    A new method of character line extraction from mixed-unformatted document image for Japanese mail address recognition
  • Author
    Wang, Xian; Tsutsumida, Toshio
  • Author_Institution
    Center of Excellence for Document Analysis & Recognition, State Univ. of New York, Buffalo, NY, USA
  • fYear
    1999
  • fDate
    20-22 Sep 1999
  • Firstpage
    769
  • Lastpage
    772
  • Abstract
    Presents a new method for extracting horizontal and vertical character lines from mixed (handwritten/printed) unformatted document images, in which characters of various sizes, gaps and orientations are nested among advertisement characters, drawings and photographs. We use the inherent features of a character line, such as the number and size of the characters it contains and the angular spectrum of the characters. When an area has characters along both horizontal and vertical lines, competitive judgment is applied. Using multi-set thresholds in a bottom-up methodology, we can successfully extract Japanese mail address character lines. 957 address character lines, taken from 252 pieces of mail, were tested, and a 95.9% correct extraction rate was achieved.
  • Keywords
    document image processing; image segmentation; mailing systems; Japanese mail address recognition; advertisements; bottom-up methodology; character angular spectrum; character line extraction; character line inherent features; character number; character orientation; character size; competitive judgment; drawings; handwritten documents; horizontal character lines; inter-character gap; mixed unformatted document images; multi-set thresholds; photographs; printed documents; vertical character lines; Character recognition; Electronic switching systems; Image analysis; Image recognition; Merging; Postal services; Read only memory; Seals; Testing; Text analysis;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Document Analysis and Recognition, 1999. ICDAR '99. Proceedings of the Fifth International Conference on
  • Conference_Location
    Bangalore
  • Print_ISBN
    0-7695-0318-7
  • Type
    conf
  • DOI
    10.1109/ICDAR.1999.791901
  • Filename
    791901