• DocumentCode
    553225
  • Title

    A post-processing approach to Chinese address recognition

  • Author

    Xinyu Yao ; Yue Lu

  • Author_Institution
    Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
  • Volume
    3
  • fYear
    2011
  • fDate
    26-28 July 2011
  • Firstpage
    1906
  • Lastpage
    1910
  • Abstract
    In recent years, it has become a focus to make use of the address recognition technology to improve the performance of mail sorting machines. The research in the postal address recognition, which extends the context relation from words to sentences with the use of the address information in post-processing, can effectively improve the recognition performance. In this paper, we propose a divide-and-rule method. The address is divided into high level address and low level address. For high level address, the similarity method with HLA database is presented. For low level address, the Syllable-based language model which applies to Pinyin is discussed. Then a new similarity method in multi-mode is introduced. After post-processing, for high level address, the hit rate rises from 87.63% to 95.12% while the accuracy rate declines by 2.67%, and for low level address, the hit rate increases from 58.16% to 91.81% while the accuracy rate decreases by 9.40%.
  • Keywords
    mailing systems; natural language processing; optical character recognition; word processing; Chinese address recognition; HLA database; OCR; Syllable-based language model; address information; divide and rule method; high level address; low level address; mail sorting machines; optical character recognition; post-processing approach; postal address recognition; Accuracy; Cities and towns; Databases; Optical character recognition software; Pattern recognition; Postal services; Sorting; OCR; fuzzy match; post-processing; similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-61284-180-9
  • Type

    conf

  • DOI
    10.1109/FSKD.2011.6019901
  • Filename
    6019901