• DocumentCode
    2013682
  • Title
    Integrated Segmentation and Recognition of Mixed Chinese/English Document
  • Author
    Xia, Yong ; Xiao, Bai-Hua ; Wang, Chun-Heng ; Dai, Ru-Wei
  • Author_Institution
    Chinese Acad. of Sci., Beijing
  • Volume
    2
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    704
  • Lastpage
    708
  • Abstract
    This paper presents a general framework for integrating segmentation and recognition, and gives a novel method for identifying the language attribute of mixed Chinese/English characters. The method has four main strengths. First, a text-line rather than a character segment is treated as the processing unit. Second, multiple features are adopted, based on multi-phase segmentation. Third, two types of feedback, one from character recognition and one from character feature statistics within a text-line, are used throughout the whole segmentation and recognition process. Fourth, the method adapts to the quality and genre of documents.
  • Keywords
    character recognition; document image processing; image recognition; image segmentation; character feature statistics; character recognition; integrated recognition; integrated segmentation; mixed Chinese/English characters; mixed Chinese/English document; multiphase segmentation; Automation; Character recognition; Engines; Feature extraction; Feedback; Intelligent systems; Laboratories; Natural languages; Optical character recognition software; Statistics;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type
    conf
  • DOI
    10.1109/ICDAR.2007.4377006
  • Filename
    4377006