DocumentCode
2013682
Title
Integrated Segmentation and Recognition of Mixed Chinese/English Document
Author
Xia, Yong ; Xiao, Bai-Hua ; Wang, Chun-Heng ; Dai, Ru-Wei
Author_Institution
Chinese Acad. of Sci., Beijing
Volume
2
fYear
2007
fDate
23-26 Sept. 2007
Firstpage
704
Lastpage
708
Abstract
This paper presents a general frame to integrate segmentation and recognition and gives a novel method to identify lingual attribute of mixed Chinese/English characters. The outstanding performance of this method is as follows. First, a text- line rather than a character segment is regarded as a process unit. Second, multi-feature is adopted based on multi-phase segmentation. Third, two types of feedbacks, including from character recognition and from character feature statistic within a text-line, are adopted throughout the whole segmentation and recognition. Fourth, it is adaptive to the quality and genre of documents.
Keywords
character recognition; document image processing; image recognition; image segmentation; character feature statistics; character recognition; integrated recognition; integrated segmentation; mixed Chinese/English characters; mixed Chinese/English document; multiphase segmentation; Automation; Character recognition; Engines; Feature extraction; Feedback; Intelligent systems; Laboratories; Natural languages; Optical character recognition software; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location
Parana
ISSN
1520-5363
Print_ISBN
978-0-7695-2822-9
Type
conf
DOI
10.1109/ICDAR.2007.4377006
Filename
4377006
Link To Document