DocumentCode :
2149502
Title :
Recognition of Multi-oriented, Multi-sized, and Curved Text
Author :
Chiang, Yao-Yi ; Knoblock, Craig A.
Author_Institution :
Inf. Sci. Inst., Univ. of Southern California, Marina del Rey, CA, USA
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
1399
Lastpage :
1403
Abstract :
Text recognition is difficult from documents that contain multi-oriented, curved text lines of various character sizes. This is because layout analysis techniques, which most optical character recognition (OCR) approaches rely on, do not work well on unstructured documents with non-homogeneous text. Previous work on recognizing non-homogeneous text typically handles specific cases, such as horizontal and/or straight text lines and single-sized characters. In this paper, we present a general text recognition technique to handle non-homogeneous text by exploiting dynamic character grouping criteria based on the character sizes and maximum desired string curvature. This technique can be easily integrated with classic OCR approaches to recognize non-homogeneous text. In our experiments, we compared our approach to a commercial OCR product using a variety of raster maps that contain multi-oriented, curved and straight text labels of multi-sized characters. Our evaluation showed that our approach produced accurate text recognition results and outperformed the commercial product at both the word and character level accuracy.
Keywords :
natural language processing; optical character recognition; text analysis; OCR; curved text; dynamic character; multioriented recognition; optical character recognition; string curvature; text recognition; text recognition technique; unstructured documents; Character recognition; Computers; Educational institutions; Image color analysis; Joining processes; Optical character recognition software; Text recognition; curved text; map processing; ocr; raster maps;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.281
Filename :
6065540
Link To Document :
بازگشت