DocumentCode :
2143988
Title :
Hypothesis Preservation Approach to Scene Text Recognition with Weighted Finite-State Transducer
Author :
Yamazoe, Takafumi ; Etoh, Minoru ; Yoshimura, Takeshi ; Tsujino, Kousuke
Author_Institution :
Service & Solution Dev. Dept., NTT DOCOMO, Japan
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
359
Lastpage :
363
Abstract :
This paper shows that the use of Weighted Finite-State Transducer (WFST) significantly eliminates large-scale ambiguity in scene text recognition, especially for Japanese Kanji characters. The proposed method consists of two WFSTs called WFST-OCR and WFST-Lexicon. WFST-OCR handles the multiple hypotheses caused by erroneous text location, character segmentation and character recognition processes. The following WFST-Lexicon and its convolution of WFST-OCR resolve the hypotheses. The WFSTs integrate the conventional OCR and post-processing processes into one process. The benefit from the proposed method is that all the ambiguities are held as WFST data, and solved in one integrated step, the system outputs texts that are statistically consistent with regard to segmentation possibilities and the given language model. An experimental system demonstrates practical performance in spite of the hypothesis complexity inherent in the ICDAR test set and Kanji character texts.
Keywords :
optical character recognition; text analysis; ICDAR test set; Japanese Kanji character text; WFST data; WFST-Lexicon; WFST-OCR; Weighted Finite-State Transducer; character recognition process; character segmentation; erroneous text location; hypothesis complexity; hypothesis preservation; scene text recognition; weighted finite-state transducer; Character recognition; Cleaning; Complexity theory; Image segmentation; Optical character recognition software; Text recognition; Transducers; Kanji character; WFST; character recognition; natural scene; scene text; text extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.80
Filename :
6065335
Link To Document :
بازگشت