DocumentCode
1310877
Title
A Hybrid Approach to Detect and Localize Texts in Natural Scene Images
Author
Pan, Yi-Feng ; Hou, Xinwen ; Liu, Cheng-Lin
Author_Institution
Nat. Lab. of Pattern Recognition (NLPR), Chinese Acad. of Sci. (CASIA), Beijing, China
Volume
20
Issue
3
fYear
2011
fDate
3/1/2011 12:00:00 AM
Firstpage
800
Lastpage
813
Abstract
Text detection and localization in natural scene images is important for content-based image analysis. This problem is challenging due to the complex background, the non-uniform illumination, the variations of text font, size and line orientation. In this paper, we present a hybrid approach to robustly detect and localize texts in natural scene images. A text region detector is designed to estimate the text existing confidence and scale information in image pyramid, which help segment candidate text components by local binarization. To efficiently filter out the non-text components, a conditional random field (CRF) model considering unary component properties and binary contextual component relationships with supervised parameter learning is proposed. Finally, text components are grouped into text lines/words with a learning-based energy minimization method. Since all the three stages are learning-based, there are very few parameters requiring manual tuning. Experimental results evaluated on the ICDAR 2005 competition dataset show that our approach yields higher precision and recall performance compared with state-of-the-art methods. We also evaluated our approach on a multilingual image dataset with promising results.
Keywords
character recognition; content-based retrieval; image processing; random processes; text analysis; visual databases; ICDAR 2005 competition dataset; conditional random field model; confidence information; connected component analysis; content-based image analysis; image pyramid; learning-based energy minimization method; line orientation; local binarization; multilingual image dataset; natural scene images; non-uniform illumination; scale information; text detection; text localisation; text region detector; Artificial neural networks; Detectors; Feature extraction; Image color analysis; Image edge detection; Image segmentation; Pixel; Conditional random field (CRF); connected component analysis (CCA); text detection; text localization;
fLanguage
English
Journal_Title
Image Processing, IEEE Transactions on
Publisher
ieee
ISSN
1057-7149
Type
jour
DOI
10.1109/TIP.2010.2070803
Filename
5560830
Link To Document