DocumentCode :
1310877
Title :
A Hybrid Approach to Detect and Localize Texts in Natural Scene Images
Author :
Pan, Yi-Feng ; Hou, Xinwen ; Liu, Cheng-Lin
Author_Institution :
Nat. Lab. of Pattern Recognition (NLPR), Chinese Acad. of Sci. (CASIA), Beijing, China
Volume :
20
Issue :
3
fYear :
2011
fDate :
3/1/2011 12:00:00 AM
Firstpage :
800
Lastpage :
813
Abstract :
Text detection and localization in natural scene images is important for content-based image analysis. This problem is challenging due to the complex background, the non-uniform illumination, the variations of text font, size and line orientation. In this paper, we present a hybrid approach to robustly detect and localize texts in natural scene images. A text region detector is designed to estimate the text existing confidence and scale information in image pyramid, which help segment candidate text components by local binarization. To efficiently filter out the non-text components, a conditional random field (CRF) model considering unary component properties and binary contextual component relationships with supervised parameter learning is proposed. Finally, text components are grouped into text lines/words with a learning-based energy minimization method. Since all the three stages are learning-based, there are very few parameters requiring manual tuning. Experimental results evaluated on the ICDAR 2005 competition dataset show that our approach yields higher precision and recall performance compared with state-of-the-art methods. We also evaluated our approach on a multilingual image dataset with promising results.
Keywords :
character recognition; content-based retrieval; image processing; random processes; text analysis; visual databases; ICDAR 2005 competition dataset; conditional random field model; confidence information; connected component analysis; content-based image analysis; image pyramid; learning-based energy minimization method; line orientation; local binarization; multilingual image dataset; natural scene images; non-uniform illumination; scale information; text detection; text localisation; text region detector; Artificial neural networks; Detectors; Feature extraction; Image color analysis; Image edge detection; Image segmentation; Pixel; Conditional random field (CRF); connected component analysis (CCA); text detection; text localization;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2010.2070803
Filename :
5560830
Link To Document :
بازگشت