DocumentCode
3423510
Title
Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors
Author
Weilin Huang ; Zhe Lin ; Jianchao Yang ; Jue Wang
Author_Institution
Shenzhen Key Lab. of Comp. Vis & Pat. Rec., Shenzhen Inst. of Adv. Technol., Shenzhen, China
fYear
2013
fDate
1-8 Dec. 2013
Firstpage
1241
Lastpage
1248
Abstract
In this paper, we present a new approach for text localization in natural images, by discriminating text and non-text regions at three levels: pixel, component and text line levels. Firstly, a powerful low-level filter called the Stroke Feature Transform (SFT) is proposed, which extends the widely-used Stroke Width Transform (SWT) by incorporating color cues of text pixels, leading to significantly enhanced performance on inter-component separation and intra-component connection. Secondly, based on the output of SFT, we apply two classifiers, a text component classifier and a text-line classifier, sequentially to extract text regions, eliminating the heuristic procedures that are commonly used in previous approaches. The two classifiers are built upon two novel Text Covariance Descriptors (TCDs) that encode both the heuristic properties and the statistical characteristics of text stokes. Finally, text regions are located by simply thresholding the text-line confident map. Our method was evaluated on two benchmark datasets: ICDAR 2005 and ICDAR 2011, and the corresponding F-measure values are 0.72 and 0.73, respectively, surpassing previous methods in accuracy by a large margin.
Keywords
feature extraction; image classification; image colour analysis; text detection; ICDAR 2005; ICDAR 2011; SFT; SWT; TCD; intercomponent separation; intracomponent connection; natural images; stroke feature transform; stroke width transform; text component classifier; text covariance descriptors; text localization; text pixel color cues; text-line classifier; text-line confident map; Color; Covariance matrices; Feature extraction; Image color analysis; Image edge detection; Transforms; Vectors; Low-level filter; stroke width transform; text component; text covariance descriptors;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location
Sydney, NSW
ISSN
1550-5499
Type
conf
DOI
10.1109/ICCV.2013.157
Filename
6751264
Link To Document