DocumentCode :
2142949
Title :
Text Detection in Natural Scene Images by Stroke Gabor Words
Author :
Yi, Chucai ; Tian, YingLi
Author_Institution :
Dept. of Comput. Sci., City Univ. of New York, New York, NY, USA
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
177
Lastpage :
181
Abstract :
In this paper, we propose a novel algorithm, based on stroke components and descriptive Gabor filters, to detect text regions in natural scene images. Text characters and strings are constructed by stroke components as basic units. Gabor filters are used to describe and analyze the stroke components in text characters or strings. We define a suitability measurement to analyze the confidence of Gabor filters in describing stroke component and the suitability of Gabor filters on an image window. From the training set, we compute a set of Gabor filters that can describe principle stroke components of text by their parameters. Then a K-means algorithm is applied to cluster the descriptive Gabor filters. The clustering centers are defined as Stroke Gabor Words (SGWs) to provide a universal description of stroke components. By suitability evaluation on positive and negative training samples respectively, each SGW generates a pair of characteristic distributions of suitability measurements. On a testing natural scene image, heuristic layout analysis is applied first to extract candidate image windows. Then we compute the principle SGWs for each image window to describe its principle stroke components. Characteristic distributions generated by principle SGWs are used to classify text or non-text windows. Experimental results on benchmark datasets demonstrate that our algorithm can handle complex backgrounds and variant text patterns (font, color, scale, etc.).
Keywords :
Gabor filters; image processing; text analysis; K-means algorithm; descriptive Gabor filters; heuristic layout analysis; natural scene images; stroke Gabor words; stroke components; suitability measurement; text characters; text detection; Classification algorithms; Feature extraction; Gabor filters; Image color analysis; Robustness; Testing; Training; Gabor Filter; SGW Characteristic Distributions; Stroke Component; Stroke Gabor Words; Suitability Measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.44
Filename :
6065299
Link To Document :
بازگشت