Title :
Localizing Text in Scene Images by Boundary Clustering, Stroke Segmentation, and String Fragment Classification
Author :
Yi, Chucai ; Tian, YingLi
Author_Institution :
Graduate Center, City University of New York, New York, NY, USA
Abstract :
In this paper, we propose a novel framework to extract text regions from scene images with complex backgrounds and multiple text appearances. This framework consists of three main steps: boundary clustering (BC), stroke segmentation, and string fragment classification. In BC, we propose a new bigram-color-uniformity-based method to model both text and attachment surface, and cluster edge pixels based on color pairs and spatial positions into boundary layers. Then, stroke segmentation is performed at each boundary layer by color assignment to extract character candidates. We propose two algorithms to combine the structural analysis of text stroke with color assignment and filter out background interferences. Further, we design a robust string fragment classification based on Gabor-based text features. The features are obtained from feature maps of gradient, stroke distribution, and stroke width. The proposed framework of text localization is evaluated on scene images, born-digital images, broadcast video images, and images of handheld objects captured by blind persons. Experimental results on respective datasets demonstrate that the framework outperforms state-of-the-art localization algorithms.
Keywords :
Algorithm design and analysis; Classification algorithms; Clustering algorithms; Feature extraction; Gabor filters; Image color analysis; Image edge detection; Bigram color uniformity; Gabor-based text features; boundary clustering (BC); color assignment; string fragment classification; stroke segmentation; text localization; Algorithms; Artificial Intelligence; Cluster Analysis; Humans; Image Processing, Computer-Assisted; Pattern Recognition, Automated; Self-Help Devices; Video Recording; Visually Impaired Persons; Writing;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2012.2199327