Title :
A density-based approach for text extraction in images
Author :
Liu, Fang ; Peng, Xiang ; Wang, Tianjiang ; Lu, Songfeng
Author_Institution :
Sch. of Comput. Sci. & Technol., Huazhong Univ. of Sci. & Technol., Wuhan
Abstract :
In this paper we describe a new approach to distinguish and extract text from images with various objects and complex backgrounds. The goal of our approach is to present characters in images with clear background and without other objects. The proposed approach mainly includes two steps. Firstly, a density-based clustering method is employed to segment candidate characters by integrating spatial connectivity and color feature of characterspsila pixels. In most images, colors of pixels in one character are commonly non-uniform due to the noise. So a new histogram segmentation method is proposed in this step to obtain the color thresholds of characters. Secondly, priori knowledge and texture-based method are performed on the candidate characters to filter the non-characters. Experimental results show that the proposed approach has a good performance in character extraction rate.
Keywords :
feature extraction; image colour analysis; image retrieval; image segmentation; pattern clustering; text analysis; candidate character segmentation; character extraction rate; color feature; color thresholds; density-based clustering method; histogram segmentation method; text extraction; texture-based method; Clustering algorithms; Clustering methods; Colored noise; Computer science; Gray-scale; Histograms; Image color analysis; Image segmentation; Optical character recognition software; Pixel;
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
DOI :
10.1109/ICPR.2008.4761637