Title :
Text extraction algorithm under background image using wavelet transforms
Author :
Zhang, Xiao-wei ; Zheng, Xiong-bo ; Weng, Zhi-juan
Author_Institution :
Sch. of Sci., Harbin Eng. Univ., Harbin
Abstract :
With the growing number of digital multimedia libraries, the need to efficiently index multimedia information is increasing, detecting and extracting the text information from images plays an important part in images indexing based on content. In the paper, a new text extraction algorithm under background image based on two-dimensional wavelet transforms is proposed. For the algorithm, firstly the image is transformed into the wavelet domain and then a sliding window is set to scan high frequency sub-bands, through computing the wavelet texture features of the image in the sliding window, k-means clustering algorithm is used to classify the image into text area, simple background area and complex background area. Finally mathematical morphology operations are applied on the text area to locate the text positions exactly. The experimental result shows that the algorithm can extract text information with different languages, fonts, sizes and ways of arrangement from the background image exactly.
Keywords :
content-based retrieval; feature extraction; image texture; mathematical morphology; multimedia computing; pattern clustering; wavelet transforms; background image; digital multimedia libraries; k-means clustering algorithm; mathematical morphology; sliding window; text extraction algorithm; wavelet texture features; wavelet transforms; Algorithm design and analysis; Clustering algorithms; Data mining; Frequency; Image analysis; Indexing; Information analysis; Wavelet analysis; Wavelet coefficients; Wavelet transforms; Wavelet transform; k -means clustering algorithm; mathematical morphology; text extraction; texture feature;
Conference_Titel :
Wavelet Analysis and Pattern Recognition, 2008. ICWAPR '08. International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-2238-8
Electronic_ISBN :
978-1-4244-2239-5
DOI :
10.1109/ICWAPR.2008.4635776