Title :
Text detection in images based on unsupervised classification of high-frequency wavelet coefficients
Author :
Gllavata, Julinda ; Ewerth, Ralph ; Freisleben, Bernd
Author_Institution :
SFB, Siegen Univ., Germany
Abstract :
Text localization and recognition in images is important for searching information in digital photo archives, video databases and Web sites. However, since text is often printed against a complex background, it is often difficult to detect. In this paper, a robust text localization approach is presented, which can automatically detect horizontally aligned text with different sizes, fonts, colors and languages. First, a wavelet transform is applied to the image and the distribution of high-frequency wavelet coefficients is considered to statistically characterize text and non-text areas. Then, the k-means algorithm is used to classify text areas in the image. The detected text areas undergo a projection analysis in order to refine their localization. Finally, a binary segmented text image is generated, to be used as input to an OCR engine. The detection performance of our approach is demonstrated by presenting experimental results for a set of video frames taken from the MPEG-7 video test set.
Keywords :
content-based retrieval; image classification; image retrieval; image segmentation; optical character recognition; text analysis; unsupervised learning; wavelet transforms; MPEG-7 video test set; OCR engine; Web sites; binary segmented text image; digital photo archives; high frequency wavelet coefficients; information searching; k-means algorithm; nontext area classification; projection analysis; robust text localization; statistical characterization; text area classification; text image detection; text image recognition; unsupervised classification; video databases; wavelet transform; Engines; Image databases; Image generation; Image recognition; Image segmentation; Optical character recognition software; Robustness; Text recognition; Wavelet coefficients; Wavelet transforms;
Conference_Titel :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
Print_ISBN :
0-7695-2128-2
DOI :
10.1109/ICPR.2004.1334146