DocumentCode
3433366
Title
Text detection in images based on unsupervised classification of high-frequency wavelet coefficients
Author
Gllavata, Julinda ; Ewerth, Ralph ; Freisleben, Bernd
Author_Institution
SFB, Siegen Univ., Germany
Volume
1
fYear
2004
fDate
23-26 Aug. 2004
Firstpage
425
Abstract
Text localization and recognition in images is important for searching information in digital photo archives, video databases and Web sites. However, since text is often printed against a complex background, it is often difficult to detect. In this paper, a robust text localization approach is presented, which can automatically detect horizontally aligned text with different sizes, fonts, colors and languages. First, a wavelet transform is applied to the image and the distribution of high-frequency wavelet coefficients is considered to statistically characterize text and non-text areas. Then, the k-means algorithm is used to classify text areas in the image. The detected text areas undergo a projection analysis in order to refine their localization. Finally, a binary segmented text image is generated, to be used as input to an OCR engine. The detection performance of our approach is demonstrated by presenting experimental results for a set of video frames taken from the MPEG-7 video test set.
Keywords
content-based retrieval; image classification; image retrieval; image segmentation; optical character recognition; text analysis; unsupervised learning; wavelet transforms; MPEG-7 video test set; OCR engine; Web sites; binary segmented text image; digital photo archives; high frequency wavelet coefficients; information searching; k-means algorithm; nontext area classification; projection analysis; robust text localization; statistical characterization; text area classification; text image detection; text image recognition; unsupervised classification; video databases; wavelet transform; Engines; Image databases; Image generation; Image recognition; Image segmentation; Optical character recognition software; Robustness; Text recognition; Wavelet coefficients; Wavelet transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
ISSN
1051-4651
Print_ISBN
0-7695-2128-2
Type
conf
DOI
10.1109/ICPR.2004.1334146
Filename
1334146
Link To Document