• DocumentCode
    3433366
  • Title
    Text detection in images based on unsupervised classification of high-frequency wavelet coefficients
  • Author
    Gllavata, Julinda ; Ewerth, Ralph ; Freisleben, Bernd

  • Author_Institution
    SFB, Siegen Univ., Germany
  • Volume
    1
  • fYear
    2004
  • fDate
    23-26 Aug. 2004
  • Firstpage
    425
  • Abstract
    Text localization and recognition in images are important for searching information in digital photo archives, video databases and Web sites. However, because text is often printed against a complex background, it can be difficult to detect. In this paper, a robust text localization approach is presented that automatically detects horizontally aligned text of different sizes, fonts, colors and languages. First, a wavelet transform is applied to the image, and the distribution of high-frequency wavelet coefficients is used to statistically characterize text and non-text areas. Then, the k-means algorithm is used to classify text areas in the image. The detected text areas undergo a projection analysis to refine their localization. Finally, a binary segmented text image is generated for use as input to an OCR engine. The detection performance of the approach is demonstrated with experimental results for a set of video frames taken from the MPEG-7 video test set.
  • Keywords
    content-based retrieval; image classification; image retrieval; image segmentation; optical character recognition; text analysis; unsupervised learning; wavelet transforms; MPEG-7 video test set; OCR engine; Web sites; binary segmented text image; digital photo archives; high frequency wavelet coefficients; information searching; k-means algorithm; nontext area classification; projection analysis; robust text localization; statistical characterization; text area classification; text image detection; text image recognition; unsupervised classification; video databases; wavelet transform; Engines; Image databases; Image generation; Image recognition; Image segmentation; Optical character recognition software; Robustness; Text recognition; Wavelet coefficients; Wavelet transforms
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2128-2
  • Type
    conf

  • DOI
    10.1109/ICPR.2004.1334146
  • Filename
    1334146
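
The pipeline outlined in the abstract (wavelet transform, statistics of high-frequency coefficients, k-means classification, projection analysis) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not name the wavelet, the features, or the block size, so the single-level Haar transform, the per-tile mean absolute coefficient feature, the 4x4 tile size, the deterministic k-means initialization, and all function names below are assumptions chosen for illustration.

```python
import numpy as np

def haar_dwt2(img):
    """Single-level 2D Haar transform (assumed wavelet); returns LL, LH, HL, HH."""
    a = np.asarray(img, dtype=float)
    a = a[: a.shape[0] // 2 * 2, : a.shape[1] // 2 * 2]  # crop to even size
    lo = (a[:, 0::2] + a[:, 1::2]) / 2.0  # horizontal average
    hi = (a[:, 0::2] - a[:, 1::2]) / 2.0  # horizontal difference
    ll = (lo[0::2] + lo[1::2]) / 2.0
    lh = (lo[0::2] - lo[1::2]) / 2.0
    hl = (hi[0::2] + hi[1::2]) / 2.0
    hh = (hi[0::2] - hi[1::2]) / 2.0
    return ll, lh, hl, hh

def block_features(bands, block=4):
    """Mean absolute coefficient per tile for each band (hypothetical feature)."""
    feats = []
    for b in bands:
        h, w = b.shape[0] // block, b.shape[1] // block
        tiles = np.abs(b[: h * block, : w * block]).reshape(h, block, w, block)
        feats.append(tiles.mean(axis=(1, 3)))
    return np.stack(feats, axis=-1)  # shape (h, w, number of bands)

def kmeans2(x, iters=20):
    """Plain two-cluster k-means, seeded with the lowest and highest energy samples."""
    energy = x.sum(axis=1)
    centers = np.stack([x[energy.argmin()], x[energy.argmax()]]).astype(float)
    for _ in range(iters):
        dist = ((x[:, None, :] - centers[None]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for k in range(2):
            if np.any(labels == k):
                centers[k] = x[labels == k].mean(axis=0)
    return labels, centers

def detect_text_blocks(img, block=4):
    """Boolean tile mask; the cluster with more high-frequency energy is taken as text."""
    _, lh, hl, hh = haar_dwt2(img)
    f = block_features((lh, hl, hh), block)
    labels, centers = kmeans2(f.reshape(-1, f.shape[-1]))
    text_label = centers.sum(axis=1).argmax()
    return (labels == text_label).reshape(f.shape[:2])

def row_bounds(mask):
    """Horizontal projection: first and last tile row that contains text tiles."""
    rows = np.flatnonzero(mask.sum(axis=1) > 0)
    return (int(rows[0]), int(rows[-1])) if rows.size else None

if __name__ == "__main__":
    # Synthetic frame: flat background with one high-contrast horizontal band.
    img = np.full((64, 64), 128.0)
    yy, xx = np.mgrid[16:32, 8:56]
    img[16:32, 8:56] = 255.0 * ((yy + xx) % 2)
    mask = detect_text_blocks(img)
    print(mask.astype(int))
    print("text tile rows:", row_bounds(mask))
```

Each tile in the mask covers an 8x8 pixel region of the input, because the Haar step halves the resolution and the tiles are 4x4 coefficients. A practical system would also project along columns, merge tiles into bounding boxes, and binarize the boxes before passing them to OCR; those steps are omitted here.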