• DocumentCode
    477137
  • Title

    Text extraction algorithm under background image using wavelet transforms

  • Author

    Zhang, Xiao-wei ; Zheng, Xiong-bo ; Weng, Zhi-juan

  • Author_Institution
    Sch. of Sci., Harbin Eng. Univ., Harbin
  • Volume
    1
  • fYear
    2008
  • fDate
    30-31 Aug. 2008
  • Firstpage
    200
  • Lastpage
    204
  • Abstract
    With the growing number of digital multimedia libraries, the need to efficiently index multimedia information is increasing, detecting and extracting the text information from images plays an important part in images indexing based on content. In the paper, a new text extraction algorithm under background image based on two-dimensional wavelet transforms is proposed. For the algorithm, firstly the image is transformed into the wavelet domain and then a sliding window is set to scan high frequency sub-bands, through computing the wavelet texture features of the image in the sliding window, k-means clustering algorithm is used to classify the image into text area, simple background area and complex background area. Finally mathematical morphology operations are applied on the text area to locate the text positions exactly. The experimental result shows that the algorithm can extract text information with different languages, fonts, sizes and ways of arrangement from the background image exactly.
  • Keywords
    content-based retrieval; feature extraction; image texture; mathematical morphology; multimedia computing; pattern clustering; wavelet transforms; background image; digital multimedia libraries; k-means clustering algorithm; mathematical morphology; sliding window; text extraction algorithm; wavelet texture features; wavelet transforms; Algorithm design and analysis; Clustering algorithms; Data mining; Frequency; Image analysis; Indexing; Information analysis; Wavelet analysis; Wavelet coefficients; Wavelet transforms; Wavelet transform; k -means clustering algorithm; mathematical morphology; text extraction; texture feature;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wavelet Analysis and Pattern Recognition, 2008. ICWAPR '08. International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-2238-8
  • Electronic_ISBN
    978-1-4244-2239-5
  • Type

    conf

  • DOI
    10.1109/ICWAPR.2008.4635776
  • Filename
    4635776