• DocumentCode
    2531258
  • Title

    Two Texture Segmentation of Document Image Using Wavelet Packet Analysis

  • Author

    Lee, Geum-boon ; Odoyo, Wilfred O. ; Lee, Jae-Hoon ; Chung, Il-Yong ; Cho, Beom-joon

  • Author_Institution
    Dept. of Comput. Eng., Chosun Univ., Gwangju
  • Volume
    1
  • fYear
    2007
  • fDate
    12-14 Feb. 2007
  • Firstpage
    395
  • Lastpage
    398
  • Abstract
    In this paper, we present a text segmentation method using wavelet packet analysis and k-means clustering algorithm. This approach assumes that the text and non-text regions are considered as two different texture regions. The text segmentation is achieved by using wavelet packet analysis as a feature analysis. The wavelet packet analysis is a method of wavelet decomposition that offers a richer range of possibilities for document image. From these multiscale features, we compute the local energy and intensify the features before adapting the k-means clustering algorithm based on the unsupervised learning rule. The results show that our text segmentation method is effective for document images scanned from newspapers and journals.
  • Keywords
    document image processing; image segmentation; image texture; wavelet transforms; feature analysis; k-means clustering algorithm; text segmentation method; two texture document image segmentation; unsupervised learning rule; wavelet decomposition; wavelet packet analysis; Algorithm design and analysis; Clustering algorithms; Image analysis; Image segmentation; Image texture analysis; Signal analysis; Text analysis; Unsupervised learning; Wavelet analysis; Wavelet packets; document image segmentation; energy estimation; k-means clustering algorithm; wavelet packet analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Communication Technology, The 9th International Conference on
  • Conference_Location
    Gangwon-Do
  • ISSN
    1738-9445
  • Print_ISBN
    978-89-5519-131-8
  • Type

    conf

  • DOI
    10.1109/ICACT.2007.358379
  • Filename
    4195158