• DocumentCode
    2301360
  • Title

    An Efficient Method for Text Location and Segmentation

  • Author

    Wen, Weijuan ; Huang, Xianglin ; Yang, Lifang ; Yang, Zhao ; Zhang, Pengju

  • Author_Institution
    Comput. Sch., Commun. Univ. of China, Beijing, China
  • Volume
    3
  • fYear
    2009
  • fDate
    19-21 May 2009
  • Firstpage
    3
  • Lastpage
    7
  • Abstract
    With the development of information technology, the number of images is increasing rapidly, and many of these images contain important text. To meet the needs of image browsing, text identification, and text retrieval, this paper presents an efficient method for text location and segmentation. First, the image is transformed into the wavelet domain. The three sub-band wavelet images HxLy, LxHy, and HxHy are then binarized and merged into a texture image by linear combination. The texture image is processed with CRLA (constrained run length algorithm) and image smoothing to enhance the candidate text regions. Finally, the text regions are located by 8-connected component growing followed by filtering out of non-text regions. So that they can be recognized by OCR (optical character recognition), characters are segmented from the text blocks based on the mean values of slices of each text block. Experimental results show that the proposed method is robust for text overlaid on complex backgrounds.
  • Keywords
    feature extraction; filtering theory; image enhancement; image retrieval; image segmentation; image texture; optical character recognition; text analysis; wavelet transforms; CRLA; OCR; candidate text region; constrained run length algorithm; edge texture image extraction; image enhancement; image smoothing; image text location; nontext region filtering; optical character recognition; sub-band wavelet image; text identification; text retrieval; text segmentation; text-block slicing; wavelet transform domain; Character recognition; Filtering; Image converters; Image retrieval; Image segmentation; Information technology; Smoothing methods; Text recognition; Wavelet domain; Wavelet transforms; text location; text segmentation; wavelet transform;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Engineering, 2009. WCSE '09. WRI World Congress on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-0-7695-3570-8
  • Type

    conf

  • DOI
    10.1109/WCSE.2009.292
  • Filename
    5319367
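
The abstract names a constrained run length algorithm (CRLA) step but gives no parameters. As a rough illustration only, the sketch below applies horizontal run-length smoothing to a binary image: background runs no longer than a threshold that lie between two foreground pixels are filled in, which tends to join neighbouring character strokes into candidate text blocks. The function names, the `max_gap` parameter, and the choice of a horizontal-only pass are assumptions made for this example; the paper's actual constraints and thresholds are not stated in this record.

```python
def crla_row(row, max_gap):
    """Fill runs of 0s of length <= max_gap that lie between two 1s.

    `row` is a list of 0/1 pixels; `max_gap` is a hypothetical threshold,
    not a value taken from the paper.
    """
    out = list(row)
    last_one = None  # index of the most recent foreground pixel seen
    for i, value in enumerate(row):
        if value:
            if last_one is not None:
                gap = i - last_one - 1
                if 0 < gap <= max_gap:
                    # Short background run bounded by foreground: fill it.
                    for j in range(last_one + 1, i):
                        out[j] = 1
            last_one = i
    return out


def crla(image, max_gap):
    """Apply the horizontal smoothing pass to every row of a binary image."""
    return [crla_row(row, max_gap) for row in image]


if __name__ == "__main__":
    texture = [
        [1, 0, 0, 1, 0, 0, 0, 0, 1, 0],
        [0, 1, 0, 1, 0, 0, 0, 0, 0, 0],
    ]
    for smoothed_row in crla(texture, max_gap=2):
        print(smoothed_row)
```

Runs that touch the image border are left untouched here, since they are not bounded by foreground on both sides; a vertical pass could be added in the same way by applying `crla_row` to columns.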