• DocumentCode
    2067118
  • Title
    A Method of Text Segmentation from Scanned Image with Complex Background
  • Author
    Huang, Xiang-Lin ; Yang, Li-Fang ; Yang, Zhao
  • Author_Institution
    Comput. Sch. Commun., Univ. of China, Beijing, China
  • fYear
    2009
  • fDate
    20-22 Sept. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    With the development of information technology, the number of scanned images is increasing rapidly, and many of these images contain important text. To meet the needs of image viewing, text identification, and text retrieval, this paper presents an efficient method for text segmentation. First, the text blocks in the scanned image are localized. Second, according to its gray/color distribution, each text block image is decomposed into a text sub-layer, background sub-layers, and mixed sub-layers that contain both text and background. Finally, the background is filtered out of these sub-layers, and the combination of the text in all remaining sub-layers is the text segmentation result. Experimental results show that the proposed method is robust to overlapped complex backgrounds.
  • Keywords
    image colour analysis; image denoising; image retrieval; image segmentation; optical character recognition; text analysis; OCR; background sublayer; color distribution; information technology; noise filtering; overlapped complex background; scanned image; text identification; text retrieval; text segmentation method; Background noise; Filtering; Image edge detection; Image retrieval; Image segmentation; Information technology; Office automation; Robustness; Text recognition; Wavelet transforms;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Management and Service Science, 2009. MASS '09. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4638-4
  • Electronic_ISBN
    978-1-4244-4639-1
  • Type
    conf
  • DOI
    10.1109/ICMSS.2009.5300826
  • Filename
    5300826