• DocumentCode
    3707357
  • Title

    A novel binarization approach for text in images

  • Author

    Ping Hu;Weiqiang Wang;Ke Lu

  • Author_Institution
    School of Computer and Control Engineering, University of Chinese Academy of Sciences, Beijing, China
  • fYear
    2015
  • Firstpage
    956
  • Lastpage
    960
  • Abstract
    Accurate recognition of scene text and overlaid text is still a challenging issue due to degradation and complex background, and text binarization is crucial for recognition accuracy. This paper presents an effective method to extract characters in images and video frames. Our method assumes that background pixels possess good spatial connectivity and high appearance similarity to boundary pixels in a cropped text string image. It first computes the confidence of pixels as text. Then the confidence map is exploited to partition text regions into characters. Further, each character region is clustered into different layers and background components are removed to generate candidate binarization results. The final result is obtained based on the scores of each layer. Our method is validated by better recognition rates and segmentation accuracy on the ICADR03 dataset and a big dataset of overlaid text.
  • Keywords
    "Image recognition","Text recognition","Optical character recognition software","Image color analysis","Image segmentation","Character recognition","Degradation"
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2015 IEEE International Conference on
  • Type

    conf

  • DOI
    10.1109/ICIP.2015.7350941
  • Filename
    7350941