• DocumentCode
    2145755
  • Title

    An Improved Scene Text Extraction Method Using Conditional Random Field and Optical Character Recognition

  • Author

    Zhang, Hongwei ; Liu, Changsong ; Yang, Cheng ; Ding, Xiaoqing ; Wang, Kongqiao

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    708
  • Lastpage
    712
  • Abstract
    Over the past few years, research on scene text extraction has developed rapidly. Recently, condition random field (CRF) has been used to give connected components (CCs) ´text´ or ´non-text´ labels. However, a burning issue in CRF model comes from multiple text lines extraction. In this paper, we propose a two-step iterative CRF algorithm with a Belief Propagation inference and an OCR filtering stage. Two kinds of neighborhood relationship graph are used in the respective iterations for extracting multiple text lines. Furthermore, OCR confidence is used as an indicator for identifying the text regions, while a traditional OCR filter module only considered the recognition results. The first CRF iteration aims at finding certain text CCs, especially in multiple text lines, and sending uncertain CCs to the second iteration. The second iteration gives second chance for the uncertain CCs and filter false alarm CCs with the help of OCR. Experiments based on the public dataset of ICDAR 2005 prove that the proposed method is comparative with the existing algorithms.
  • Keywords
    belief maintenance; filtering theory; inference mechanisms; iterative methods; optical character recognition; random processes; text analysis; CRF model; OCR filtering stage; belief propagation inference; conditional random field; connected components; neighborhood relationship graph; nontext labels; optical character recognition; scene text extraction method; text line extraction; two-step iterative CRF algorithm; Belief propagation; Feature extraction; Filtering; Image color analysis; Optical character recognition software; Text recognition; BP; CRF; OCR; Scene text extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.148
  • Filename
    6065403