• DocumentCode
    3486745
  • Title

    Whole is Greater than Sum of Parts: Recognizing Scene Text Words

  • Author

    Goel, Vikas ; Mishra, Anadi ; Alahari, Karteek ; Jawahar, C.V.

  • Author_Institution
    Center for Visual Inf. Technol., IIIT, Hyderabad, India
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    398
  • Lastpage
    402
  • Abstract
    Recognizing text in images taken in the wild is a challenging problem that has received great attention in recent years. Previous methods addressed this problem by first detecting individual characters, and then forming them into words. Such approaches often suffer from weak character detections, due to large intra-class variations, even more so than characters from scanned documents. We take a different view of the problem and present a holistic word recognition framework. In this, we first represent the scene text image and synthetic images generated from lexicon words using gradient-based features. We then recognize the text in the image by matching the scene and synthetic image features with our novel weighted Dynamic Time Warping (wDTW) approach. We perform experimental analysis on challenging public datasets, such as Street View Text and ICDAR 2003. Our proposed method significantly outperforms our earlier work in Mishra et al. (CVPR 2012), as well as many other recent works, such as Novikova et al. (ECCV 2012), Wang et al. al.(ICPR 2012), Wang et al.(ICCV 2011).
  • Keywords
    character recognition; document image processing; gradient methods; text detection; character detections; gradient-based features; holistic word recognition framework; large intra-class variations; lexicon words; novel weighted dynamic time warping approach; scanned documents; scene text image; scene text word recognition; synthetic images; wDTW approach; Accuracy; Feature extraction; Histograms; Image recognition; Robustness; Strips; Text recognition; DTW; Holistic matching; Scene Text Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.87
  • Filename
    6628652