• DocumentCode
    3487419
  • Title

    Scene Text Recognition with a Hough Forest Implicit Shape Model

  • Author

    Jae-Hyun Seok ; Jin Hyung Kim

  • Author_Institution
    Dept. of Comput. Sci., KAIST, Daejeon, South Korea
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    599
  • Lastpage
    603
  • Abstract
    Scene text recognition is the problem of recognizing text in natural scene images taken in unconstrained manner. Many approaches have been proposed, but most of them utilize character models only in character recognition phase, the last stage of the process. If characteristic of the target character shape is utilized earlier for the text detection and extraction, text localization would be much easier and therefore, text recognition would be more robust. In this paper, we propose a novel scene text recognition approach which fully utilizes models of target characters from the beginning to the end of the recognition process. Each of target character set is modeled with part-based object model called Implicit Shape Model (ISM) to achieve robustness for partial degradation of text objects. Towards this end, we trained a Hough forest which localizes and aggregates character parts to detect characters candidates from an image. The detected character candidates are verified by organizing the most plausible text lines using dynamic programming. As concrete character models are utilized throughout the process, even extremely deformed texts are detected and recognized, which are hardly detected with previous approaches.
  • Keywords
    dynamic programming; feature extraction; image recognition; object detection; Hough forest implicit shape model; character recognition phase; dynamic programming; natural scene image; part-based object model; scene text recognition; text detection; text extraction; text localization; text object partial degradation; Character recognition; Image recognition; Probabilistic logic; Robustness; Shape; Text recognition; Transforms; Hough forests; Implicit shape models; Part-based character models; Scene text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.124
  • Filename
    6628689