• DocumentCode
    49684
  • Title

    Scene text recognition by learning co-occurrence of strokes based on spatiality embedded dictionary

  • Author

    Song Gao ; Chunheng Wang ; Baihua Xiao ; Cunzhao Shi ; Wen Zhou ; Zhong Zhang

  • Author_Institution
    State Key Lab. of Manage. & Control for Complex Syst., Inst. of Autom., Beijing, China
  • Volume
    9
  • Issue
    1
  • fYear
    2015
  • fDate
    2 2015
  • Firstpage
    138
  • Lastpage
    148
  • Abstract
    Text information contained in scene images is very helpful for high-level image understanding. In this study, the authors propose to learn co-occurrence of local strokes for scene text recognition by using a spatiality embedded dictionary (SED). Unlike spatial pyramid partitioning images into grids to incorporate spatial information, the authors SED associates every codeword with a particular response region and introduces more precise spatial information for robust character recognition. After localised soft coding and max pooling of the first layer, a sparse dictionary is learned to model co-occurrence of several local strokes, which further improves classification performance. Experimental results on two scene character recognition datasets ICDAR2003 and CHARS74 K demonstrate that their character recognition method outperforms state-of-the-art methods. Besides, competitive word recognition results are also reported for four benchmark word recognition datasets ICDAR2003, ICDAR2011, ICDAR2013 and street view text when combining their character recognition method with a conditional random field language model.
  • Keywords
    character recognition; dictionaries; text detection; CHARS74 K dataset; ICDAR2003 dataset; SED; high-level image understanding; local strokes; localised soft coding; max pooling; robust character recognition; scene images; scene text recognition; sparse dictionary; spatiality embedded dictionary; text information;
  • fLanguage
    English
  • Journal_Title
    Computer Vision, IET
  • Publisher
    iet
  • ISSN
    1751-9632
  • Type

    jour

  • DOI
    10.1049/iet-cvi.2014.0022
  • Filename
    7029820