• DocumentCode
    594815
  • Title

    Effective text localization in natural scene images with MSER, geometry-based grouping and AdaBoost

  • Author

    Xuwang Yin ; Xu-Cheng Yin ; Hong-Wei Hao ; Iqbal, Kamran

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Sci. & Technol. Beijing, Beijing, China
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    725
  • Lastpage
    728
  • Abstract
    Text localization in natural scene images is an important prerequisite for many content-based image analysis tasks. In this paper, we proposed a novel and effective approach to accurately localize scene texts. Firstly, Maximally stable extremal regions(MSER) are extracted as letter candidates. Secondly, after elimination of non-letter candidates by using geometric information, candidate regions are constructed by grouping similar letter candidates using disjoint set. Candidate region features based on horizontal and vertical variances, stroke width, color and geometry are extracted. An AdaBoost classifier is built from these features and text regions are identified. The overall system is evaluated on the ICDAR 2011 competition dataset and the experimental results show that the proposed algorithm yields high precision and recall compared with the latest published algorithms.
  • Keywords
    geometry; image colour analysis; learning (artificial intelligence); text analysis; AdaBoost classifier; ICDAR 2011 competition dataset; MSER; candidate region features; content-based image analysis tasks; disjoint set; geometry-based grouping; horizontal variances; image color; letter candidates; maximally stable extremal regions; natural scene images; stroke width; text localization; vertical variances; Data mining; Educational institutions; Feature extraction; Geometry; Image color analysis; Image recognition; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460237