• DocumentCode
    3016902
  • Title

    Text detection and recognition in urban scenes

  • Author

    Minetto, R. ; Thome, N. ; Cord, M. ; Stolfi, J. ; Précioso, F. ; Guyomard, J. ; Leite, N.J.

  • Author_Institution
    LIP6, UPMC Univ. Paris 6, Paris, France
  • fYear
    2011
  • fDate
    6-13 Nov. 2011
  • Firstpage
    227
  • Lastpage
    234
  • Abstract
    Text detection and recognition in real images taken in unconstrained environments, such as street view images, remain surprisingly challenging in Computer Vision. In this paper, we present a comprehensive strategy combining bottom-up and top-down mechanisms to detect Text boxes. The bottom-up part is based on character segmentation and grouping . The top-down part is achieved with a statistical learning approach based on box descriptors. Our main contribution consists in introducing a new descriptor, Fuzzy HOG (F-HOG), fully adapted for text box analysis. A thorough experimental validation proves the efficiency of the whole system outperforming state of the art results on the standard ICDAR text detection benchmark. Another contribution concerns the exploitation of our text extraction in a complete search engine scheme. We propose to retrieve a location from a textual query: combining our text box detection technology with OCR on georeferenced street images, we achieved a GIS system with a fully automatic textual indexing. We demonstrate the relevance of our system on the real urban database of [10].
  • Keywords
    feature extraction; fuzzy set theory; geographic information systems; image retrieval; image segmentation; indexing; learning (artificial intelligence); natural scenes; optical character recognition; query processing; search engines; statistical analysis; text detection; traffic engineering computing; two-dimensional digital filters; F-HOG; GIS system; OCR; automatic textual indexing; bottom-up mechanism; box descriptors; character grouping; character segmentation; computer vision; fuzzy HOG; georeferenced street images; location retrieval; real urban database; search engine scheme; standard ICDAR text detection benchmark; statistical learning approach; text box analysis; text box detection; text extraction; text recognition; textual query; top-down mechanism; urban scenes; Context; Feature extraction; Histograms; Image recognition; Image segmentation; Optical character recognition software; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
  • Conference_Location
    Barcelona
  • Print_ISBN
    978-1-4673-0062-9
  • Type

    conf

  • DOI
    10.1109/ICCVW.2011.6130247
  • Filename
    6130247