• DocumentCode
    53899
  • Title

    Learning and Recognition of On-Premise Signs From Weakly Labeled Street View Images

  • Author

    Tsung-Hung Tsai ; Wen-Huang Cheng ; Chuang-Wen You ; Min-Chun Hu ; Tsui, Arvin Wen ; Heng-Yu Chi

  • Author_Institution
    Res. Center for Inf. Technol. Innovation, Taipei, Taiwan
  • Volume
    23
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    1047
  • Lastpage
    1059
  • Abstract
    Camera-enabled mobile devices are commonly used as interaction platforms for linking the user´s virtual and physical worlds in numerous research and commercial applications, such as serving an augmented reality interface for mobile information retrieval. The various application scenarios give rise to a key technique of daily life visual object recognition. On-premise signs (OPSs), a popular form of commercial advertising, are widely used in our living life. The OPSs often exhibit great visual diversity (e.g., appearing in arbitrary size), accompanied with complex environmental conditions (e.g., foreground and background clutter). Observing that such real-world characteristics are lacking in most of the existing image data sets, in this paper, we first proposed an OPS data set, namely OPS-62, in which totally 4649 OPS images of 62 different businesses are collected from Google´s Street View. Further, for addressing the problem of real-world OPS learning and recognition, we developed a probabilistic framework based on the distributional clustering, in which we proposed to exploit the distributional information of each visual feature (the distribution of its associated OPS labels) as a reliable selection criterion for building discriminative OPS models. Experiments on the OPS-62 data set demonstrated the outperformance of our approach over the state-of-the-art probabilistic latent semantic analysis models for more accurate recognitions and less false alarms, with a significant 151.28% relative improvement in the average recognition rate. Meanwhile, our approach is simple, linear, and can be executed in a parallel fashion, making it practical and scalable for large-scale multimedia applications.
  • Keywords
    learning (artificial intelligence); mobile computing; object recognition; OPS data set; OPS-62; augmented reality interface; camera-enabled mobile devices; complex environmental conditions; daily life visual object recognition; distributional clustering; mobile information retrieval; on-premise signs; probabilistic framework; probabilistic latent semantic analysis models; real-world OPS learning; weakly labeled street view images; Business; Feature extraction; Image color analysis; Image recognition; Probabilistic logic; Training; Visualization; Real-world objects; learning and recognition; object image data set; street view scenes;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2014.2298982
  • Filename
    6705667