• DocumentCode
    2395577
  • Title

    Scene classification with low-dimensional semantic spaces and weak supervision

  • Author

    Rasiwasia, Nikhil ; Vasconcelos, Nuno

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of California, San Diego, La Jolla, CA
  • fYear
    2008
  • fDate
    23-28 June 2008
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    A novel approach to scene categorization is proposed. Similar to previous works of [11, 15, 3, 12], we introduce an intermediate space, based on a low dimensional semantic ldquothemerdquo image representation. However, instead of learning the themes in an unsupervised manner, they are learned with weak supervision, from casual image annotations. Each theme induces a probability density on the space of low-level features, and images are represented as vectors of posterior theme probabilities. This enables an image to be associated with multiple themes, even when there are no multiple associations in the training labels. An implementation is presented and compared to various existing algorithms, on benchmark datasets. It is shown that the proposed low dimensional representation correlates well with human scene understanding, and is able to learn theme co-occurrences without explicit training. It is also shown to outperform unsupervised latent-space methods, with much smaller training complexity, and to achieve performance close to the state of the art methods, which rely on much higher-dimensional image representations. Finally a study of the effect of dimensionality on the classification performance is presented, indicating that the dimensionality of theme space grows sub-linearly with the number of scene categories.
  • Keywords
    image classification; image representation; semantic networks; unsupervised learning; benchmark datasets; casual image annotations; low dimensional representation; low-dimensional semantic spaces; posterior theme probabilities; probability density; scene classification; theme image representation; unsupervised latent-space methods; weak supervision; Cities and towns; Computer vision; Frequency; Humans; Image representation; Large-scale systems; Layout; Object detection; Robustness; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-2242-5
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2008.4587372
  • Filename
    4587372