• DocumentCode
    2293274
  • Title

    Learning semantic multimedia representations from a small set of examples

  • Author

    Naphade, Milind R. ; Lin, Ching-Yung ; Smith, John R.

  • Author_Institution
    Pervasive Media Manage. Group, IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    513
  • Abstract
    We approach the problem of semantic multimedia retrieval as a supervised learning problem. Defining a lexicon of a small number of interesting semantic concepts we can handle a number of semantic queries. Since the number of interesting concepts available for training is usually small we explore discriminant learning techniques. In particular, we examine the use of kernel based methods and demonstrate impressive retrieval performance using semantic concepts like rocket, outdoor, greenery, sky and face. We also show that loosely coupled multimodal events can be detected based on the late fusion of detection of related auditory and visual concepts. Using a Bayesian network for inference we show how a rocket-launch event can be detected based on the detection of a related visual concept (rocket object) and a related auditory concept (explosion/blast-off).
  • Keywords
    belief networks; inference mechanisms; information retrieval; learning by example; multimedia databases; query processing; Bayesian inference network; auditory concepts; discriminant learning techniques; explosion/blast-off; interesting semantic concepts; kernel based methods; learning from examples; lexicon; loosely coupled multimodal events; retrieval performance; rocket-launch event; semantic multimedia representations; semantic multimedia retrieval; semantic queries; supervised learning; visual concepts; Bayesian methods; Event detection; Face detection; Kernel; Layout; Object detection; Rockets; Search engines; Spatial databases; Supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2002. ICME '02. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7803-7304-9
  • Type

    conf

  • DOI
    10.1109/ICME.2002.1035661
  • Filename
    1035661