• DocumentCode
    794724
  • Title

    Extracting semantics from audio-visual content: the final frontier in multimedia retrieval

  • Author

    Naphade, Milind R. ; Huang, Thomas S.

  • Author_Institution
    IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
  • Volume
    13
  • Issue
    4
  • fYear
    2002
  • fDate
    7/1/2002 12:00:00 AM
  • Firstpage
    793
  • Lastpage
    810
  • Abstract
    Multimedia understanding is a fast emerging interdisciplinary research area. There is tremendous potential for effective use of multimedia content through intelligent analysis. Diverse application areas are increasingly relying on multimedia understanding systems. Advances in multimedia understanding are related directly to advances in signal processing, computer vision, pattern recognition, multimedia databases, and smart sensors. We review the state-of-the-art techniques in multimedia retrieval. In particular, we discuss how multimedia retrieval can be viewed as a pattern recognition problem. We discuss how reliance on powerful pattern recognition and machine learning techniques is increasing in the field of multimedia retrieval. We review the state-of-the-art multimedia understanding systems with particular emphasis on a system for semantic video indexing centered around multijects and multinets. We discuss how semantic retrieval is centered around concepts and context and the various mechanisms for modeling concepts and context.
  • Keywords
    database indexing; image retrieval; information retrieval; knowledge representation; learning systems; multimedia databases; pattern recognition; Bayesian networks; decision theory; factor graphs; knowledge representation; machine learning; multimedia databases; multimedia retrieval; multimedia understanding systems; semantic video indexing; statistical pattern recognition; sum-product algorithm; Application software; Computer vision; Content based retrieval; Indexing; Intelligent sensors; Machine learning; Multimedia databases; Multimedia systems; Pattern recognition; Signal processing;
  • fLanguage
    English
  • Journal_Title
    Neural Networks, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1045-9227
  • Type

    jour

  • DOI
    10.1109/TNN.2002.1021881
  • Filename
    1021881