Title :
A probabilistic framework for semantic video indexing, filtering, and retrieval
Author :
Naphide, H.R. ; Huang, Thomas S.
Author_Institution :
Dept. of Electr. & Comput. Eng., Illinois Univ., Urbana, IL, USA
fDate :
3/1/2001 12:00:00 AM
Abstract :
Semantic filtering and retrieval of multimedia content is crucial for efficient use of the multimedia data repositories. Video query by semantic keywords is one of the most difficult problems in multimedia data retrieval. The difficulty lies in the mapping between low-level video representation and high-level semantics. We therefore formulate the multimedia content access problem as a multimedia pattern recognition problem. We propose a probabilistic framework for semantic video indexing, which call support filtering and retrieval and facilitate efficient content-based access. To map low-level features to high-level semantics we propose probabilistic multimedia objects (multijects). Examples of multijects in movies include explosion, mountain, beach, outdoor, music etc. Semantic concepts in videos interact and to model this interaction explicitly, we propose a network of multijects (multinet). Using probabilistic models for six site multijects, rocks, sky, snow, water-body forestry/greenery and outdoor and using a Bayesian belief network as the multinet we demonstrate the application of this framework to semantic indexing. We demonstrate how detection performance can be significantly improved using the multinet to take interconceptual relationships into account. We also show how the multinet can fuse heterogeneous features to support detection based on inference and reasoning
Keywords :
content-based retrieval; database indexing; multimedia databases; video databases; Bayesian belief networks; ROC curves; content access problem; content-based access; hidden Markov models; inference; likelihood ratio test; multimedia content; multimedia data repositories; multimedia data retrieval; multimedia pattern recognition; multimedia understanding; multinet; probabilistic graphical networks; query by example; query by keywords; reasoning; semantic filtering; semantic keywords; semantic video indexing; video indexing; video query; video representation; Content based retrieval; Explosions; Filtering; Forestry; Indexing; Information retrieval; Motion pictures; Music information retrieval; Pattern recognition; Snow;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/6046.909601