DocumentCode :
3167873
Title :
Integration of multimodal features for video scene classification based on HMM
Author :
Huang, J. ; Liu, Z. ; Wang, Y. ; Chen, Y. ; Wong, E.K.
Author_Institution :
Dept. of Electr. Eng., Polytech. Univ., Brooklyn, NY, USA
fYear :
1999
fDate :
1999
Firstpage :
53
Lastpage :
58
Abstract :
Along with the advances in multimedia and Internet technology, a huge amount of data, including digital video and audio, are generated daily. Tools for the efficient indexing and retrieval of such data are indispensable. With multi-modal information present in the data, effective integration is necessary and is still a challenging problem. In this paper, we present four different methods for integrating audio and visual information for video classification based on a hidden Markov model (HMM): direct concatenation, product HMM, two-stage HMM, and integration by neural network. Our results have shown significant improvements over using a single modality
Keywords :
Internet; audio-visual systems; content-based retrieval; database indexing; hidden Markov models; multimedia systems; neural nets; video databases; video signal processing; Internet; audio-visual information; data retrieval; direct concatenation; hidden Markov model; indexing; multi-modal information; multimedia; multimodal feature integration; neural network; video scene classification; Bandwidth; Frequency; Games; Hidden Markov models; Internet; Layout; Speech synthesis; TV; Videoconference; Weather forecasting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia Signal Processing, 1999 IEEE 3rd Workshop on
Conference_Location :
Copenhagen
Print_ISBN :
0-7803-5610-1
Type :
conf
DOI :
10.1109/MMSP.1999.793797
Filename :
793797
Link To Document :
بازگشت