Title :
Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams [video applications]
Author :
Xie, Lexing ; Kennedy, Lyndon ; Chang, Shih-Fu ; Divakaran, Ajay ; Sun, Huifang ; Lin, Ching-Yung
Author_Institution :
Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
Abstract :
We propose a layered dynamic mixture model for asynchronous multi-modal fusion for unsupervised pattern discovery in video. The lower layer of the model uses generative temporal structures such as a hierarchical hidden Markov model to convert the audiovisual streams into mid-level labels, it also models the correlations in text with probabilistic latent semantic analysis. The upper layer fuses the statistical evidence across diverse modalities with a flexible meta-mixture model that assumes loose temporal correspondence. Evaluation on a large news database shows that multi-modal clusters have better correspondence to news topics than audio-visual clusters alone; novel analysis techniques suggest that meaningful clusters occur when the prediction of salient features by the model concurs with those shown in the story clusters.
Keywords :
correlation methods; hidden Markov models; multimedia computing; pattern recognition; statistical analysis; video signal processing; asynchronous multimodal fusion; asynchronous multimodal video streams; audiovisual streams; hierarchical hidden Markov model; layered dynamic mixture model; meta-mixture model; multimodal clusters; pattern discovery; probabilistic latent semantic analysis; text correlations; unsupervised pattern discovery; Audio databases; Clustering algorithms; Fuses; Fusion power generation; Hidden Markov models; Inference algorithms; Predictive models; Spatial databases; Streaming media; Sun;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
Print_ISBN :
0-7803-8874-7
DOI :
10.1109/ICASSP.2005.1415589