مرکز منطقه ای اطلاع رساني علوم و فناوري - Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams [video applications]

DocumentCode :

431595

Title :

Layered dynamic mixture model for pattern discovery in asynchronous multi-modal streams [video applications]

Author :

Xie, Lexing ; Kennedy, Lyndon ; Chang, Shih-Fu ; Divakaran, Ajay ; Sun, Huifang ; Lin, Ching-Yung

Author_Institution :

Dept. of Electr. Eng., Columbia Univ., New York, NY, USA

Volume :

fYear :

2005

fDate :

18-23 March 2005

Abstract :

We propose a layered dynamic mixture model for asynchronous multi-modal fusion for unsupervised pattern discovery in video. The lower layer of the model uses generative temporal structures such as a hierarchical hidden Markov model to convert the audiovisual streams into mid-level labels, it also models the correlations in text with probabilistic latent semantic analysis. The upper layer fuses the statistical evidence across diverse modalities with a flexible meta-mixture model that assumes loose temporal correspondence. Evaluation on a large news database shows that multi-modal clusters have better correspondence to news topics than audio-visual clusters alone; novel analysis techniques suggest that meaningful clusters occur when the prediction of salient features by the model concurs with those shown in the story clusters.

Keywords :

correlation methods; hidden Markov models; multimedia computing; pattern recognition; statistical analysis; video signal processing; asynchronous multimodal fusion; asynchronous multimodal video streams; audiovisual streams; hierarchical hidden Markov model; layered dynamic mixture model; meta-mixture model; multimodal clusters; pattern discovery; probabilistic latent semantic analysis; text correlations; unsupervised pattern discovery; Audio databases; Clustering algorithms; Fuses; Fusion power generation; Hidden Markov models; Inference algorithms; Predictive models; Spatial databases; Streaming media; Sun;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

0-7803-8874-7

Type :

conf

DOI :

10.1109/ICASSP.2005.1415589

Filename :

1415589

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=431595