Title :
Multimodal tracking and classification of audio-visual features
Author_Institution :
Beckman Inst. for Adv. Sci. & Technol., Illinois Univ., Urbana, IL, USA
Abstract :
The surge of interest in multimedia and multimodal interfaces has prompted the need for novel estimation and classification techniques for data from different but coupled modalities. Unimodal techniques ported to this domain have exhibited only limited success. We propose a new framework for feature prediction and classification based on multimodal knowledge-constrained hidden Markov models (HMMs). The classical role of HMMs as statistical classifiers is enhanced by their new role as multimodal feature predictors. Moreover, by fusing the multimodal formulation with higher-level knowledge, we allow the influence of such knowledge to be reflected in feature prediction as well as in feature classification.
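Illustrative note (not part of the original abstract): the two roles the abstract assigns to an HMM, classifier and feature predictor, can be sketched with a textbook discrete HMM. This is a minimal single-stream sketch under stated assumptions, not the authors' multimodal knowledge-constrained formulation: the coupling between modalities and the higher-level knowledge constraints are not modelled, and the function names (forward, classify, predict_next) and the two toy models ("sticky", "flippy") are hypothetical.

```python
import math

def forward(pi, A, B, obs):
    """Scaled forward pass for a discrete HMM.

    pi: initial state probabilities, A: state transition matrix,
    B: per-state emission probabilities, obs: list of symbol indices.
    Returns (log-likelihood of obs, state posterior after the last symbol).
    """
    n = len(pi)
    alpha = [pi[i] * B[i][obs[0]] for i in range(n)]
    loglik = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = [sum(alpha[i] * A[i][j] for i in range(n)) * B[j][obs[t]]
                     for j in range(n)]
        scale = sum(alpha)
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]  # normalise to avoid underflow
    return loglik, alpha

def classify(models, obs):
    """Classifier role: pick the model that gives obs the highest likelihood."""
    return max(models, key=lambda name: forward(*models[name], obs)[0])

def predict_next(pi, A, B, obs):
    """Predictor role: distribution over the next symbol given obs so far."""
    _, post = forward(pi, A, B, obs)
    n, m = len(pi), len(B[0])
    next_state = [sum(post[i] * A[i][j] for i in range(n)) for j in range(n)]
    return [sum(next_state[j] * B[j][k] for j in range(n)) for k in range(m)]

# Hypothetical toy models: same emissions, different transition behaviour.
EMIT = [[0.9, 0.1], [0.1, 0.9]]
MODELS = {
    "sticky": ([0.5, 0.5], [[0.9, 0.1], [0.1, 0.9]], EMIT),  # tends to stay
    "flippy": ([0.5, 0.5], [[0.1, 0.9], [0.9, 0.1]], EMIT),  # tends to switch
}

if __name__ == "__main__":
    print(classify(MODELS, [0, 0, 0, 0, 1, 1, 1, 1]))
    print(predict_next(*MODELS["sticky"], [0, 0, 0]))
```

In this sketch the same forward recursion serves both roles: its likelihood ranks competing models, and its final state posterior, pushed one step through the transition and emission tables, yields a prediction of the next feature symbol.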
Keywords :
"Hidden Markov models","Speech recognition","State estimation","Bayesian methods","Surges","Feedback","Laboratories","Source separation","Mars","Network topology"
Conference_Title :
Image Processing, 1998. ICIP 98. Proceedings. 1998 International Conference on
Print_ISBN :
0-8186-8821-1
DOI :
10.1109/ICIP.1998.723492