DocumentCode :
3281987
Title :
Multimedia/multimodal signal processing, analysis, and understanding
Author :
Huang, Thomas S.
Author_Institution :
Beckman Inst. for Adv. Sci. & Technol., Illinois Univ., Urbana-Champaign, IL, USA
fYear :
2004
fDate :
29 Sept.-1 Oct. 2004
Firstpage :
5
Abstract :
Summary form only given. "Multimodal" refers to the different senses (visual, audio, tactile, etc.) used in human-computer interface. "Multimedia" refers to the different ways of representing information (text, graphics, audio, images, video, etc.). A signal processing, analysis, or understanding task is called multimedia/multimodal if it involves two or more modalities or media interacting in nontrivial ways. We shall give an array of examples of multimedia/multimodal signal processing, analysis, and understanding, including audio/visual speech recognition and audio/visual emotion recognition. A stable and robust facial movement tracking algorithm is presented which is used in both tasks.
Keywords :
audio signal processing; emotion recognition; multimedia computing; speech recognition; audio-visual emotion recognition; audio-visual speech recognition; facial movement tracking algorithm; human-computer interface; multimedia signal processing; multimodal signal processing; Array signal processing; Emotion recognition; Graphics; Robustness; Signal analysis; Signal processing; Signal processing algorithms; Speech analysis; Speech recognition; Video signal processing
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Multimedia Signal Processing, 2004 IEEE 6th Workshop on
Print_ISBN :
0-7803-8578-0
Type :
conf
DOI :
10.1109/MMSP.2004.1436396
Filename :
1436396