DocumentCode :
311368
Title :
Audio-visual interaction in multimedia communication
Author :
Chen, Tsuhan ; Rao, Ram R.
Author_Institution :
Res., AT&T Bell Labs., Holmdel, NJ, USA
Volume :
1
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
179
Abstract :
To many people, the word “multimedia” simply means the combination of various forms of information: text, speech, music, images, graphics and video. What is often overlooked is the interaction among these forms. In this paper, we present our results in exploiting the audio-visual interaction that is very significant in multimedia communication. The applications include lip synchronization, joint audio-video coding, and person verification. We present the enabling technologies, including audio-to-visual mapping and facial image analysis, for these applications. Our results show that the joint processing of audio and video provides advantages that are not available when audio and video are studied separately
Keywords :
multilayer perceptrons; multimedia communication; probability; speech processing; speech recognition; teleconferencing; video coding; audio-to-visual mapping; audio-visual interaction; enabling technologies; facial image analysis; joint audio-video coding; lip synchronization; multimedia communication; person verification; Background noise; Graphics; Humans; Image analysis; Image converters; Mouth; Multimedia communication; Shape; Speech; Videoconference;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.599592
Filename :
599592
Link To Document :
بازگشت