Title :
Audio-visual interaction in multimedia communication
Author :
Chen, Tsuhan ; Rao, Ram R.
Author_Institution :
Res., AT&T Bell Labs., Holmdel, NJ, USA
Abstract :
To many people, the word “multimedia” simply means the combination of various forms of information: text, speech, music, images, graphics and video. What is often overlooked is the interaction among these forms. In this paper, we present our results in exploiting the audio-visual interaction that is very significant in multimedia communication. The applications include lip synchronization, joint audio-video coding, and person verification. We present the enabling technologies, including audio-to-visual mapping and facial image analysis, for these applications. Our results show that the joint processing of audio and video provides advantages that are not available when audio and video are studied separately
Keywords :
multilayer perceptrons; multimedia communication; probability; speech processing; speech recognition; teleconferencing; video coding; audio-to-visual mapping; audio-visual interaction; enabling technologies; facial image analysis; joint audio-video coding; lip synchronization; multimedia communication; person verification; Background noise; Graphics; Humans; Image analysis; Image converters; Mouth; Multimedia communication; Shape; Speech; Videoconference;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.599592