Title :
Speech-assisted video processing: interpolation and low-bitrate coding
Author :
Chen, Tsuhan ; Graf, Hans Peter ; Wang, Kuansan
Author_Institution :
AT&T Bell Labs., Holmdel, NJ, USA
Date :
31 Oct-2 Nov 1994
Abstract :
We use speech information to improve the quality of audio/visual communications such as videotelephony, videoconferencing, and multimedia. In particular, the marriage of speech processing and image processing can solve problems related to lip synchronization. The paper proposes two main techniques: speech-assisted interpolation and speech-assisted coding of talking-head video. Audio/video sequences are presented to demonstrate these techniques.
Keywords :
audio-visual systems; image sequences; interpolation; speech coding; synchronisation; video coding; audio/video sequences; audio/visual communications; image processing; lip synchronization; low-bit rate coding; multimedia; speech information; speech processing; speech-assisted interpolation; speech-assisted video processing; talking head video coding; videoconferencing; videotelephony; Bit rate; Head; Image coding; Interpolation; Mouth; Speech analysis; Speech coding; Speech processing; Teleconferencing; Video coding
Conference_Title :
Signals, Systems and Computers, 1994. 1994 Conference Record of the Twenty-Eighth Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
0-8186-6405-3
DOI :
10.1109/ACSSC.1994.471605