Title :
Joint audio-video processing for multimedia
Author :
Chen, Tsuhan ; Rao, Ram
Author_Institution :
AT&T Res., Holmdel, NJ, USA
Abstract :
In this paper, the authors report recent developments in research on joint audio-visual processing for multimedia applications. These include bimodality in speech production and perception, automatic lipreading, talking-head animation, and lip synchronization. They present in detail the enabling technologies for these applications. A new research trend is to exploit audio-visual interaction in the coding of talking-head video. They show that the marriage of speech analysis and image processing can create a number of new research opportunities.
Keywords :
computer animation; multimedia communication; multimedia computing; speech coding; speech processing; synchronisation; video coding; audio-visual interaction; automatic lipreading; bimodality; enabling technologies; image processing; joint audio-visual processing; lip synchronization; multimedia applications; recent developments; research opportunities; speech analysis; speech perception; speech production; talking head video coding; talking-head animation; Humans; Image analysis; Image converters; Lips; Mouth; Speech analysis; Speech recognition; Teeth; Tongue; Videoconference
Conference_Title :
Industrial Electronics, Control, and Instrumentation, 1996., Proceedings of the 1996 IEEE IECON 22nd International Conference on
Conference_Location :
Taipei
Print_ISBN :
0-7803-2775-6
DOI :
10.1109/IECON.1996.571012