Title :
Real-time streaming for the animation of talking faces in multiuser environments
Author :
Ostermann, Joern ; Rurainsky, Juergen ; Civanlar, Reha
Author_Institution :
AT&T Labs-Res., Middletown, NJ, USA
fDate :
2002
Abstract :
To enable face animation on the Internet using high-quality synthetic speech, text-to-speech (TTS) synthesis needs to be implemented on network-based servers that are shared by many users. The output of a TTS server is used to animate talking heads as defined in MPEG-4. The TTS server creates two sets of data: audio data, and phonemes with optional facial animation parameters (FAPs) such as a smile. To animate a talking head on a client, the output of the TTS server must be streamed to that client. Real-time streaming protocols for audio data already exist. We developed a real-time transport protocol with error recovery capability to stream phonemes and facial animation parameters (PFAP), which are used to animate the talking head. The stream was designed for interactive services and allows low-latency communication. The typical bit rate for enabling a talking face is less than 800 bit/s.
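The abstract does not give the PFAP packet format or say how error recovery works. As an illustration only, the sketch below assumes one common approach for low-latency streams: each packet repeats the most recent events, so an isolated loss can be repaired from the next packet without a retransmission round trip. The names `PfapEvent`, `packetize`, and `receive`, the field layout, and the redundancy depth are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class PfapEvent:
    """Hypothetical phoneme event; this layout is an assumption, not the paper's."""
    seq: int                   # position of the event in the stream
    phoneme: str               # phoneme symbol, e.g. "HH"
    duration_ms: int           # how long the phoneme is held
    fap: Optional[str] = None  # optional facial animation parameter, e.g. "smile"


Packet = Tuple[PfapEvent, ...]


def packetize(events: List[PfapEvent], redundancy: int = 2) -> List[Packet]:
    """Build one packet per event, carrying up to `redundancy` earlier events too."""
    packets = []
    for i in range(len(events)):
        start = max(0, i - redundancy)
        packets.append(tuple(events[start:i + 1]))
    return packets


def receive(packets: List[Optional[Packet]]) -> List[PfapEvent]:
    """Rebuild the event sequence; a lost packet is represented by None."""
    recovered = {}
    for packet in packets:
        if packet is None:
            continue
        for event in packet:
            # Keep the first copy seen; later redundant copies are identical.
            recovered.setdefault(event.seq, event)
    return [recovered[seq] for seq in sorted(recovered)]


if __name__ == "__main__":
    symbols = ["HH", "EH", "L", "OW"]
    events = [PfapEvent(i, s, 80, "smile" if i == 3 else None)
              for i, s in enumerate(symbols)]
    stream = packetize(events, redundancy=2)
    stream[1] = None  # simulate one lost packet
    print(receive(stream) == events)
```

With a redundancy depth of 2, this scheme tolerates up to two consecutive lost packets, at the cost of sending each event up to three times. Because a phoneme event is only a few bytes, such repetition is plausible within a very low bit rate budget, but whether the authors chose this mechanism is not stated in the abstract.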
Keywords :
Internet; client-server systems; computer animation; error correction codes; image coding; interactive systems; multimedia communication; multimedia servers; real-time systems; speech synthesis; transport protocols; MPEG-4; TTS servers; audio data; client; error recovery; face animation; facial animation parameters; high quality synthetic speech; interactive services; low latency communications; multiuser environments; network-based servers; phonemes; real-time streaming protocols; real-time transport protocol; smile; talking heads; text-to-speech servers; Delay; Facial animation; IP networks; MPEG 4 Standard; Network servers; Speech synthesis; Streaming media; Transport protocols; Web server;
Conference_Title :
Circuits and Systems, 2002. ISCAS 2002. IEEE International Symposium on
Print_ISBN :
0-7803-7448-7
DOI :
10.1109/ISCAS.2002.1009871