Title :
Talking heads and synthetic speech: an architecture for supporting electronic commerce
Author :
Ostermann, Jörn ; Millen, David
Author_Institution :
Labs.-Res., AT&T, Red Bank, NJ, USA
Abstract :
Facial animation has been combined with text-to-speech synthesis to create innovative multimodal interfaces. In this paper, we present an architecture for such a multimodal interface. A face model is downloaded from a server into a client. The client uses an MPEG-4 compliant speech synthesizer that animates the head. The server sends text and animation data to the client in addition to regular content to be displayed by a Web browser. We believe that this architecture can support electronic commerce by providing a more friendly, helpful and intuitive user interface compared to that of a regular Web browser. In order to substantiate these claims, we undertook experiments to understand user reactions to interactive services designed with synthetic characters. In one experiment, participants played the `Social Dilemma´ game with the computer as a partner. The results indicate that users cooperate more with a computer when an animated face is representing the computer during the game. A simulated commercial application was evaluated, also comparing the performance of facial animation, text-to-speech and text-only conditions. According to the results, the use of facial animation in the design of interactive services was favorably rated for most of the attributes in these experiments. Further, the results show that facial animation may effectively fill application-waiting times and make delays more acceptable to the users
Keywords :
client-server systems; computer animation; computer games; delays; electronic commerce; graphical user interfaces; human factors; multimedia computing; online front-ends; software architecture; speech synthesis; speech-based user interfaces; MPEG-4 compliant speech synthesizer; Social Dilemma game; Web browser; application-waiting times; client-server system; delays; electronic commerce architecture; face model downloading; facial animation; interactive services; intuitive user interface; multimodal interfaces; performance; simulated commercial application; synthetic characters; synthetic speech; talking heads; text-only condition; text-to-speech synthesis; user cooperation; user reactions; Application software; Computational modeling; Electronic commerce; Facial animation; Games; MPEG 4 Standard; Service oriented architecture; Speech synthesis; Synthesizers; User interfaces;
Conference_Titel :
Multimedia and Expo, 2000. ICME 2000. 2000 IEEE International Conference on
Conference_Location :
New York, NY
Print_ISBN :
0-7803-6536-4
DOI :
10.1109/ICME.2000.869548