Title :
What type of inputs will we need for expressive speech synthesis?
Author_Institution :
ATR Human Inf. Sci. Labs., Kyoto, Japan
Abstract :
Speech synthesis is not necessarily synonymous with text-to-speech. This paper describes an implementation for a talking machine that produces multilingual conversational utterances from a combination of speaker, language, speaking-style, and content information, using icon-based input. The paper addresses the problems of specifying the text-content of a conversational utterance from a combination of conceptual icons, in conjunction with language and speaker information. It concludes that in order to specify the speech content (text details and speaking-style) adequately, further selection options for speaker-commitment will be required.
Keywords :
graphical user interfaces; speech synthesis; speech-based user interfaces; content information; expressive speech synthesis; icon-based input; language information; multilingual conversational utterances; speaker information; speaking-style information; talking machine; Electrostatic precipitators; Human voice; IEEE news; Information science; Laboratories; Natural languages; Personnel; Speech processing; Speech synthesis; Synthesizers;
Conference_Titel :
Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshop on
Print_ISBN :
0-7803-7395-2
DOI :
10.1109/WSS.2002.1224381