Author_Institution :
Dept. of Software Production Res., Lucent Technol. Bell Labs., Naperville, IL, USA
Abstract :
Many computing professionals have heard of XML, and some use it to describe text, images and other data with rich structure. The author discusses an innovative use of XML, called VoiceXML, to support human-computer dialogs via spoken input and audio output. VoiceXML defines dialogs between humans and machines in terms of audio files to be played, text-to-speech synthesis and speech recognition capabilities, and touch-tone input. The author reviews the existing architectures for World Wide Web and telephone services, describes how VoiceXML enables consolidation of service logic for Web and phone, and summarizes the features of the VoiceXML 1.0 specification. Implementation of VoiceXML clients and VoiceXML services has begun in many of the VoiceXML Forum's member companies, and these implementations will soon be available in the marketplace. The World Wide Web Consortium's Voice Browser working group has adopted VoiceXML 1.0 as the basis for the dialog markup language that is part of its speech user interface framework.
Keywords :
hypermedia markup languages; information resources; speech recognition; speech synthesis; speech-based user interfaces; VoiceXML 1.0 specification; World Wide Web Voice Browser; XML; audio files; audio output; dialogue markup language; human-computer dialogue; service architectures; service logic consolidation; speech recognition capabilities; speech user interface; spoken input; telephone services; text-to-speech synthesis; touch-tone input; voice-enabled World Wide Web; Humans; Logic; Markup languages; Service oriented architecture; Telephony; User interfaces; Web sites