DocumentCode :
1290639
Title :
The thoughtful elephant: strategies for spoken dialog systems
Author :
Souvignier, Bernd ; Kellner, Andreas ; Rueber, Bernhard ; Schramm, Hauke ; Seide, Frank
Author_Institution :
Philips Res. Lab., Aachen, Germany
Volume :
8
Issue :
1
fYear :
2000
fDate :
1/1/2000 12:00:00 AM
Firstpage :
51
Lastpage :
62
Abstract :
We present technology used in spoken dialog systems for a wide range of applications, including tasks from the travel domain, automatic switchboards, and large-scale directory assistance. The overall goal in developing spoken dialog systems is to allow for a natural and flexible dialog flow similar to human-human interaction. This imposes the challenging task of recognizing and interpreting user input when the user is allowed to choose from an unrestricted vocabulary and an infinite set of possible formulations. We therefore put emphasis on strategies that make the system more robust while still maintaining a high level of naturalness and flexibility. In view of this paradigm, we found that two fundamental principles characterize many of the proposed methods: to consider available sources of information as early as possible, and to keep alternative hypotheses and delay the decision for a single option as long as possible. We describe how our system architecture caters to incorporating application-specific knowledge, including, for example, database constraints, in the determination of the best sentence hypothesis for a user turn. On the next higher level, we use the dialog history to assess the plausibility of a sentence hypothesis by applying consistency checks with information items from previous user turns. In particular, we demonstrate how combination decisions over several turns can be exploited to boost the recognition performance of the system.
Keywords :
interactive systems; natural language interfaces; speech recognition; speech-based user interfaces; vocabulary; application specific knowledge; automatic switchboards; database constraints; dialog history; human interaction; large scale directory assistance; natural language understanding; sentence hypothesis; spoken dialog systems; travel domain; vocabulary; Databases; Delay; Glass; History; Information resources; Large-scale systems; Natural languages; Robustness; System testing; Vocabulary;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.817453
Filename :
817453