Parallel computing-based architecture for mixed-initiative spoken dialogue

Author

Taguma, Ryuta ; Moriyama, Tatsuhiro ; Iwano, K. ; Furui, Sadaoki

Author_Institution

Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan

fYear

2002

fDate

2002

Firstpage

53

Lastpage

58

Abstract

This paper describes a new method of implementing mixed-initiative spoken dialogue systems based on a parallel computing architecture. In a mixed-initiative dialogue, the user as well as the system must be able to control the dialogue sequence. In our implementation, language models corresponding to different dialogue contents, such as requests for information or replies to the system, are built, and multiple recognizers using these language models run in parallel. The dialogue content of the user's utterance is automatically detected from the likelihood scores given by the recognizers, and the detected content is used to build the dialogue. The transition probability from a dialogue state in which one kind of content is uttered to a state in which a different kind is uttered is incorporated into the likelihood score. This architecture yields a flexible dialogue structure that gives users the initiative to control the dialogue. Real-time dialogue systems for retrieving information about restaurants and food stores are built and evaluated in terms of dialogue content identification rate and keyword accuracy. The proposed architecture has the advantage that the dialogue system can be easily modified without rebuilding the whole language model.
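
The content-detection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the word-matching "recognizers", the vocabularies, the probability values, and every name below (make_recognizer, RECOGNIZERS, TRANSITIONS, detect_content) are hypothetical, and a thread pool stands in for the parallel computing architecture. The sketch only shows the idea of running one recognizer per dialogue content, adding the log transition probability to each log-likelihood score, and choosing the content with the highest combined score.

```python
import math
from concurrent.futures import ThreadPoolExecutor


def make_recognizer(vocabulary, p_in=0.5, p_out=0.01):
    """Toy stand-in for a recognizer with a content-specific language model.

    Assumption: each word gets a fixed probability depending only on
    whether it is in the model's vocabulary. A real recognizer would
    score audio, not text.
    """
    def recognize(utterance):
        words = utterance.lower().split()
        # Log-likelihood of the utterance under this toy language model.
        return sum(math.log(p_in if w in vocabulary else p_out) for w in words)
    return recognize


# Hypothetical dialogue contents, one recognizer per content.
RECOGNIZERS = {
    "request": make_recognizer({"find", "italian", "restaurant", "near"}),
    "reply": make_recognizer({"yes", "no", "please"}),
}

# Hypothetical transition probabilities: previous content -> next content.
TRANSITIONS = {
    "request": {"request": 0.4, "reply": 0.6},
    "reply": {"request": 0.7, "reply": 0.3},
}


def detect_content(utterance, prev_content,
                   recognizers=RECOGNIZERS, transitions=TRANSITIONS):
    """Run all recognizers concurrently and pick the best-scoring content."""
    with ThreadPoolExecutor(max_workers=len(recognizers)) as pool:
        futures = {c: pool.submit(r, utterance) for c, r in recognizers.items()}
        likelihoods = {c: f.result() for c, f in futures.items()}
    # Fold the transition probability into each likelihood score (log domain).
    combined = {
        c: likelihoods[c] + math.log(transitions[prev_content][c])
        for c in likelihoods
    }
    return max(combined, key=combined.get), combined


best, scores = detect_content("find italian restaurant", prev_content="reply")
print(best, scores)
```

Because each content has its own recognizer entry, adding a new dialogue content in this sketch means adding one entry to RECOGNIZERS and one row and column to TRANSITIONS, without touching the other models, which mirrors the modifiability advantage the abstract claims.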

Keywords

information retrieval; interactive systems; multimedia computing; natural language interfaces; parallel architectures; parallel programming; real-time systems; speech recognition; speech-based user interfaces; automatic dialogue content detection; dialogue content identification rate; dialogue sequence control; food stores; information retrieval; keyword accuracy; language models; likelihood scores; mixed-initiative spoken dialogue systems; multiple recognizers; parallel computing architecture; real-time dialogue systems; restaurants; transitional probability; Computer architecture; Computer interfaces; Concurrent computing;

fLanguage

English

Publisher

IEEE

Conference_Titel

Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on

Print_ISBN

0-7695-1834-6

Type

conf

DOI

10.1109/ICMI.2002.1166968

Filename

1166968