• DocumentCode
    387491
  • Title

    Parallel computing-based architecture for mixed-initiative spoken dialogue

  • Author

    Taguma, Ryuta ; Moriyama, Tatsuhiro ; Twano, K. ; Furui, Sadaoki

  • Author_Institution
    Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    53
  • Lastpage
    58
  • Abstract
    This paper describes a new method of implementing mixed-initiative spoken dialogue systems based on parallel computing architecture. In a mixed-initiative dialogue, the user as well as the system needs to be capable of controlling the dialogue sequence. In our implementation, various language models corresponding to different dialogue contents, such as requests for information or replies to the system, are built and multiple recognizers using these language models are driven under a parallel computing architecture. The dialogue content of the user is automatically detected based on likelihood scores given by the recognizers, and the content is used to build the dialogue. A transitional probability from one dialogue state uttering a kind of content to another state uttering a different content is incorporated into the likelihood score. A flexible dialogue structure that gives users the initiative to control the dialogue is implemented by this architecture. Real-time dialogue systems for retrieving information about restaurants and food stores are built and evaluated in terms of dialogue content identification rate and keyword accuracy. The proposed architecture has the advantage that the dialogue system can be easily modified without remaking the whole language model.
  • Keywords
    information retrieval; interactive systems; multimedia computing; natural language interfaces; parallel architectures; parallel programming; real-time systems; speech recognition; speech-based user interfaces; automatic dialogue content detection; dialogue content identification rate; dialogue sequence control; food stores; information retrieval; keyword accuracy; language models; likelihood scores; mixed-initiative spoken dialogue systems; multiple recognizers; parallel computing architecture; real-time dialogue systems; restaurants; transitional probability; Computer architecture; Computer interfaces; Concurrent computing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on
  • Print_ISBN
    0-7695-1834-6
  • Type

    conf

  • DOI
    10.1109/ICMI.2002.1166968
  • Filename
    1166968