DocumentCode :
2972777
Title :
Weighted finite state transducer based statistical dialog management
Author :
Hori, Chiori ; Ohtake, Kazuki ; Misu, Teruhisa ; Kashioka, Hideki ; Nakamura, Shigenari
Author_Institution :
MASTAR Project, Spoken Language Commun. Group, Nat. Inst. of Inf. & Commun. Technol. (NICT), Seika, Japan
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
490
Lastpage :
495
Abstract :
We proposed a dialog system using a weighted finite-state transducer (WFST) in which user concept and system action tags are input and output of the transducer, respectively. The WFST-based platform for dialog management enables us to combine various statistical models for dialog management (DM), user input understanding and system action generation, and then search the best system action in response to user inputs among multiple hypotheses. To test the potential of the WFST-based DM platform using statistical models, we constructed a dialog system using a human-to-human spoken dialog corpus for hotel reservation, which is annotated with Interchange Format (IF). A scenario WFST and a spoken language understanding (SLU) WFST were obtained from the corpus and then composed together and optimized. We evaluated the detection accuracy of the system next action tags using Mean Reciprocal Ranking (MRR). Finally, we constructed a full WFST-based dialog system by composing SLU, scenario and sentence generation (SG) WFSTs. Humans read the system responses in natural language and judged the quality of the responses. We confirmed that the WFST-based DM platform was capable of handling various spoken language and scenarios when the user concept and system action tags are consistent and distinguishable.
Keywords :
interactive systems; natural language processing; statistical analysis; dialog management; human-to-human spoken dialog corpus; interchange format; mean reciprocal ranking; spoken language understanding; statistical models; weighted finite state transducer; Communications technology; Computer languages; Delta modulation; Humans; Natural languages; Probability; Project management; System testing; Technology management; Transducers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5373350
Filename :
5373350
Link To Document :
بازگشت