Title :
Language model switching based on topic detection for dialog speech recognition
Author :
Lane, Ian R. ; Kawahara, Tatsuya ; Matsui, Tomoko
Author_Institution :
Sch. of Informatics, Kyoto Univ., Japan
Abstract :
An efficient, scalable speech recognition architecture is proposed for multidomain dialog systems by combining topic detection and topic-dependent language modeling. The inferred domain is automatically detected from the user´s utterance, and speech recognition is then performed with an appropriate domain-dependent language model. The architecture improves accuracy and efficiency over current approaches and is scaleable to a large number of domains. In this paper, a novel framework using a multilayer hierarchy of language models is introduced in order to improve robustness against topic detection errors. The proposed system provides a relative reduction in WER of 10.5% over a single language model system. Furthermore it achieves an accuracy that is comparable to using multiple language models in parallel while using only a fraction of the computational cost.
Keywords :
interactive systems; natural language interfaces; speech recognition; WER; accuracy; dialog speech recognition; domain-dependent language model; language model switching; multidomain dialog systems; multilayer hierarchy; robustness; scalable speech recognition architecture; topic detection; topic detection errors; topic-dependent language modeling; Computational efficiency; Decoding; Informatics; Laboratories; Natural languages; Robustness; Routing; Speech recognition; Switches; Usability;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198856