Title :
Using semantic class information for rapid development of language models within ASR dialogue systems
Author :
Fosler-Lussier, Eric ; Kuo, Hong-Kwang Jeff
Author_Institution :
Lucent Technol. Bell Labs., Murray Hill, NJ, USA
Abstract :
When dialogue system developers tackle a new domain, much effort is required; the development of different parts of the system usually proceeds independently. Yet it may be profitable to coordinate development efforts between different modules. We focus our efforts on extending small amounts of language model training data by integrating semantic classes that were created for a natural language understanding module. By converting finite state parses of a training corpus into a probabilistic context free grammar and subsequently generating artificial data from the context free grammar, we can significantly reduce perplexity and automatic speech recognition (ASR) word error for situations with little training data. Experiments are presented using data from the ATIS and DARPA Communicator travel corpora
Keywords :
context-free grammars; interactive systems; natural languages; probability; speech recognition; ATIS; DARPA Communicator travel corpora; automatic speech recognition dialogue systems; finite state parses; language models; natural language understanding module; perplexity; probabilistic context free grammar; semantic class information; word error; Automatic speech recognition; Cities and towns; Context; Databases; Engines; Helium; Natural languages; Probability; Speech recognition; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940891