Title :
Modeling disfluency and background events in ASR for a natural language understanding task
Author :
Rose, R.C. ; Riccardi, G.
Author_Institution :
Res. Dept., AT&T Bell Labs., Florham Park, NJ, USA
Abstract :
This paper investigates techniques for minimizing the impact of non-speech events on the performance of large vocabulary continuous speech recognition (LVCSR) systems. An experimental study is presented that investigates whether the careful manual labeling of disfluency and background events in conversational speech can be used to provide an additional level of supervision in training HMM acoustic models and statistical language models. First, techniques are investigated for incorporating explicitly labeled disfluency and background events directly into the acoustic HMM. Second, phrase-based statistical language models are trained from utterance transcriptions which include labeled instances of these events. Finally, it is shown that significant word accuracy and run-time performance improvements are obtained for both sets of techniques on a telephone-based spoken language understanding task
Keywords :
computational linguistics; hidden Markov models; natural languages; speech recognition; ASR; HMM acoustic models; LVCSR systems; background events; conversational speech; disfluency; large vocabulary continuous speech recognition; manual labeling; natural language understanding task; nonspeech events; performance; phrase-based statistical language models; run-time; statistical language model; supervision; telephone-based spoken language understanding task; utterance transcriptions; word accuracy; Automatic speech recognition; Background noise; Hidden Markov models; Information analysis; Labeling; Manuals; Natural languages; Runtime; Speech enhancement; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758132