مرکز منطقه ای اطلاع رساني علوم و فناوري - Modeling disfluency and background events in ASR for a natural language understanding task

DocumentCode :

336777

Title :

Modeling disfluency and background events in ASR for a natural language understanding task

Author :

Rose, R.C. ; Riccardi, G.

Author_Institution :

Res. Dept., AT&T Bell Labs., Florham Park, NJ, USA

Volume :

fYear :

1999

fDate :

15-19 Mar 1999

Firstpage :

341

Abstract :

This paper investigates techniques for minimizing the impact of non-speech events on the performance of large vocabulary continuous speech recognition (LVCSR) systems. An experimental study is presented that investigates whether the careful manual labeling of disfluency and background events in conversational speech can be used to provide an additional level of supervision in training HMM acoustic models and statistical language models. First, techniques are investigated for incorporating explicitly labeled disfluency and background events directly into the acoustic HMM. Second, phrase-based statistical language models are trained from utterance transcriptions which include labeled instances of these events. Finally, it is shown that significant word accuracy and run-time performance improvements are obtained for both sets of techniques on a telephone-based spoken language understanding task

Keywords :

computational linguistics; hidden Markov models; natural languages; speech recognition; ASR; HMM acoustic models; LVCSR systems; background events; conversational speech; disfluency; large vocabulary continuous speech recognition; manual labeling; natural language understanding task; nonspeech events; performance; phrase-based statistical language models; run-time; statistical language model; supervision; telephone-based spoken language understanding task; utterance transcriptions; word accuracy; Automatic speech recognition; Background noise; Hidden Markov models; Information analysis; Labeling; Manuals; Natural languages; Runtime; Speech enhancement; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on

Conference_Location :

Phoenix, AZ

ISSN :

1520-6149

Print_ISBN :

0-7803-5041-3

Type :

conf

DOI :

10.1109/ICASSP.1999.758132

Filename :

758132

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=336777