DocumentCode :
302178
Title :
Improvements in switchboard recognition and topic identification
Author :
Peskin, B. ; Connolly, S. ; Gillick, L. ; Lowe, S.
Author_Institution :
Dragon Syst. Inc., Newton, MA
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
303
Abstract :
We revisit a topic identification test on the Switchboard Corpus first reported by Gillick et al. (see Proc. ICASSP-93, 1993 and ARPA Workshop on Human Language Technology, 1993). This approach to topic ID uses a large vocabulary continuous speech recognizer as a front-end to transcribe the speech and then scores the transcripts using a set of topic-specific language models. Our recognition of conversational telephone speech has improved dramatically in the three years since the original test, dropping from word error rates in the 90%´s to those in the 40%´s. Changing only the recognition engine but otherwise leaving our 1993 topic ID system in place, the resulting rate of message misclassification drops from 33/120 in 1993 down to 1/120 now-the same error rate that we obtain from the true transcriptions. This paper describes the topic classification test and the many improvements to the recognition engine that made such a dramatic reduction possible
Keywords :
information retrieval systems; natural languages; online front-ends; speech recognition; telephony; Switchboard Corpus; conversational telephone speech; error rate; front-end; large vocabulary continuous speech recognizer; message misclassification; recognition engine; switchboard recognition; topic ID; topic classification test; topic identification test; topic-specific language models; transcripts; word error rates; Engines; Error analysis; Frequency; Natural languages; Robustness; Signal processing; Speech recognition; System testing; Telephony; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.540418
Filename :
540418
Link To Document :
بازگشت