DocumentCode
1161265
Title
Analysis of the errors produced by the 2004 BBN speech recognition system in the DARPA EARS evaluations
Author
Duta, Nicolae ; Schwartz, Richard ; Makhoul, John
Author_Institution
Speech & Language Process. Dept., BBN Technol., Cambridge, MA
Volume
14
Issue
5
fYear
2006
Firstpage
1745
Lastpage
1753
Abstract
This paper aims to quantify the main error types the 2004 BBN speech recognition system made in the broadcast news (BN) and conversational telephone speech (CTS) DARPA EARS evaluations. We show that many of the remaining errors occur in clusters rather than isolated, have specific causes, and differ to some extent between the BN and CTS domains. The correctly recognized words are also clustered and are highly correlated with regions where the system produces a single hypothesized choice per word. A statistical analysis of some well-known error causes (out-of-vocabulary words, word fragments, hesitations, and unlikely language constructs) was performed in order to assess their contribution to the overall word error rate (WER). We conclude with a discussion of the lower bound on the WER introduced by the human annotator disagreement
Keywords
error analysis; speech recognition; statistical analysis; text analysis; BBN speech recognition system; DARPA EARS evaluation; broadcast news; conversational telephone speech; hesitations; human annotator disagreement; out-of-vocabulary words; statistical analysis; unlikely language constructs; word error rate; word fragments; word recognition; Automatic speech recognition; Broadcasting; Ear; Error analysis; Loudspeakers; Natural languages; Speech analysis; Speech processing; Speech recognition; Telephony; Error analysis; speech recognition;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2006.878268
Filename
1677993
Link To Document