Title :
Word-selective training for speech recognition
Author :
Kamm, Teresa M. ; Meyer, Gerard G L
Author_Institution :
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
fDate :
30 Nov.-3 Dec. 2003
Abstract :
We previously proposed (Kamm and Meyer (2001, 2002)) a two-pronged approach to improve system performance by selective use of training data. We demonstrated a sentence-selective algorithm that, first, made effective use of the available humanly transcribed training data and, second, focused future human transcription effort on data that was more likely to improve system performance. We now extend that algorithm to focus on word selection, and demonstrate that we can reduce the error rate from 10.3 % to 9.3 % on a simple, 36-word corpus, by selecting 30 % (15 hours) of the 50 hours of training data available in this corpus, without knowledge of the true transcription. We also discuss application of our word selection algorithm to the Wall Street Journal 5 K word task. Preliminary results show that we can select up to 60 % (48 hours) of the training data, with minimal knowledge of the true transcription, and match or beat the error rate of a system built using the same amount of randomly selected training data.
Keywords :
error statistics; speech recognition; Wall Street Journal 5 K word task; error rate; speech recognition; training data; transcription; word selection; word-selective training; Automatic speech recognition; Costs; Error analysis; Humans; Learning systems; Natural languages; Speech processing; Speech recognition; System performance; Training data;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
DOI :
10.1109/ASRU.2003.1318403