DocumentCode :
2769935
Title :
Uncertainty in training large vocabulary speech recognizers
Author :
Subramanya, Amarnag ; Bartels, Chris ; Bilmes, Jeff ; Nguyen, Patrick
Author_Institution :
Univ. of Washington, Seattle
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
484
Lastpage :
489
Abstract :
We propose a technique for annotating data used to train a speech recognizer. The proposed scheme is based on labeling only a single frame for every word in the training set. We make use of the virtual evidence (VE) framework within a graphical model to take advantage of such data. We apply this approach to a large vocabulary speech recognition task, and show that our VE-based training scheme can improve over the performance of a system trained using sequence labeled data by 2.8% and 2.1% on the dev01 and eva101 sets respectively. Annotating data in the proposed scheme is not significantly slower than sequence labeling. We present timing results showing that training using the proposed approach is about 10 times faster than training using sequence labeled data while using only about 75% of the memory.
Keywords :
speech recognition; sequence labeled data; speech recognition task; virtual evidence framework; vocabulary speech recognizers; Graphical models; Humans; Labeling; Noise robustness; Semisupervised learning; Speech recognition; Timing; Training data; Uncertainty; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430160
Filename :
4430160
Link To Document :
بازگشت