مرکز منطقه ای اطلاع رساني علوم و فناوري - Uncertainty in training large vocabulary speech recognizers

DocumentCode :

2769935

Title :

Uncertainty in training large vocabulary speech recognizers

Author :

Subramanya, Amarnag ; Bartels, Chris ; Bilmes, Jeff ; Nguyen, Patrick

Author_Institution :

Univ. of Washington, Seattle

fYear :

2007

fDate :

9-13 Dec. 2007

Firstpage :

484

Lastpage :

489

Abstract :

We propose a technique for annotating data used to train a speech recognizer. The proposed scheme is based on labeling only a single frame for every word in the training set. We make use of the virtual evidence (VE) framework within a graphical model to take advantage of such data. We apply this approach to a large vocabulary speech recognition task, and show that our VE-based training scheme can improve over the performance of a system trained using sequence labeled data by 2.8% and 2.1% on the dev01 and eva101 sets respectively. Annotating data in the proposed scheme is not significantly slower than sequence labeling. We present timing results showing that training using the proposed approach is about 10 times faster than training using sequence labeled data while using only about 75% of the memory.

Keywords :

speech recognition; sequence labeled data; speech recognition task; virtual evidence framework; vocabulary speech recognizers; Graphical models; Humans; Labeling; Noise robustness; Semisupervised learning; Speech recognition; Timing; Training data; Uncertainty; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on

Conference_Location :

Kyoto

Print_ISBN :

978-1-4244-1746-9

Electronic_ISBN :

978-1-4244-1746-9

Type :

conf

DOI :

10.1109/ASRU.2007.4430160

Filename :

4430160

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2769935