DocumentCode :
2970760
Title :
Discriminative training of n-gram language models for speech recognition via linear programming
Author :
Magdin, Vladimir ; Jiang, Hui
Author_Institution :
Dept. of Comput. Sci. & Eng., York Univ., Toronto, ON, Canada
fYear :
2009
fDate :
Nov. 13 2009-Dec. 17 2009
Firstpage :
305
Lastpage :
310
Abstract :
This paper presents a novel discriminative training algorithm for n-gram language models for use in large vocabulary continuous speech recognition. The algorithm uses Maximum Mutual Information Estimation (MMIE) to build an objective function that involves a metric computed between correct transcriptions and their competing hypotheses, which are encoded as word graphs generated from the Viterbi decoding process. The nonlinear MMIE objective function is approximated by a linear one using an EM-style auxiliary function, thus converting the discriminative training of n-gram language models into a linear programming problem, which can be efficiently solved by many convex optimization tools. Experimental results on the SPINE1 speech recognition corpus have shown that the proposed discriminative training method can outperform the conventional discounting-based maximum likelihood estimation methods. A relative reduction in word error rate of close to 3% has been observed on the SPINE1 speech recognition task.
Keywords :
linear programming; maximum likelihood estimation; speech recognition; EM style auxiliary function; SPINE1 speech recognition; convex optimization; discriminative training algorithm; linear programming; maximum mutual information estimation; n-gram language models; Error analysis; Linear approximation; Linear programming; Maximum likelihood decoding; Maximum likelihood estimation; Mutual information; Natural languages; Speech recognition; Viterbi algorithm; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
Type :
conf
DOI :
10.1109/ASRU.2009.5373248
Filename :
5373248
Link To Document :
بازگشت