DocumentCode
2970760
Title
Discriminative training of n-gram language models for speech recognition via linear programming
Author
Magdin, Vladimir ; Jiang, Hui
Author_Institution
Dept. of Comput. Sci. & Eng., York Univ., Toronto, ON, Canada
fYear
2009
fDate
Nov. 13 2009-Dec. 17 2009
Firstpage
305
Lastpage
310
Abstract
This paper presents a novel discriminative training algorithm for n-gram language models for use in large vocabulary continuous speech recognition. The algorithm uses Maximum Mutual Information Estimation (MMIE) to build an objective function that involves a metric computed between correct transcriptions and their competing hypotheses, which are encoded as word graphs generated from the Viterbi decoding process. The nonlinear MMIE objective function is approximated by a linear one using an EM-style auxiliary function, thus converting the discriminative training of n-gram language models into a linear programming problem, which can be efficiently solved by many convex optimization tools. Experimental results on the SPINE1 speech recognition corpus have shown that the proposed discriminative training method can outperform the conventional discounting-based maximum likelihood estimation methods. A relative reduction in word error rate of close to 3% has been observed on the SPINE1 speech recognition task.
Keywords
linear programming; maximum likelihood estimation; speech recognition; EM style auxiliary function; SPINE1 speech recognition; convex optimization; discriminative training algorithm; linear programming; maximum mutual information estimation; n-gram language models; Error analysis; Linear approximation; Linear programming; Maximum likelihood decoding; Maximum likelihood estimation; Mutual information; Natural languages; Speech recognition; Viterbi algorithm; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location
Merano
Print_ISBN
978-1-4244-5478-5
Electronic_ISBN
978-1-4244-5479-2
Type
conf
DOI
10.1109/ASRU.2009.5373248
Filename
5373248
Link To Document