• DocumentCode
    2970760
  • Title

    Discriminative training of n-gram language models for speech recognition via linear programming

  • Author

    Magdin, Vladimir ; Jiang, Hui

  • Author_Institution
    Dept. of Comput. Sci. & Eng., York Univ., Toronto, ON, Canada
  • fYear
    2009
  • fDate
    Nov. 13 2009-Dec. 17 2009
  • Firstpage
    305
  • Lastpage
    310
  • Abstract
    This paper presents a novel discriminative training algorithm for n-gram language models for use in large vocabulary continuous speech recognition. The algorithm uses Maximum Mutual Information Estimation (MMIE) to build an objective function that involves a metric computed between correct transcriptions and their competing hypotheses, which are encoded as word graphs generated from the Viterbi decoding process. The nonlinear MMIE objective function is approximated by a linear one using an EM-style auxiliary function, thus converting the discriminative training of n-gram language models into a linear programming problem, which can be efficiently solved by many convex optimization tools. Experimental results on the SPINE1 speech recognition corpus have shown that the proposed discriminative training method can outperform the conventional discounting-based maximum likelihood estimation methods. A relative reduction in word error rate of close to 3% has been observed on the SPINE1 speech recognition task.
  • Keywords
    linear programming; maximum likelihood estimation; speech recognition; EM style auxiliary function; SPINE1 speech recognition; convex optimization; discriminative training algorithm; linear programming; maximum mutual information estimation; n-gram language models; Error analysis; Linear approximation; Linear programming; Maximum likelihood decoding; Maximum likelihood estimation; Mutual information; Natural languages; Speech recognition; Viterbi algorithm; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
  • Conference_Location
    Merano
  • Print_ISBN
    978-1-4244-5478-5
  • Electronic_ISBN
    978-1-4244-5479-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2009.5373248
  • Filename
    5373248