• DocumentCode
    2789598
  • Title

    Discriminative training methods for language models using conditional entropy criteria

  • Author

    Huang, Jui-Ting ; Li, Xiao ; Acero, Alex

  • Author_Institution
    Electr. & Comput. Eng. Dept., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    5182
  • Lastpage
    5185
  • Abstract
    This paper addresses the problem of discriminative training of language models that does not require any transcribed acoustic data. We propose to minimize the conditional entropy of word sequences given phone sequences, and present two settings in which this criterion can be applied. In an inductive learning setting, the phonetic/acoustic confusability information is given by a general phone error model. A transductive approach, in contrast, obtains that information by running a speech recognizer on test-set acoustics, with the goal of optimizing the test-set performance. Experiments show significant recognition accuracy improvements in both rescoring and first-pass decoding experiments using the transductive approach, and mixed results using the inductive approach.
  • Keywords
    acoustic signal processing; entropy; learning by example; speech processing; speech recognition; acoustic confusability information; conditional entropy criteria; discriminative training method; first pass decoding experiment; general phone error model; inductive learning; language model; phone sequence; phonetic confusability information; speech recognizer; test set acoustics; word sequence; Acoustic testing; Acoustic waves; Acoustical engineering; Data engineering; Entropy; Maximum likelihood decoding; Maximum likelihood estimation; Random variables; Speech recognition; Web search; Discriminative training; conditional entropy; language model; unsupervised training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495008
  • Filename
    5495008