• DocumentCode
    2180263
  • Title

    Multi-class Model M

  • Author

    Emami, Ahmad ; Chen, Stanley F.

  • Author_Institution
    IBM TJ. Watson Res. Center, Yorktown Heights, NY, USA
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5516
  • Lastpage
    5519
  • Abstract
    Model M, a novel class-based exponential language model, has been shown to significantly outperform word n-gram models in state-of-the-art machine translation and speech recognition systems. The model was motivated by the observation that shrinking the sum of the parameter magnitudes in an exponential language model leads to better performance on unseen data. Being a class-based language model, Model M makes use of word classes that are found automatically from training data. In this paper, we extend Model M to allow for different clusterings to be used at different word positions. This is motivated by the fact that words play different roles depending on their position in an n-gram. Experiments on standard NIST and GALE Arabic-to-English development and test sets show improvements in machine translation quality as measured by automatic evaluation metrics.
  • Keywords
    language translation; speech recognition; GALE Arabic-to-English development; NIST; class-based exponential language model; machine translation; multiclass model M; n-gram model; speech recognition system; Adaptation models; Clustering algorithms; Data models; Decoding; Prediction algorithms; Speech recognition; Training data; Language Modeling; Machine Translation; Maximum-Entropy Models;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947608
  • Filename
    5947608