• DocumentCode
    3426529
  • Title

    Language modeling for voice search: A machine translation approach

  • Author

    Li, Xiao ; Ju, Yun-Cheng ; Zweig, Geoffrey ; Acero, Alex

  • Author_Institution
    Microsoft Res., Redmond, WA
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4913
  • Lastpage
    4916
  • Abstract
    This paper presents a novel approach to language modeling for voice search based on the idea and method of statistical machine translation. We propose an n-gram based translation model that can be used for listing-to-query translation. We then leverage the query forms translated from listings to improve language modeling. The translation model is trained in an unsupervised manner using a set of transcribed voice search queries. Experiments show that the translation approach yielded drastic perplexity reductions compared with a baseline language model where no translation is applied.
  • Keywords
    language translation; statistical analysis; language modeling; listing-to-query translation; n-gram based translation model; statistical machine translation; voice search; Automatic speech recognition; Humans; Information retrieval; Predictive models; Robustness; Speech recognition; Telephony; directory assistance; language modeling; machine translation; voice search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518759
  • Filename
    4518759