• DocumentCode
    735032
  • Title

    Low-frequency word enhancement with similar pairs in speech recognition

  • Author

    Xi Ma ; Xiaoxi Wang ; Dong Wang

  • Author_Institution
    Center for Speech & Language Technol., Tsinghua Univ., Beijing, China
  • fYear
    2015
  • fDate
    12-15 July 2015
  • Firstpage
    343
  • Lastpage
    347
  • Abstract
    In practical automatic speech recognition (ASR) systems, it is difficult to recognize words that are with low-frequency in the language model (LM) training data. Ironically, these words tend to be highly important as they are often domain-specific name entities. In order to meet this challenge, we present a novel approach that enhances the weights of these words by borrowing information from some high-frequency words that are similar to the target words. Experimental results demonstrated that our method can significantly improve ASR performance on low-frequency words and does not impact performance on high-frequency words. Additionally, this method can be easily extended to deal with new words that are absent in the LM training data.
  • Keywords
    speech recognition; ASR system; LM training data; automatic speech recognition system; domain-specific name entity; language model; low-frequency word enhancement; Acoustics; Hidden Markov models; Probability; Speech; Speech recognition; Training; Training data; finite state transducer; language model; similar pair; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2015 IEEE China Summit and International Conference on
  • Conference_Location
    Chengdu
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2015.7230421
  • Filename
    7230421