• DocumentCode
    3046889
  • Title

    N-gram adaptation using Dirichlet class language model based on part-of-speech for speech recognition

  • Author

    Hatami, Ali ; Akbari, A. ; Nasersharif, Babak

  • Author_Institution
    Comput. Eng. Dept., Iran Univ. of Sci. & Technol., Tehran, Iran
  • fYear
    2013
  • fDate
    14-16 May 2013
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Language model plays an important role in automatic speech recognition (ASR) systems. Performance of this model depends on its adaptation to the linguistic features. Accordingly, adaptation methods endeavour to apply syntactic and semantic characteristics of the language for language modeling. The previous adaptation methods such as family of Dirichlet class language model (DCLM) extract class of history words. These methods due to lake of syntactic information are not suitable for high morphology languages such as Farsi. This work proposes an idea for using syntactic information such as part-of-speech (POS) in DCLM for combining with an n-gram language model. In our proposed approach, word clustering is based on POS of previous words and history words. The performance of language models are evaluated on BijanKhan corpus using a hidden Markov model based ASR system. Our experiments show that using POS information along with history words and class of history words improves language model, and decreases the perplexity on our corpus. Exploiting POS information along with DCLM, the word error rate of the ASR system decreases by 1% in comparison to DCLM.
  • Keywords
    hidden Markov models; speech recognition; BijanKhan corpus; DCLM; Dirichlet class language model; Farsi; N-gram adaptation; POS information; adaptation methods; automatic speech recognition systems; hidden Markov model based ASR system; history words; language model; language modeling; morphology languages; part-of-speech; semantic characteristics; syntactic characteristics; word clustering; Acoustics; Adaptation models; Computational modeling; Hidden Markov models; History; Probability; Speech recognition; language model adaptation; part-of-speech; perplexity; speech recognition; word error rate;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering (ICEE), 2013 21st Iranian Conference on
  • Conference_Location
    Mashhad
  • Type

    conf

  • DOI
    10.1109/IranianCEE.2013.6599642
  • Filename
    6599642