• DocumentCode
    730726
  • Title

    An HMM-based formalism for automatic subword unit derivation and pronunciation generation

  • Author

    Razavi, Marzieh ; Magimai-Doss, Mathew

  • Author_Institution
    Idiap Res. Inst., Martigny, Switzerland
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4639
  • Lastpage
    4643
  • Abstract
    We propose a novel hidden Markov model (HMM) formalism for automatic derivation of subword units and pronunciation generation using only transcribed speech data. In this approach, the subword units are derived from the clustered context-dependent units in a grapheme based system using maximum-likelihood criterion. The subword unit based pronunciations are then learned in the framework of Kullback-Leibler divergence based HMM. The automatic speech recognition (ASR) experiments on WSJ0 English corpus show that the approach leads to 12.7% relative reduction in word error rate compared to grapheme-based system. Our approach can be beneficial in reducing the need for expert knowledge in development of ASR as well as text-to-speech systems.
  • Keywords
    hidden Markov models; maximum likelihood estimation; speech recognition; ASR experiments; HMM formalism; Kullback-Leibler divergence based HMM; WSJ0 English corpus; automatic speech recognition experiments; clustered context-dependent units; grapheme based system; hidden Markov model formalism; maximum-likelihood criterion; pronunciation generation; subword units automatic derivation; text-to-speech systems; transcribed speech data; word error rate; Acoustics; Hidden Markov models; Mathematical model; Probabilistic logic; Speech; Speech recognition; Training; Kullback-Leibler divergence based hidden Markov model; automatic subword unit derivation; hidden Markov model; pronunciation generation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178850
  • Filename
    7178850