• DocumentCode
    294628
  • Title

    Application of clustering techniques to mixture density modelling for continuous-speech recognition

  • Author

    Dugast, Christian ; Beyerlein, Peter ; Haeb-Umbach, Reinhold

  • Author_Institution
    Philips Res. Lab., Aachen, Germany
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    524
  • Abstract
    Clustering techniques have been integrated at different levels into the training procedure of a continuous-density hidden Markov model (HMM) speech recognizer. These clustering techniques can be used in two ways. First acoustically similar states are tied together. It will help to reduce the number of parameters but also allow to train otherwise rarely seen states together with more robust ones (state-tying). Secondly densities are clustered across states, this reduces the number of densities while at the same time keeping the best performances of our recognizer (density-clustering). We have applied these techniques both to word-based small-vocabulary and phoneme-based large-vocabulary recognition tasks. On the WSJ task, we could achieve a reduction of the word error rate by 7%. On the TI/NIST-connected digit task, the number of parameters was reduced by a factor 2-3 while keeping the same string error rate
  • Keywords
    hidden Markov models; speech recognition; HMM; TI/NIST-connected digit task; WSJ task; acoustically similar states; clustering techniques; continuous-density hidden Markov model; continuous-speech recognition; density-clustering; mixture density modelling; phoneme-based large-vocabulary recognition; state-tying; string error rate; training procedure; word error rate; word-based small-vocabulary recognition; Acoustic testing; Benchmark testing; Cognition; Electronic mail; Error analysis; Hidden Markov models; Laboratories; NIST; Robustness; Speech recognition; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479644
  • Filename
    479644