Title :
Application of clustering techniques to mixture density modelling for continuous-speech recognition
Author :
Dugast, Christian ; Beyerlein, Peter ; Haeb-Umbach, Reinhold
Author_Institution :
Philips Res. Lab., Aachen, Germany
Abstract :
Clustering techniques have been integrated at different levels into the training procedure of a continuous-density hidden Markov model (HMM) speech recognizer. These clustering techniques can be used in two ways. First acoustically similar states are tied together. It will help to reduce the number of parameters but also allow to train otherwise rarely seen states together with more robust ones (state-tying). Secondly densities are clustered across states, this reduces the number of densities while at the same time keeping the best performances of our recognizer (density-clustering). We have applied these techniques both to word-based small-vocabulary and phoneme-based large-vocabulary recognition tasks. On the WSJ task, we could achieve a reduction of the word error rate by 7%. On the TI/NIST-connected digit task, the number of parameters was reduced by a factor 2-3 while keeping the same string error rate
Keywords :
hidden Markov models; speech recognition; HMM; TI/NIST-connected digit task; WSJ task; acoustically similar states; clustering techniques; continuous-density hidden Markov model; continuous-speech recognition; density-clustering; mixture density modelling; phoneme-based large-vocabulary recognition; state-tying; string error rate; training procedure; word error rate; word-based small-vocabulary recognition; Acoustic testing; Benchmark testing; Cognition; Electronic mail; Error analysis; Hidden Markov models; Laboratories; NIST; Robustness; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479644