DocumentCode
294628
Title
Application of clustering techniques to mixture density modelling for continuous-speech recognition
Author
Dugast, Christian ; Beyerlein, Peter ; Haeb-Umbach, Reinhold
Author_Institution
Philips Res. Lab., Aachen, Germany
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
524
Abstract
Clustering techniques have been integrated at different levels into the training procedure of a continuous-density hidden Markov model (HMM) speech recognizer. These clustering techniques can be used in two ways. First acoustically similar states are tied together. It will help to reduce the number of parameters but also allow to train otherwise rarely seen states together with more robust ones (state-tying). Secondly densities are clustered across states, this reduces the number of densities while at the same time keeping the best performances of our recognizer (density-clustering). We have applied these techniques both to word-based small-vocabulary and phoneme-based large-vocabulary recognition tasks. On the WSJ task, we could achieve a reduction of the word error rate by 7%. On the TI/NIST-connected digit task, the number of parameters was reduced by a factor 2-3 while keeping the same string error rate
Keywords
hidden Markov models; speech recognition; HMM; TI/NIST-connected digit task; WSJ task; acoustically similar states; clustering techniques; continuous-density hidden Markov model; continuous-speech recognition; density-clustering; mixture density modelling; phoneme-based large-vocabulary recognition; state-tying; string error rate; training procedure; word error rate; word-based small-vocabulary recognition; Acoustic testing; Benchmark testing; Cognition; Electronic mail; Error analysis; Hidden Markov models; Laboratories; NIST; Robustness; Speech recognition; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479644
Filename
479644
Link To Document