Title :
Decision tree state tying based on segmental clustering for acoustic modeling
Author :
Reichl, Wolfgang ; Chou, W.
Author_Institution :
Bell Labs., Murray Hill, NJ, USA
Abstract :
A fast segmental clustering approach to decision tree tying based acoustic modeling is proposed for large vocabulary speech recognition. It is based on a two level clustering scheme for robust decision tree state clustering. This approach extends the conventional segmental K-means approach to phonetic decision tree state tying based acoustic modeling. It achieves high recognition performances while reducing the model training time from days to hours comparing to the approaches based on Baum-Welch training. Experimental results on standard Resource Management and Wall Street Journal tasks are presented which demonstrate the robustness and efficacy of this approach
Keywords :
Gaussian distribution; acoustic signal processing; pattern classification; speech recognition; trees (mathematics); Baum-Welch training; Gaussian distribution; Resource Management task; Wall Street Journal task; acoustic modeling; experimental results; large vocabulary speech recognition; model training time reduction; one pass decoding; phonetic decision tree state tying; recognition performance; robust decision tree state clustering; segmental K-means approach; segmental clustering; two level clustering scheme; Context modeling; Decision trees; Gaussian distribution; Hidden Markov models; Management training; Parameter estimation; Robustness; Speech recognition; Training data; Vocabulary;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675386