Title :
Improved HMM phone and triphone models for real-time ASR telephony applications
Author :
I. Zeljkovic;S. Narayanan
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Abstract :
The development of human-machine dialog applications for messaging and information retrieval over the telephone poses stringent requirements on the accuracy and speed of the automatic speech recognition (ASR) system. The authors describe strategies for improved acoustic-phone modeling directed towards increasing recognition accuracy while keeping the number of phone units low. Specifically, the paper considers: (1) the development of an improved set of head-tail context-dependent (CD) triphones; (2) a novel criterion for better selection of the number of states assigned to each phone unit based on the coefficient of variation measure of feature components in HMM-Gaussians. The performance of the models is evaluated using data that represent real telephony applications.
Keywords :
"Hidden Markov models","Automatic speech recognition","Telephony","Speech recognition","Databases","Context modeling","Tail","Frequency","Speech enhancement","Vectors"
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607799