DocumentCode :
3644450
Title :
Improved HMM phone and triphone models for real-time ASR telephony applications
Author :
I. Zeljkovic;S. Narayanan
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
2
fYear :
1996
Firstpage :
1105
Abstract :
The development of human-machine dialog applications for messaging and information retrieval over the telephone poses stringent requirements on the accuracy and speed of the automatic speech recognition (ASR) system. The authors describe strategies for improved acoustic-phone modeling directed towards increasing recognition accuracy while keeping the number of phone units low. Specifically, the paper considers: (1) the development of an improved set of head-tail context-dependent (CD) triphones; (2) a novel criterion for better selection of the number of states assigned to each phone unit based on the coefficient of variation measure of feature components in HMM-Gaussians. The performance of the models is evaluated using data that represent real telephony applications.
Keywords :
"Hidden Markov models","Automatic speech recognition","Telephony","Speech recognition","Databases","Context modeling","Tail","Frequency","Speech enhancement","Vectors"
Publisher :
ieee
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Print_ISBN :
0-7803-3555-4
Type :
conf
DOI :
10.1109/ICSLP.1996.607799
Filename :
607799
Link To Document :
بازگشت