DocumentCode :
3783763
Title :
Innovative approaches for large vocabulary name recognition
Author :
Yuqing Gao;B. Ramabhadran;J. Chen;H. Erdogan;M. Picheny
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
1
fYear :
2001
fDate :
6/23/1905 12:00:00 AM
Firstpage :
53
Abstract :
Automatic name dialing is a practical and interesting application of speech recognition on telephony systems. The IBM name recognition system is a large vocabulary, speaker independent system currently in use for reaching IBM employees in the United States. We present some innovative algorithms that improve name recognition accuracy. Unlike transcription tasks, such as the Switchboard task, recognition of names poses a variety of different problems. Several of these problems arise from the fact that foreign names are hard to pronounce for speakers who are not familiar with the names and that there are no standardized methods for pronouncing proper names. Noise robustness is another very important factor as these calls are typically made in noisy environments, such as from a car, cafeteria, airport, etc. and over different kinds of cellular and land-line telephone channels. We have performed a systematic analysis of the speech recognition errors and tackled the issues separately with techniques ranging from weighted speaker clustering, massive adaptation, rapid and unsupervised adaptation methods to pronunciation modeling methods. We find that the decoding accuracy can be improved significantly (28% relative) in this manner.
Keywords :
"Vocabulary","Speech recognition","Telephony","Working environment noise","Hidden Markov models","Speech analysis","Clustering algorithms","Loudspeakers","Speech coding","Testing"
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP ´01). 2001 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940765
Filename :
940765
Link To Document :
بازگشت