DocumentCode :
284578
Title :
Speech recognition using stochastic segment neural networks
Author :
Leung, Hong C. ; Heherington, I.L. ; Zue, Victor W.
Author_Institution :
Lab. for Comput. Sci., MIT, Cambridge, MA, USA
Volume :
1
fYear :
1992
fDate :
23-26 Mar 1992
Firstpage :
613
Abstract :
The authors previously presented (1991) a stochastic explicit-segment modeling (SESM) approach to speech recognition. There were two major stochastic components: boundary and phonetic classifications. The authors extend their earlier framework and incorporate context-dependent techniques into the major stochastic components. The current implementation uses stochastic segment neural networks (SSNN) to deal with these two problems. The authors have experimented with SSNN on a task of recognizing 25 words (city names) recorded from actual customers over the telephone network. Comparisons show that context-dependent modeling can reduce the error rate quite substantially. Specifically, using context-dependent boundary and phonetic classifications, the authors achieved an error rate of 3.3% with no rejections or about 0.35% at a rejection rate of 15%
Keywords :
neural nets; speech recognition; SSNN; boundary classification; context-dependent modeling; error rate; phonetic classifications; speech recognition; stochastic segment neural networks; Context modeling; Contracts; Error analysis; Hidden Markov models; Neural networks; Speech recognition; Stochastic processes; Stochastic systems; Telephony; Viterbi algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
ISSN :
1520-6149
Print_ISBN :
0-7803-0532-9
Type :
conf
DOI :
10.1109/ICASSP.1992.225834
Filename :
225834
Link To Document :
بازگشت