DocumentCode :
290124
Title :
IPA: improved phone modelling with recurrent neural networks
Author :
Robinson, Tony ; Hochberg, Mike ; Renals, Steve
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
This paper describes phone modelling improvements to the hybrid connectionist-hidden Markov model speech recognition system developed at Cambridge University. These improvements are applied to phone recognition from the TIMIT task and word recognition from the Wall Street Journal (WSJ) task. A recurrent net is used to map acoustic vectors to posterior probabilities of phone classes. The maximum likelihood phone or word string is then extracted using Markov models. The paper describes three improvements: connectionist model merging; explicit presentation of acoustic context; and improved duration modelling. The first is shown to provide a significant improvement in the TIMIT phone recognition rate and all three provide an improvement in the WSJ word recognition rate
Keywords :
hidden Markov models; maximum likelihood estimation; probability; recurrent neural nets; speech recognition; Cambridge University; Markov models; TIMIT task; Wall Street Journal; acoustic context presentation; acoustic vectors; connectionist model merging; duration modelling; hidden Markov model; maximum likelihood method; phone modelling; phone recognition rate; probabilities; recurrent neural networks; speech recognition system; word recognition rate; Acoustic applications; Context modeling; Hidden Markov models; Maximum likelihood decoding; Maximum likelihood estimation; Merging; Recurrent neural networks; Speech recognition; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389361
Filename :
389361
Link To Document :
بازگشت