Title :
Continuous speech recognition for the TIMIT database using neural networks
Author :
Fallside, F. ; Lucke, H. ; Marsland, T.P. ; O´Shea, P.J. ; Owen, M. St J ; Prager, R.W. ; Robinson, A.J. ; Russell, N.H.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
Four types of neural networks which have previously been established for speech recognition and tested on a small, seven-speaker, 100-sentence database are applied to the TIMIT database. The networks are a recurrent network phoneme recognizer, a modified Kanerva model morph recognizer, a compositional representation phoneme-to-word recognizer, and a modified Kanerva model morph-to-word recognizer. The major result is for the recurrent net, giving a phoneme recognition accuracy of 57% from the si and sx sentences. The Kanerva morph recognizer achieves 66.2% accuracy for a small subset of the sa and sx sentences. The results for the word recognizers are incomplete
Keywords :
neural nets; speech recognition; Kanerva morph recognizer; compositional representation phoneme-to-word recognizer; continuous speech recognition; modified Kanerva model morph recognizer; modified Kanerva model morph-to-word recognizer; neural networks; phoneme recognition accuracy; recurrent network phoneme recognizer; Data engineering; Databases; Hidden Markov models; Management training; Neural networks; Packaging; Recurrent neural networks; Speech recognition; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115745