Title :
A monolithic speech recognizer based on fully recurrent neural networks
Author :
Kasper, Klaus ; Reininger, Herbert ; Wolf, Dietrich ; Wüs, Harald
Author_Institution :
Inst. fur Angewandte Phys., Frankfurt Univ., Germany
Abstract :
Reports on investigations concerning the application of fully recurrent neural networks (FRNN) for speaker independent speech recognition. In a phoneme based recognition system separate FRNN are used for feature scoring as well as for compensating variations in time durations of speech segments. A recognizer with a FRNN for feature scoring achieves the same recognition rate as a recognition system where the context information is provided. The performance of the FRNN used for time alignment is comparable to that of a viterbi based alignment with durational constraints. Additionally, a monolithic speech recognizer is realized by FRNN which directly classifies feature sequences. The performance of this FRNN is comparable to that of speech recognition systems which are based on discrete hidden Markov models and use a sophisticated durational modeling. Furthermore, simulation experiments revealed that FRNN are able to extract relevant information for speech recognition from noise contaminated speech and thus achieve a robust recognition performance
Keywords :
hidden Markov models; recurrent neural nets; speech recognition; durational modeling; feature scoring; fully recurrent neural networks; monolithic speech recognizer; noise contaminated speech; phoneme based recognition system; recognition rate; speech segments; time alignment; time durations; Artificial neural networks; Data mining; Dynamic programming; Erbium; Hidden Markov models; Neurons; Recurrent neural networks; Speech enhancement; Speech recognition; Viterbi algorithm;
Conference_Titel :
Neural Networks for Signal Processing [1994] IV. Proceedings of the 1994 IEEE Workshop
Conference_Location :
Ermioni
Print_ISBN :
0-7803-2026-3
DOI :
10.1109/NNSP.1994.366033