DocumentCode
1896481
Title
A recurrent time-delay neural network for improved phoneme recognition
Author
Greco, Fabio ; Paoloni, Andrea ; Ravaioli, G.
Author_Institution
Fondazione Ugo Bordoni, Roma, Italy
fYear
1991
fDate
14-17 Apr 1991
Firstpage
81
Abstract
The authors propose a modification to the structure of the time-delay neural network (TDNN), obtained through feedback at the first-hidden layer level. The experiment carried out with the new model, called RTDNN (recurrent TDNN), consists of the classification of the unvoiced plosive phonemes. These were extracted from an initial and intermediate position in a list of the most common Italian words, uttered by a male speaker, thus obtaining 250 tokens per phoneme. The training was carried out through a modified variant of back propagation, known as BPS (back propagation for sequences), using half of the tokens for learning and the remaining for the test. The error rate trend thus obtained shows a 27% decrease in a particular range of the magnitude of feedback, with values ranging from 5% for the original TDNN model with no feedback to 3.6% for the proposed RTDNN model
Keywords
delays; neural nets; speech recognition; RTDNN; back propagation; error rate; feedback; first-hidden layer level; male speaker; most common Italian words; phoneme recognition; recurrent time-delay neural network; speech recognition; unvoiced plosive phonemes; Data mining; Error analysis; Neural networks; Neurofeedback; Pattern classification; Recurrent neural networks; Shape; Signal analysis; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location
Toronto, Ont.
ISSN
1520-6149
Print_ISBN
0-7803-0003-3
Type
conf
DOI
10.1109/ICASSP.1991.150283
Filename
150283
Link To Document