A recurrent time-delay neural network for improved phoneme recognition

Author

Greco, Fabio ; Paoloni, Andrea ; Ravaioli, G.

Author_Institution

Fondazione Ugo Bordoni, Roma, Italy

fYear

1991

fDate

14-17 Apr 1991

Firstpage

81

Abstract

The authors propose a modification to the structure of the time-delay neural network (TDNN), obtained through feedback at the first-hidden layer level. The experiment carried out with the new model, called RTDNN (recurrent TDNN), consists of the classification of the unvoiced plosive phonemes. These were extracted from an initial and intermediate position in a list of the most common Italian words, uttered by a male speaker, thus obtaining 250 tokens per phoneme. The training was carried out through a modified variant of back propagation, known as BPS (back propagation for sequences), using half of the tokens for learning and the remaining for the test. The error rate trend thus obtained shows a 27% decrease in a particular range of the magnitude of feedback, with values ranging from 5% for the original TDNN model with no feedback to 3.6% for the proposed RTDNN model

Keywords

delays; neural nets; speech recognition; RTDNN; back propagation; error rate; feedback; first-hidden layer level; male speaker; most common Italian words; phoneme recognition; recurrent time-delay neural network; speech recognition; unvoiced plosive phonemes; Data mining; Error analysis; Neural networks; Neurofeedback; Pattern classification; Recurrent neural networks; Shape; Signal analysis; Speech recognition; Testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on

Conference_Location

Toronto, Ont.

ISSN

1520-6149

Print_ISBN

0-7803-0003-3

Type

conf

DOI

10.1109/ICASSP.1991.150283

Filename

150283