• DocumentCode
    1896481
  • Title

    A recurrent time-delay neural network for improved phoneme recognition

  • Author

    Greco, Fabio ; Paoloni, Andrea ; Ravaioli, G.

  • Author_Institution
    Fondazione Ugo Bordoni, Roma, Italy
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    81
  • Abstract
    The authors propose a modification to the structure of the time-delay neural network (TDNN), obtained through feedback at the first-hidden layer level. The experiment carried out with the new model, called RTDNN (recurrent TDNN), consists of the classification of the unvoiced plosive phonemes. These were extracted from an initial and intermediate position in a list of the most common Italian words, uttered by a male speaker, thus obtaining 250 tokens per phoneme. The training was carried out through a modified variant of back propagation, known as BPS (back propagation for sequences), using half of the tokens for learning and the remaining for the test. The error rate trend thus obtained shows a 27% decrease in a particular range of the magnitude of feedback, with values ranging from 5% for the original TDNN model with no feedback to 3.6% for the proposed RTDNN model
  • Keywords
    delays; neural nets; speech recognition; RTDNN; back propagation; error rate; feedback; first-hidden layer level; male speaker; most common Italian words; phoneme recognition; recurrent time-delay neural network; speech recognition; unvoiced plosive phonemes; Data mining; Error analysis; Neural networks; Neurofeedback; Pattern classification; Recurrent neural networks; Shape; Signal analysis; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1991.150283
  • Filename
    150283