• DocumentCode
    2361313
  • Title

    A monolithic speech recognizer based on fully recurrent neural networks

  • Author

    Kasper, Klaus ; Reininger, Herbert ; Wolf, Dietrich ; Wüs, Harald

  • Author_Institution
    Inst. fur Angewandte Phys., Frankfurt Univ., Germany
  • fYear
    1994
  • fDate
    6-8 Sep 1994
  • Firstpage
    335
  • Lastpage
    344
  • Abstract
    Reports on investigations concerning the application of fully recurrent neural networks (FRNN) for speaker independent speech recognition. In a phoneme based recognition system separate FRNN are used for feature scoring as well as for compensating variations in time durations of speech segments. A recognizer with a FRNN for feature scoring achieves the same recognition rate as a recognition system where the context information is provided. The performance of the FRNN used for time alignment is comparable to that of a viterbi based alignment with durational constraints. Additionally, a monolithic speech recognizer is realized by FRNN which directly classifies feature sequences. The performance of this FRNN is comparable to that of speech recognition systems which are based on discrete hidden Markov models and use a sophisticated durational modeling. Furthermore, simulation experiments revealed that FRNN are able to extract relevant information for speech recognition from noise contaminated speech and thus achieve a robust recognition performance
  • Keywords
    hidden Markov models; recurrent neural nets; speech recognition; durational modeling; feature scoring; fully recurrent neural networks; monolithic speech recognizer; noise contaminated speech; phoneme based recognition system; recognition rate; speech segments; time alignment; time durations; Artificial neural networks; Data mining; Dynamic programming; Erbium; Hidden Markov models; Neurons; Recurrent neural networks; Speech enhancement; Speech recognition; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks for Signal Processing [1994] IV. Proceedings of the 1994 IEEE Workshop
  • Conference_Location
    Ermioni
  • Print_ISBN
    0-7803-2026-3
  • Type

    conf

  • DOI
    10.1109/NNSP.1994.366033
  • Filename
    366033