• DocumentCode
    290357
  • Title

    Hybrid system combining expert-TDNNs and HMMs for continuous speech recognition

  • Author

    Devillers, Laurence ; Dugast, Christaan

  • Author_Institution
    Lab. d´´Informatique pour la Mecanique et les Sci. de l´´Ingenieur, CNRS, Orsay, France
  • Volume
    ii
  • fYear
    1994
  • fDate
    19-22 Apr 1994
  • Abstract
    Hybrid systems, using neural networks (NNs) and hidden Markov models (HMMs) are designed to take advantage of both methods; the pattern classification power of NNs and the temporal modelling structure of HMMs. This paper describes the use of expert sub-network modules of the type time delay neural network (TDNN) for phone recognition of continuous speech. The originality of the hybrid system developed is in combining the probabilities of the modular TDNN architecture with those of CDHMMs during the recognition phase. On three speakers of the DARPA RM speaker-dependent task, we show that these small TDNNs trained on phone ambiguities can improve word recognition performance of state-of-the-art CDHMMs. The TDNN implementation achieved a word error rate reduction of 15%. We discuss strategies for extending this approach from the DARPA RM speaker-dependent database to the larger DARPA RM speaker-independent database
  • Keywords
    delays; expert systems; hidden Markov models; learning (artificial intelligence); multilayer perceptrons; neural net architecture; pattern classification; probability; speech recognition; CDHMM; DARPA RM speaker-dependent task; DARPA RM speaker-independent database; HMM; TDNN architecture; continuous speech recognition; expert sub-network modules; expert-TDNN; hidden Markov models; hybrid system; pattern classification; phone recognition; probabilities; temporal modelling structure; time delay neural network; word error rate reduction; word recognition performance; Artificial neural networks; Databases; Error analysis; Hidden Markov models; Laboratories; Neural networks; Pattern classification; Power system modeling; Speech recognition; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
  • Conference_Location
    Adelaide, SA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-1775-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1994.389693
  • Filename
    389693