Hybrid system combining expert-TDNNs and HMMs for continuous speech recognition

Author

Devillers, Laurence; Dugast, Christian

Author_Institution

Lab. d'Informatique pour la Mecanique et les Sci. de l'Ingenieur, CNRS, Orsay, France

Volume

ii

fYear

1994

fDate

19-22 Apr 1994

Abstract

Hybrid systems that use neural networks (NNs) and hidden Markov models (HMMs) are designed to take advantage of both methods: the pattern classification power of NNs and the temporal modelling structure of HMMs. This paper describes the use of expert sub-network modules of the time delay neural network (TDNN) type for phone recognition in continuous speech. The originality of the hybrid system lies in combining the probabilities of the modular TDNN architecture with those of continuous-density HMMs (CDHMMs) during the recognition phase. On three speakers of the DARPA RM speaker-dependent task, we show that these small TDNNs, trained on phone ambiguities, can improve the word recognition performance of state-of-the-art CDHMMs. The TDNN implementation achieved a word error rate reduction of 15%. We discuss strategies for extending this approach from the DARPA RM speaker-dependent database to the larger DARPA RM speaker-independent database.

Keywords

delays; expert systems; hidden Markov models; learning (artificial intelligence); multilayer perceptrons; neural net architecture; pattern classification; probability; speech recognition; CDHMM; DARPA RM speaker-dependent task; DARPA RM speaker-independent database; HMM; TDNN architecture; continuous speech recognition; expert sub-network modules; expert-TDNN; hidden Markov models; hybrid system; pattern classification; phone recognition; probabilities; temporal modelling structure; time delay neural network; word error rate reduction; word recognition performance; Artificial neural networks; Databases; Error analysis; Hidden Markov models; Laboratories; Neural networks; Pattern classification; Power system modeling; Speech recognition; System testing

fLanguage

English

Publisher

IEEE

Conference_Titel

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on

Conference_Location

Adelaide, SA

ISSN

1520-6149

Print_ISBN

0-7803-1775-0

Type

conf

DOI

10.1109/ICASSP.1994.389693

Filename

389693