مرکز منطقه ای اطلاع رساني علوم و فناوري - A NN/HMM hybrid for continuous speech recognition with a discriminant nonlinear feature extraction

DocumentCode :

323479

Title :

A NN/HMM hybrid for continuous speech recognition with a discriminant nonlinear feature extraction

Author :

Rigoll, Gerhard ; Willett, Daniel

Author_Institution :

Dept. of Comput. Sci., Gerhard-Mercator Univ., Duisburg, Germany

Volume :

fYear :

1998

fDate :

12-15 May 1998

Firstpage :

Abstract :

This paper deals with a hybrid NN/HMM architecture for continuous speech recognition. We present a novel approach to set up a neural linear or nonlinear feature transformation that is used as a preprocessor on top of the HMM system´s RBF-network to produce discriminative feature vectors that are well suited for being modeled by mixtures of Gaussian distributions. In order to omit the computational cost of discriminative training of a context-dependent system, we propose to train a discriminant neural feature transformation on a system of low complexity and reuse this transformation in the context-dependent system to output improved feature vectors. The resulting hybrid system is an extension of a state-of-the-art continuous HMM system, and in fact, it is the first hybrid system that really is capable of outperforming these standard systems with respect to the recognition accuracy, without the need for discriminative training of the entire system. In experiments carried out on the Resource Management 1000-word continuous speech recognition task we achieved a relative error reduction of about 10% with a recognition system that, even before, was among the best ever observed on this task

Keywords :

Gaussian distribution; feature extraction; feedforward neural nets; hidden Markov models; learning (artificial intelligence); multilayer perceptrons; speech recognition; Gaussian distributions; RBF-network; Resource Management database; context-dependent system; continuous speech recognition; discriminant neural feature transformation; discriminant nonlinear feature extraction; discriminative feature vectors; discriminative training; hybrid NN/HMM architecture; low complexity; multilayer perceptron; neural network; Computational efficiency; Computer architecture; Computer science; Continuous time systems; Feature extraction; Gaussian distribution; Hidden Markov models; Neural networks; Speech recognition; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

Conference_Location :

Seattle, WA

ISSN :

1520-6149

Print_ISBN :

0-7803-4428-6

Type :

conf

DOI :

10.1109/ICASSP.1998.674354

Filename :

674354

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=323479