DocumentCode
1617191
Title
Hybrid speech recognition system with discriminative training applied for Romanian language
Author
Gavat, Inge ; Zirra, Matei ; Cula, Oana
Author_Institution
Polytech.. Univ. of Bucharest, Romania
Volume
1
fYear
1998
Firstpage
11
Abstract
This paper describes a hybrid connectionist-statistical system consisting of a neural network integrated in a hidden Markov model (HMM). The neural network used is the multilayer perceptron (MLP) and that network is the mechanism that computes the a-posteriori probability of a sequence of HMMs states. The classifier is based on total scores computed by Viterbi alignment for each hybrid model corresponding to the words in the vocabulary. Because of the lack of discrimination between the models and the unintended discrimination between the states in each model, we propose a solution that improves the system, namely an additional training task based on a cost function that approximates the misclassification rate of the hybrid system. The optimization criterion is based on a descent algorithm and the result is a minimum classification error. Our experiments on a 35 word vocabulary, show an improvement of the recognition rate from 92.4% for the case of a statistical system based only on HMMs, to 94.7% for the case of a hybrid HMM-MLP system, and to 97.9% for the case of an improved hybrid system with an extra discriminative training
Keywords
hidden Markov models; learning (artificial intelligence); multilayer perceptrons; natural languages; probability; signal classification; speech recognition; statistical analysis; Romanian language; Viterbi alignment; a-posteriori probability; cost function; descent algorithm; discriminative training; experiments; hidden Markov model; hybrid HMM-MLP system; hybrid connectionist-statistical system; hybrid speech recognition system; minimum classification error; misclassification rate approximation; multilayer perceptron; recognition rate; speech processing; training task; vocabulary; Artificial neural networks; Backpropagation algorithms; Hidden Markov models; Natural languages; Neural networks; Power system modeling; Speech processing; Speech recognition; Viterbi algorithm; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrotechnical Conference, 1998. MELECON 98., 9th Mediterranean
Conference_Location
Tel-Aviv
Print_ISBN
0-7803-3879-0
Type
conf
DOI
10.1109/MELCON.1998.692163
Filename
692163
Link To Document