DocumentCode :
310461
Title :
Combining ANNs to improve phone recognition
Author :
Mak, Brian
Author_Institution :
Center for Spoken Language Understanding, Oregon Graduate Inst. of Sci. & Technol., Portland, OR, USA
Volume :
4
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
3253
Abstract :
In applying neural networks to speech recognition, one often finds that slightly different training configurations lead to significantly different networks. Thus different training sessions using different setups will likely end up in “mixed” network configurations representing different solutions in different regions of the data space. This sensitivity to the initial weights assigned, the training parameters and the training data can be used to enhance performance, using a committee of neural networks. We study various ways to combine context-dependent (CD) and context-independent (CD) neural network phone estimators to improve phone recognition. As a result, we obtain 6.3% and 2.2% increase in accuracy in phone recognition using monophones and biphones respectively
Keywords :
backpropagation; neural nets; parameter estimation; speech processing; speech recognition; ANN; backpropagation; biphones; context dependent neural network phone estimator; context independent neural network phone estimator; data space regions; initial weights; mixed network configurations; monophones; neural networks committee; phone recognition; training configurations; training data; training parameters; Artificial neural networks; Computer networks; Interpolation; Natural languages; Neural networks; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.595487
Filename :
595487
Link To Document :
بازگشت