DocumentCode :
1748974
Title :
Multi-source neural networks based on fixed and multiple resolution analysis for speech recognition
Author :
Albesano, Dario ; Gemello, Roberto ; Mana, Franco ; Pegoraro, Paolo
Author_Institution :
LOQUENDO S.p.A., Torino, Italy
Volume :
4
fYear :
2001
fDate :
2001
Firstpage :
2964
Abstract :
This paper reports the results obtained by an automatic speech recognition system when MFCCs, J-RASTA perceptual linear prediction coefficients (J-Rasta PLP) and energies from a multi-resolution analysis (MRA) tree of filters are used as input features to a hybrid system consisting of a neural network (NN) which provides observation probabilities for a network of hidden Markov models. Furthermore, the paper compares the performance of the system when various combinations of these features are used showing a WER reduction of 20% with respect to the use of J-Rasta PLP coefficients, when J-Rasta PLP coefficients are combined with the energies computed at the output of the leaves of an MRA filter tree. Such a combination is practically feasible due to the use of a NN architecture designed to integrate multiple features, exploiting the NN capability of mixing several input parameters without any assumption about their stochastic independence. Recognition is performed on a very large test set including many speakers uttering proper names from different locations of the Italian public telephone network
Keywords :
hidden Markov models; linear predictive coding; neural nets; probability; speech recognition; wavelet transforms; Italian public telephone network; hidden Markov models; multiple-source neural networks; observation probability; perceptual linear prediction; speech recognition; wavelet packet; Automatic speech recognition; Computer architecture; Hidden Markov models; Neural networks; Nonlinear filters; Performance evaluation; Speech analysis; Stochastic processes; Telephony; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 2001. Proceedings. IJCNN '01. International Joint Conference on
Conference_Location :
Washington, DC
ISSN :
1098-7576
Print_ISBN :
0-7803-7044-9
Type :
conf
DOI :
10.1109/IJCNN.2001.938849
Filename :
938849
Link To Document :
بازگشت