DocumentCode
353327
Title
Multi-source neural networks for speech recognition: a review of recent results
Author
Gemello, Roberto ; Albesano, Dario ; Mana, Franco ; Moisa, Loreta
Author_Institution
Centro Studi e Lab. Telecommun. SpA, Torino, Italy
Volume
5
fYear
2000
fDate
2000
Firstpage
265
Abstract
Different parameterizations of the speech signal may potentially extract complementary information useful to increase the accuracy in discriminating between confusable sound classes. In spite of this a single parameterization has nearly universally been used in speech recognition because the most diffused matching technology (hidden Markov models) is bound by theoretical and practical constraints that limit the use of multiple features derived from the speech signal with different processing algorithms. On the contrary neural networks are capable of incorporating multiple heterogeneous input features, which do not need to be treated as independent, finding the optimal combination of these features for classification. The purpose of this work is the exploitation of this potentiality of neural networks to improve the speech recognition accuracy. The multiple input features coming from different parameterization algorithms are combined through a network architecture called multi-source NN, designed to obtain the best synergy from them. In this work, we report the last results obtained on this research line by combining the basic spectral features with two auditory inspired features, a formant like feature and the frequency derivatives. The results show that multi-source NN leads to significant error reductions on both isolated words and continuous speech test sets
Keywords
hidden Markov models; neural nets; speech recognition; auditory inspired features; classification; continuous speech; error reduction; formant like feature; frequency derivatives; hidden Markov models; multi-source neural networks; multiple heterogeneous input features; spectral features; speech recognition; speech signal parameterizations; Algorithm design and analysis; Constraint theory; Data mining; Frequency; Hidden Markov models; Neural networks; Signal processing; Speech processing; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 2000. IJCNN 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on
Conference_Location
Como
ISSN
1098-7576
Print_ISBN
0-7695-0619-4
Type
conf
DOI
10.1109/IJCNN.2000.861468
Filename
861468
Link To Document