DocumentCode :
316807
Title :
Improving environmental robustness of speech recognition using neural networks
Author :
Sirigos, John ; Fakotakis, Nikos ; Kokkinakis, George
Author_Institution :
Wire Commun. Lab., Patras Univ., Greece
Volume :
2
fYear :
1997
fDate :
2-4 Jul 1997
Firstpage :
575
Abstract :
This paper presents a method for improving speech recognition in noisy environment by using neural networks. Two multilayer perceptrons (MLPs) are used. The first MLP minimises the difference between noisy and clean speech and the second one measures the degree of noise in the speech signal and adjusts the time interval between subsequent frames of the processed speech signal accordingly. If we use the technique presented in this paper as a pre-processing stage of a speech recognition system we can extend the application of the system to different environments without re-training it. We need only to train the preprocessing stage with a small portion of noisy data which is created by conducting part of the original clean speech database used for training the speech recognizer through the desired environment. There is no need for creating a new database in the desired working environment. Our method was tested on a vowel spotting system, and is trained with two well known databases: TIMIT and NTIMIT. The evaluation of the system through a vowel spotting process, shows a significant improvement of the recognition rate of the system
Keywords :
learning (artificial intelligence); multilayer perceptrons; noise; speech processing; speech recognition; NTIMIT; TIMIT; automatic speech recognition; clean speech; environmental robustness; multilayer perceptrons; neural networks; noisy data; noisy speech; preprocessing stage; processed speech signal; recognition rate; speech database; speech recognition system; speech recognizer training; speech signal; time interval; vowel spotting system; Databases; Multilayer perceptrons; Neural networks; Noise measurement; Noise robustness; Signal processing; Speech enhancement; Speech processing; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Signal Processing Proceedings, 1997. DSP 97., 1997 13th International Conference on
Conference_Location :
Santorini
Print_ISBN :
0-7803-4137-6
Type :
conf
DOI :
10.1109/ICDSP.1997.628414
Filename :
628414
Link To Document :
بازگشت