DocumentCode :
2800889
Title :
On noise estimation for robust speech recognition using vector Taylor series
Author :
Zhao, Yong ; Juang, Biing-Hwang Fred
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
4290
Lastpage :
4293
Abstract :
In this paper, we propose a novel noise variance estimation method based on the fixed-point method for VTS-based robust speech recognition. Noise parameters are re-estimated over a given utterance using an EM algorithm. The derivative of the auxiliary function with respect to the noise variance is derived, and the fixed-point algorithm estimates the noise variance by recursively approximating the root of that derivative. The method leads to a re-estimation formula similar in form to the standard ML variance estimate, and the iteration procedure is step-size free. We also investigate improving noise estimation for efficient VTS adaptation. Several fast noise estimation methods are examined, including estimation from non-speech areas and incremental adaptation. In evaluations on the Aurora 2 database, the proposed noise variance estimation method obtains a significant improvement in recognition accuracy over the method using the sample variance. Further experiments show that VTS ML estimation over non-speech areas is an effective fast adaptation method. The final refined approach achieves 8.75% WER, a 13% relative improvement over conventional VTS adaptation.
Keywords :
expectation-maximisation algorithm; series (mathematics); speech recognition; VTS-based robust speech recognition; auxiliary function; expectation-maximization algorithm; fixed point method; noise variance estimation method; vector Taylor series; Acoustic noise; Additive noise; Gaussian noise; Maximum likelihood estimation; Noise robustness; Speech enhancement; Speech recognition; Taylor series; Vectors; Working environment noise; Robust speech recognition; noise estimation; vector Taylor series
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5495669
Filename :
5495669