DocumentCode
3488661
Title
A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR
Author
Han, Zhaobing ; Zhang, Shuwu ; Zhang, Huayun ; Xu, Bo
Author_Institution
Inst. of Autom., Acad. Sinica, Beijing, China
Volume
2
fYear
2003
fDate
6-10 April 2003
Abstract
A vector statistical piecewise polynomial (VPP) approximation algorithm is proposed for environment compensation in speech signals that are degraded by both additive and convolutive noise. By investigating the model of the telephone environment, we address a piecewise polynomial, namely two linear polynomials and a quadratic polynomial, to approximate the environment function precisely. The VPP is applied either to stationary noise, or to non-stationary noise. In the first case, batch EM is used in the log-spectral domain; in the second case, recursive EM with iterative stochastic approximation is developed in the cepstral domain. Both approaches are based on the minimum mean squared error (MMSE) sense. Experimental results are presented on the application of this approach in improving the performance of Mandarin large vocabulary continuous speech recognition (LVCSR) in background noise and different transmission channels (such as fixed telephone line and GSM). The method can reduce the average character error rate (CER) by about 18%.
Keywords
acoustic noise; error statistics; iterative methods; least mean squares methods; natural languages; optimisation; piecewise polynomial techniques; speech enhancement; speech recognition; statistical analysis; stochastic processes; telephony; MMSE; Mandarin large vocabulary continuous speech recognition; additive noise; batch EM; cepstral domain; character error rate; convolutive noise; environment compensation; iterative stochastic approximation; linear polynomials; log-spectral domain; minimum mean squared error; nonstationary noise; quadratic polynomial; recursive EM; speech signals; stationary noise; telephone LVCSR; vector statistical piecewise polynomial approximation; Additive noise; Approximation algorithms; Cepstral analysis; Degradation; Polynomials; Speech enhancement; Stochastic resonance; Telephony; Vectors; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1202308
Filename
1202308
Link To Document