مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environment

DocumentCode :

3631364

Title :

Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environment

Author :

Hynek Boril;John H. L. Hansen

Author_Institution :

Center for Robust Speech Systems, Erik Jonsson School of Engineering & Computer Science, The University of Texas at Dallas, USA

fYear :

2009

Firstpage :

3937

Lastpage :

3940

Abstract :

When exposed to environmental noise, speakers adjust their speech production to maintain intelligible communication. This phenomenon, called Lombard effect (LE), is known to considerably impact the performance of automatic speech recognition (ASR) systems. In this study, novel frequency and cepstral domain equalizations that reduce the impact of LE on ASR are proposed. Short-time spectra of LE speech are transformed towards neutral ASR models in a maximum likelihood fashion. Dynamics of cepstral coefficients are normalized to a constant range using quantile estimations. The algorithms are incorporated in a recognizer employing a codebook of noisy acoustic models. In a recognition task on connected Czech digits presented in various levels of background car noise, the resulting system provides an absolute reduction in word error rate (WER) on 10 dB SNR data of 8.7% and 37.7% for female neutral and LE speech, and of 8.7% and 32.8% for male neutral and LE speech when compared to the baseline system employing perceptual linear prediction (PLP) coefficients and cepstral mean and variance normalization.

Keywords :

"Speech recognition","Working environment noise","Automatic speech recognition","Speech enhancement","Cepstral analysis","Acoustic noise","Maximum likelihood estimation","Noise reduction","Background noise","Noise level"

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

2379-190X

Type :

conf

DOI :

10.1109/ICASSP.2009.4960489

Filename :

4960489

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3631364