Automatic word recognition in cars

Author

Mokbel, Chafic E. ; Chollet, Gérard F A

Author_Institution

Telecom-Paris, CNRS, Paris, France

Volume

3

Issue

5

fYear

1995

fDate

9/1/1995 12:00:00 AM

Firstpage

346

Lastpage

356

Abstract

The paper compares, on a database recorded in a car, a number of signal analysis and speech enhancement techniques as well as some approaches to adapt speech recognition systems. It is shown that a new nonlinear spectral subtraction associated with Mel frequency cepstral coefficients (MFCC) is an adequate compromise for low-cost integration. The Lombard effect is analyzed and simulated. Such a simulation is used to derive realistic training utterances from noise-free utterances. Adapting a continuous-density hidden Markov model (CDHMM) to these artificially generated training samples yields a very high performance with respect to that achieved within the ESPRIT adverse environment recognition of speech (ARS) project, i.e., an average of 1% error for all driving conditions. Finally, the paper shows, both theoretically and experimentally, that whatever the noise estimation technique is, it is better to add this noise estimate to the reference clean models than to subtract it from the noisy data

Keywords

acoustic noise; cepstral analysis; hidden Markov models; speech enhancement; speech recognition; Lombard effect; Mel frequency cepstral coefficients; automatic word recognition; cars; continuous-density hidden Markov model; low-cost integration; noise estimation technique; noise-free utterances; nonlinear spectral subtraction; realistic training utterances; reference clean models; signal analysis; speech enhancement; speech recognition systems; training samples; Acoustic noise; Cepstral analysis; Databases; Frequency; Hidden Markov models; Microphones; Noise robustness; Speech enhancement; Speech recognition; Working environment noise;

fLanguage

English

Journal_Title

Speech and Audio Processing, IEEE Transactions on

Publisher

ieee

ISSN

1063-6676

Type

jour

DOI

10.1109/89.466660

Filename

466660