• DocumentCode
    977725
  • Title

    Automatic word recognition in cars

  • Author

    Mokbel, Chafic E. ; Chollet, Gérard F A

  • Author_Institution
    Telecom-Paris, CNRS, Paris, France
  • Volume
    3
  • Issue
    5
  • fYear
    1995
  • fDate
    9/1/1995 12:00:00 AM
  • Firstpage
    346
  • Lastpage
    356
  • Abstract
    The paper compares, on a database recorded in a car, a number of signal analysis and speech enhancement techniques as well as some approaches to adapt speech recognition systems. It is shown that a new nonlinear spectral subtraction associated with Mel frequency cepstral coefficients (MFCC) is an adequate compromise for low-cost integration. The Lombard effect is analyzed and simulated. Such a simulation is used to derive realistic training utterances from noise-free utterances. Adapting a continuous-density hidden Markov model (CDHMM) to these artificially generated training samples yields a very high performance with respect to that achieved within the ESPRIT adverse environment recognition of speech (ARS) project, i.e., an average of 1% error for all driving conditions. Finally, the paper shows, both theoretically and experimentally, that whatever the noise estimation technique is, it is better to add this noise estimate to the reference clean models than to subtract it from the noisy data
  • Keywords
    acoustic noise; cepstral analysis; hidden Markov models; speech enhancement; speech recognition; Lombard effect; Mel frequency cepstral coefficients; automatic word recognition; cars; continuous-density hidden Markov model; low-cost integration; noise estimation technique; noise-free utterances; nonlinear spectral subtraction; realistic training utterances; reference clean models; signal analysis; speech enhancement; speech recognition systems; training samples; Acoustic noise; Cepstral analysis; Databases; Frequency; Hidden Markov models; Microphones; Noise robustness; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.466660
  • Filename
    466660