• DocumentCode
    417303
  • Title

    Multi-environment models based linear normalization for speech recognition in car conditions

  • Author

    Buera, Luis ; Lleida, Eduardo ; Miguel, Antonio ; Ortega, Alfonso

  • Author_Institution
    Univ. of Zaragoza, Spain
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    A multi-environment adaptation technique, based on minimum mean squared error estimation, is proposed. MEMLIN (multi-environment models based linear normalization) consists of a feature adaptation using stereo data and several basic defined environments. The target of this algorithm is to learn the difference between clean and noisy feature vectors associated to a pair of Gaussians (one for a clean model, and the other for a noisy model), for each basic environment. This knowledge, the associated Gaussians, the conditional probability between clean and noisy Gaussians, and the environment are the data used to compensate the mismatch between clean and noisy vectors. This algorithm obtains important improvements regarding other techniques that look for similar targets. The experimental results with the SpeechDat Car database shows an average improvement of more than 68%, concerning the baseline, over 7 different defined environments.
  • Keywords
    Gaussian distribution; acoustic noise; adaptive signal processing; least mean squares methods; parameter estimation; random noise; speech recognition; SpeechDat Car database; clean feature vectors; feature adaptation; in-car conditions; minimum mean squared error estimation; multi-environment models based linear normalization; noisy feature vectors; probability; speech recognition; stereo data; Acoustic testing; Adaptation model; Databases; Error analysis; Gaussian distribution; Gaussian noise; Gaussian processes; Speech recognition; Vectors; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326160
  • Filename
    1326160