• DocumentCode
    294545
  • Title

    Rapid environment adaptation for robust speech recognition

  • Author

    Takagi, Keizaburo ; Hattori, Hiroak ; Watanabe, Takao

  • Author_Institution
    C&C Inf. Technol. Res. Labs., NEC Corp., Kawasaki, Japan
  • Volume
    1
  • fYear
    1995
  • fDate
    9-12 May 1995
  • Firstpage
    149
  • Abstract
    The paper proposes a rapid environment adaptation algorithm based on spectrum equalization (REALISE). In practical speech recognition applications, differences between training and testing environments often seriously diminish recognition accuracy. These environmental differences can be classified into two types: difference in additive noise and difference in multiplicative noise in the spectral domain. The proposed method calculates time-alignment between a testing utterance and the closest reference pattern to it, and then calculates the noise differences between the two according to the time-alignment. Then, the authors adapt all reference patterns to the testing environment using the differences. Finally, the testing utterance is recognized using the adapted reference patterns. In a 250 Japanese word recognition task, in which the training and testing microphones were of two different types, REALISE improved recognition accuracy from 87% to 96%
  • Keywords
    acoustic noise; adaptive equalisers; spectral analysis; spectral-domain analysis; speech recognition; REALISE; additive noise; closest reference pattern; multiplicative noise; rapid environment adaptation algorithm based on spectrum equalization; robust speech recognition; spectral domain; testing environments; testing utterance; time-alignment; training environment; Acoustic testing; Additive noise; Computational efficiency; Hidden Markov models; High performance computing; Iterative algorithms; Microphones; Pattern recognition; Robustness; Speech enhancement; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
  • Conference_Location
    Detroit, MI
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-2431-5
  • Type

    conf

  • DOI
    10.1109/ICASSP.1995.479386
  • Filename
    479386