• DocumentCode
    312131
  • Title

    Smoothed spectral subtraction for a frequency-weighted HMM in noisy speech recognition

  • Author

    Matsumoto, Hiroshi ; Naitoh, Noboru

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Shinshu Univ., Nagano, Japan
  • Volume
    2
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    905
  • Abstract
    The paper proposes improved methods of smoothed spectral subtraction to enhance the recognition performance of a frequency weighted HMM (HMM-FW) in very noisy environments. The conventional spectral subtraction tends to produce discontinuity in estimated power spectra. This distortion is undesirable for HMM-FW which uses group delay spectra as feature vectors. In order to remove this distortion, the paper proposes two frequency smoothing methods in log spectral domain: (1) a low pass filtering by DCT; and (2) a weighted minimum mean square error method (WMSE) which fits cosine series to an estimated log power spectrum. The results show that the smoothers are very effective under very noisy conditions, especially for the frequency weighted HMM. The WMSE method combined with HMM-FW achieves the highest recognition accuracies, for instance, improving recognition rate from 68% to 88% at -6 dB SNR of car noise
  • Keywords
    acoustic noise; hidden Markov models; least mean squares methods; speech enhancement; speech recognition; DCT; WMSE method; cosine series; estimated log power spectrum; estimated power spectra; feature vectors; frequency smoothing methods; frequency weighted HMM; group delay spectra; log spectral domain; low pass filtering; noisy conditions; noisy environments; noisy speech recognition; recognition accuracies; recognition performance; recognition rate; smoothed spectral subtraction; weighted minimum mean square error method; Delay; Discrete cosine transforms; Filtering; Frequency estimation; Hidden Markov models; Low pass filters; Mean square error methods; Signal to noise ratio; Smoothing methods; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607748
  • Filename
    607748