• DocumentCode
    3245028
  • Title

    High resolution signal reconstruction

  • Author

    Kristjansson, T. ; Hershey, John

  • fYear
    2003
  • fDate
    30 Nov.-3 Dec. 2003
  • Firstpage
    291
  • Lastpage
    296
  • Abstract
    We present a framework for speech enhancement and robust speech recognition that exploits the harmonic structure of speech. We achieve substantial gains in signal-to-noise ratio (SNR) of enhanced speech as well as considerable gains in accuracy of automatic speech recognition in very noisy conditions. The method exploits the harmonic structure of speech by employing a high frequency resolution speech model in the log-spectrum domain and reconstructs the signal from the estimated posteriors of the clean signal and the phases from the original noisy signal. We achieve a gain in SNR of 8.38 dB for enhancement of speech at 0 dB. We also present recognition results on the Aurora 2 data-set. At 0 dB SNR, we achieve a reduction of relative word error rate of 43.75% over the baseline, and 15.90% over the equivalent low-resolution algorithm.
  • Keywords
    error statistics; parameter estimation; signal reconstruction; signal resolution; speech enhancement; speech recognition; SNR; automatic speech recognition; clean signal posterior estimation; high resolution signal reconstruction; log-spectrum domain; signal-to-noise ratio; speech enhancement; word error rate; Automatic speech recognition; Frequency estimation; Phase estimation; Phase noise; Robustness; Signal reconstruction; Signal resolution; Signal to noise ratio; Speech enhancement; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7980-2
  • Type

    conf

  • DOI
    10.1109/ASRU.2003.1318456
  • Filename
    1318456