• DocumentCode
    699936
  • Title

    A new feature analysis method for robust ASR in reverberant environments based on the harmonic structure of speech

  • Author

    Petrick, Rico ; Lohde, Kevin ; Lorenz, Mike ; Hoffmann, Ruediger

  • Author_Institution
    Lab. of Acoust. & Speech Commun., Dresden Univ. of Technol., Dresden, Germany
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    This article proposes a new signal analysis method for automatic speech recognition designed to aim high robustness against distortions caused by room reverberation. The method is initially named Harmonicity based Feature Analysis (HFA) and implements the following three ideas: (i) reconstruction of a spectrum from the harmonic components (assumed to be undistorted) of a voiced speech spectrum. (ii) suppression of disturbing reverberation in unvoiced spectra coming from previous voiced sections. (iii) high frequency regions are not affected by HFA since they have negligible effect on the recognition rate. HFA works on the basis of fundamental frequency estimation and voiced/unvoiced decision. Evaluation results show significant improvement of the recognition performance over a wide range of reverberant conditions while using HFA in connection with reverberant training. Apart from good performance, advantages of HFA compared to state of the art dereverberation approaches are real time processing (no adaptation time) and robustness against changes of the room impulse response.
  • Keywords
    frequency estimation; reverberation; speech recognition; transient response; HFA; automatic speech recognition; frequency estimation; harmonic components; harmonic speech structure; harmonicity based feature analysis; reverberant conditions; reverberant environments; reverberant training; robust ASR; room impulse response; room reverberation; signal analysis; spectrum reconstruction; unvoiced spectra; voiced speech spectrum; voiced-unvoiced decision; Harmonic analysis; Indexes; Reverberation; Robustness; Speech; Speech recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080468