• DocumentCode
    780237
  • Title

    Noisy Speech Enhancement Using Harmonic-Noise Model and Codebook-Based Post-Processing

  • Author

    Zavarehei, Esfandiar ; Vaseghi, Saeed ; Yan, Qin

  • Author_Institution
    DSP Audio Centre, Cambridge Silicon Radio
  • Volume
    15
  • Issue
    4
  • fYear
    2007
  • fDate
    5/1/2007 12:00:00 AM
  • Firstpage
    1194
  • Lastpage
    1203
  • Abstract
    This paper presents a post-processing speech restoration module for enhancing the performance of conventional speech enhancement methods. The restoration module aims to retrieve parts of speech spectrum that may be lost to noise or suppressed when using conventional speech enhancement methods. The proposed restoration method utilizes a harmonic plus noise model (HNM) of speech to retrieve damaged speech structure. A modified HNM of speech is proposed where, instead of the conventional binary labeling of the signal in each subband as voiced or unvoiced, the concept of harmonicity is introduced which is more adaptable to the codebook mapping method used in the later stage of enhancement. To restore the lost or suppressed information, an HNM codebook mapping technique is proposed. The HNM codebook is trained on speaker-independent speech data. To reduce the sensitivity of the HNM codebook to speaker variability, a spectral energy normalization process is introduced. The proposed post-processing method is tested as an add-on module with several popular noise reduction methods. Evaluations of the performance gain obtained from the proposed post-processing are presented and compared to standard speech enhancement systems which show substantial improvement gains in perceptual quality
  • Keywords
    harmonic analysis; speech coding; speech enhancement; codebook mapping method; codebook-based post-processing; harmonic plus noise model; harmonic-noise model; noise reduction methods; noise suppression; noisy speech enhancement; speaker-independent speech data; spectral energy normalization process; speech restoration module; speech spectrum; Digital signal processing; Frequency estimation; Mobile communication; Noise generators; Noise level; Noise reduction; Performance gain; Signal restoration; Signal to noise ratio; Speech enhancement; Harmonic-noise model; noise reduction; speech enhancement; speech reconstruction; weighted codebook mapping;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2007.894516
  • Filename
    4156224