• DocumentCode
    427702
  • Title

    Improved perceptually inspired speech enhancement using a psychoacoustic model

  • Author

    Hu, Rongqiang ; Anderson, David V.

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    7-10 Nov. 2004
  • Firstpage
    440
  • Abstract
    A speech enhancement algorithm is described that uses a psychoacoustic model to estimate speech cues in the presence of low SNRs. The motivation of the proposed algorithm is to model the peripheral of human auditory system and derive a perceptually improved solution for noise suppression. In the model, a detector for speech saliency is derived to measure the degree of conspicuousness of "significant" speech signals with the presence of background noise. Three biological correlates are exploited, including a spectral saliency determining the speech cues by frequency sensitivity of human auditory system in cochlear, a phoneme saliency indicating the perception discrimination of phonemes and an audibility saliency illuminating psychoacoustic masking properties by the frequency-to-place transformation of basilar membrane. The detector generates frequency-based soft-decisions that are used in determining the presence of speech cues and controlling the parameters of speech enhancement to preserve interested components.
  • Keywords
    acoustic noise; ear; hearing; interference suppression; speech enhancement; speech intelligibility; background noise; basilar membrane; cochlear; frequency-based soft-decision; frequency-to-place transformation; human auditory system; noise suppression; perception discrimination; phoneme saliency; psychoacoustic model; speech enhancement algorithm; speech intelligibility; speech signals; Auditory system; Background noise; Biological system modeling; Detectors; Frequency; Humans; Noise measurement; Psychoacoustic models; Speech coding; Speech enhancement;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2004. Conference Record of the Thirty-Eighth Asilomar Conference on
  • Print_ISBN
    0-7803-8622-1
  • Type

    conf

  • DOI
    10.1109/ACSSC.2004.1399170
  • Filename
    1399170