• DocumentCode
    2018208
  • Title

    The psychoacoustic approach towards enhancing speech intelligibility in noise

  • Author

    Chan, Paul Yaozhu ; Dong, Minghui ; Cen, Ling ; Li, Haizhou

  • Author_Institution
    Dept. of Human Language Technol., Agency for Sci. Technol. & Res. (A*STAR), Singapore, Singapore
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    238
  • Lastpage
    241
  • Abstract
    In this paper, we propose a psychoacoustic approach towards enhancing speech intelligibility in noise. Understanding the relationship between the short-term spectral movement of a sound and a listener´s sensitivity towards it, we conjecture that humans rely greatly on Inter-Phoneme Spectral Gradients (IPSGs) to distinguish each phoneme, especially when the short-term speech spectrum is masked by extremely high levels of noise. We then move on to explain how the IPSG may most effectively be steepened while introducing the concept of Formant Contrast. The effectiveness of this process is validated with spectral analysis and listening tests, verifying that our initial deduction is true. In these, we present a simple, yet novel and effective method of improving speech intelligibility - especially in extremely high noise environments.
  • Keywords
    noise; spectral analysis; speech intelligibility; speech synthesis; formant contrast; interphoneme spectral gradient; listener sensitivity; noise susceptibility; psychoacoustic approach; short term spectral movement; spectral analysis; speech intelligibility; speech synthesis; Humans; Real time systems; Signal to noise ratio; Spectrogram; Speech; Speech enhancement; formant contrast; noise susceptibility; noise tolerance; spectral gradient; speech intelligibility; speech synthesis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684902
  • Filename
    5684902