DocumentCode
2018208
Title
The psychoacoustic approach towards enhancing speech intelligibility in noise
Author
Chan, Paul Yaozhu ; Dong, Minghui ; Cen, Ling ; Li, Haizhou
Author_Institution
Dept. of Human Language Technol., Agency for Sci. Technol. & Res. (A*STAR), Singapore, Singapore
fYear
2010
fDate
Nov. 29 2010-Dec. 3 2010
Firstpage
238
Lastpage
241
Abstract
In this paper, we propose a psychoacoustic approach towards enhancing speech intelligibility in noise. Drawing on the relationship between the short-term spectral movement of a sound and a listener's sensitivity towards it, we conjecture that humans rely greatly on Inter-Phoneme Spectral Gradients (IPSGs) to distinguish each phoneme, especially when the short-term speech spectrum is masked by extremely high levels of noise. We then explain how the IPSG may most effectively be steepened, introducing the concept of Formant Contrast. The effectiveness of this process is validated with spectral analysis and listening tests, verifying our initial conjecture. In doing so, we present a simple yet novel and effective method of improving speech intelligibility, especially in extremely noisy environments.
Keywords
noise; spectral analysis; speech intelligibility; speech synthesis; formant contrast; interphoneme spectral gradient; listener sensitivity; noise susceptibility; psychoacoustic approach; short term spectral movement; Humans; Real time systems; Signal to noise ratio; Spectrogram; Speech; Speech enhancement; noise tolerance; spectral gradient
fLanguage
English
Publisher
IEEE
Conference_Titel
Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
Conference_Location
Tainan
Print_ISBN
978-1-4244-6244-5
Type
conf
DOI
10.1109/ISCSLP.2010.5684902
Filename
5684902