DocumentCode :
2703439
Title :
Importance of the Dynamic Range of an Analysis Window Function for Phase-Only and Magnitude-Only Reconstruction of Speech
Author :
Wojcicki, Kamil K. ; Paliwal, Kuldip K.
Author_Institution :
Signal Process. Lab., Griffith Univ., Brisbane, Qld., Australia
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
The short-time Fourier transform (STFT) of a speech signal has two components: the short-time magnitude spectrum and the short-time phase spectrum. It is traditionally believed that the short-time magnitude spectrum plays the dominant role for speech perception at small window durations (20-40 ms). However, recent perceptual studies have shown that the short-time phase spectrum can contribute as much to speech intelligibility as the short-time magnitude spectrum. It was observed that the use of the rectangular (non-tapered) analysis window for the computation of the short-time phase spectrum is more advantageous than the use of the Hamming (tapered) analysis window. This paper investigates the effect that the dynamic range of an analysis window has on the intelligibility of speech for phase-only and magnitude-only stimuli. For this purpose, the Chebyshev analysis window with adjustable equi-ripple side-lobes is employed. Two types of magnitude-only stimuli are investigated: random phase and zero phase. It is shown that the intelligibility of the magnitude-only stimuli constructed with zero phase is independent of the dynamic range of the analysis window, while the random phase stimuli are intelligible only for analysis windows with high dynamic range. This study also shows that for low dynamic range analysis windows, the short-time phase spectrum at small window durations (20-40 ms) contributes as much to speech intelligibility as the short-time magnitude spectrum.
Keywords :
Fourier transforms; speech intelligibility; speech processing; Chebyshev analysis window; Hamming analysis window; dynamic range; equi-ripple side-lobes; magnitude-only reconstruction; phase-only reconstruction; random phase; short-time Fourier transform; speech perception; speech reconstruction; speech signal; window function; zero phase; Attenuation; Chebyshev approximation; Laboratories; Phase estimation; Signal analysis; Signal processing; Speech analysis; Short-time magnitude spectrum; short-time phase spectrum
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367016
Filename :
4218204