DocumentCode :
3327933
Title :
Importance of window shape for phase-only reconstruction of speech
Author :
Alsteris, Leigh D. ; Paliwal, Kuldip K.
Author_Institution :
Sch. of Microelectron. Eng., Griffith Univ., Brisbane, Qld., Australia
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
The authors recently conducted a human perception experiment to measure the intelligibility of speech stimuli synthesised either from short-time magnitude spectra or short-time phase spectra. The results of the experiment indicate that even for small window durations (of relevance for automatic speech recognition applications), the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis-modification-synthesis parameters are properly selected. This intelligibility is significantly more than that reported by Liu et al. (1997), who carried out a similar experiment with the same analysis-modification-synthesis framework. The significant improvement in intelligibility over Liu´s results may be attributed to the differences in the parameter settings adopted. In this paper, we review our previous experiment and conduct an additional experiment to determine the contribution that each parameter setting provides towards the intelligibility of stimuli reconstructed from short-time phase spectra. The parameter selection that contributes most to the intelligibility of the phase-only stimuli is that of a rectangular analysis window, as opposed to a Hamming window (which is generally used in speech analysis).
Keywords :
signal reconstruction; spectral analysis; speech intelligibility; speech processing; speech recognition; transforms; Hamming window; automatic speech recognition; parameter setting; phase-only speech reconstruction; rectangular analysis window; short-time phase spectra; speech analysis; speech intelligibility; speech stimuli synthesis; window shape; Automatic speech recognition; Fourier transforms; Humans; Microelectronics; Phase measurement; Signal analysis; Signal processing; Speech analysis; Speech processing; Speech synthesis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326050
Filename :
1326050
Link To Document :
بازگشت