DocumentCode
3327933
Title
Importance of window shape for phase-only reconstruction of speech
Author
Alsteris, Leigh D. ; Paliwal, Kuldip K.
Author_Institution
Sch. of Microelectron. Eng., Griffith Univ., Brisbane, Qld., Australia
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The authors recently conducted a human perception experiment to measure the intelligibility of speech stimuli synthesised either from short-time magnitude spectra or short-time phase spectra. The results of the experiment indicate that even for small window durations (of relevance for automatic speech recognition applications), the phase spectrum can contribute to speech intelligibility as much as the magnitude spectrum if the analysis-modification-synthesis parameters are properly selected. This intelligibility is significantly more than that reported by Liu et al. (1997), who carried out a similar experiment with the same analysis-modification-synthesis framework. The significant improvement in intelligibility over Liu´s results may be attributed to the differences in the parameter settings adopted. In this paper, we review our previous experiment and conduct an additional experiment to determine the contribution that each parameter setting provides towards the intelligibility of stimuli reconstructed from short-time phase spectra. The parameter selection that contributes most to the intelligibility of the phase-only stimuli is that of a rectangular analysis window, as opposed to a Hamming window (which is generally used in speech analysis).
Keywords
signal reconstruction; spectral analysis; speech intelligibility; speech processing; speech recognition; transforms; Hamming window; automatic speech recognition; parameter setting; phase-only speech reconstruction; rectangular analysis window; short-time phase spectra; speech analysis; speech intelligibility; speech stimuli synthesis; window shape; Automatic speech recognition; Fourier transforms; Humans; Microelectronics; Phase measurement; Signal analysis; Signal processing; Speech analysis; Speech processing; Speech synthesis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326050
Filename
1326050
Link To Document