DocumentCode :
2984717
Title :
Resonant Bandwidth Estimation of Vowels using Clustered-Line Spectrum Modeling for Pressure Speech Waveforms
Author :
Yasojima, O. ; Takahashi, Y. ; Tohyama, M.
Author_Institution :
Global Inf. & Telecommun. Studies, Waseda Univ., Tokyo
fYear :
2006
fDate :
Aug. 2006
Firstpage :
589
Lastpage :
593
Abstract :
The estimation of resonant frequency bandwidths is a fundamental issue related to the quality of spoken vowels and vocal-tract acoustics. In this article, we discuss our analysis of bandwidths using clustered line-spectrum modeling (CLSM) of the pressure waveforms of vowels on a cycle-by-cycle basis with reference to Lx waveforms from an electrolaryngograph recorded at the same time as the speech signal. We used CLSM to decompose the waveforms into three dominant resonant (modal) oscillations with almost exponentially decaying envelopes. The modal (so-called formant) frequencies were observed in a wide frequency range from 100 (Hz) to over 4 (kHz). The modal bandwidths were estimated from the decaying constants of the modal oscillations and were wider than those reported in the literature under the closed glottis condition. The bandwidths increased for both male and female speakers as the formant frequencies became higher. The bandwidths for females, however, were wider with greater variances than those for males. We could effectively represent a cycle of a vowel record shorter than 10 (ms) by CLSM based on the least squares error criterion in the frequency domain. We thus confirmed that cycle-by-cycle analysis using CLSM is a practical approach to characterizing vowel sounds in terms of dominant frequencies using their modal bandwidths
Keywords :
frequency-domain analysis; mean square error methods; spectral analysis; speech processing; 100 Hz to 4 kHz; Lx waveforms; closed glottis condition; clustered-line spectrum modeling; dominant resonant modal oscillations; electrolaryngograph; exponentially decaying envelopes; formant frequencies; frequency domain; least squares error criterion; pressure speech waveforms; resonant bandwidth vowel estimation; resonant frequency bandwidths; spoken vowel quality; vocal-tract acoustics; waveform decomposition; Bandwidth; Frequency domain analysis; Frequency estimation; Information technology; Least squares methods; Resonance; Resonant frequency; Signal analysis; Signal processing; Speech analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology, 2006 IEEE International Symposium on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9753-3
Electronic_ISBN :
0-7803-9754-1
Type :
conf
DOI :
10.1109/ISSPIT.2006.270869
Filename :
4042311
Link To Document :
بازگشت