Title :
Frequency-warping in speech
Author :
Umesh, S. ; Cohen, L. ; Marinovic, N. ; Nelson, D.
Author_Institution :
Hunter Coll., City Univ. of New York, NY, USA
Abstract :
We present results that indicate that the formant frequencies between different speakers scale differently at different frequencies. Based on our experiments on speech data, we then numerically compute a universal frequency-warping function, to make the scale-factor independent of frequency in the warped domain. The proposed warping function is found to be similar to the mel-scale, which has previously been derived from purely psycho-acoustic experiments. The motivation for the present experiments stems from our proposed use of scale-transform based cepstral coefficients (Umesh et al., 1996) as acoustic features, since they provide superior separability of vowels than mel-cepstral coefficients
Keywords :
Fourier transforms; cepstral analysis; feature extraction; parameter estimation; speech recognition; acoustic features; cepstral coefficients; formant frequencies; mel-scale; parameter estimation; psycho-acoustic experiments; scale-factor; scale-transform; speakers; speech data; speech frequency-warping; speech recognition; universal frequency-warping function; vowel separability; Acoustic scattering; Cepstral analysis; Cepstrum; Databases; Educational institutions; Fourier transforms; Frequency estimation; Loudspeakers; Psychology; Speech recognition;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607142