DocumentCode
2254128
Title
Frequency-warping in speech
Author
Umesh, S. ; Cohen, L. ; Marinovic, N. ; Nelson, D.
Author_Institution
Hunter Coll., City Univ. of New York, NY, USA
Volume
1
fYear
1996
fDate
3-6 Oct 1996
Firstpage
414
Abstract
We present results that indicate that the formant frequencies between different speakers scale differently at different frequencies. Based on our experiments on speech data, we then numerically compute a universal frequency-warping function, to make the scale-factor independent of frequency in the warped domain. The proposed warping function is found to be similar to the mel-scale, which has previously been derived from purely psycho-acoustic experiments. The motivation for the present experiments stems from our proposed use of scale-transform based cepstral coefficients (Umesh et al., 1996) as acoustic features, since they provide superior separability of vowels than mel-cepstral coefficients
Keywords
Fourier transforms; cepstral analysis; feature extraction; parameter estimation; speech recognition; acoustic features; cepstral coefficients; formant frequencies; mel-scale; parameter estimation; psycho-acoustic experiments; scale-factor; scale-transform; speakers; speech data; speech frequency-warping; speech recognition; universal frequency-warping function; vowel separability; Acoustic scattering; Cepstral analysis; Cepstrum; Databases; Educational institutions; Fourier transforms; Frequency estimation; Loudspeakers; Psychology; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location
Philadelphia, PA
Print_ISBN
0-7803-3555-4
Type
conf
DOI
10.1109/ICSLP.1996.607142
Filename
607142
Link To Document