• DocumentCode
    2254128
  • Title

    Frequency-warping in speech

  • Author

    Umesh, S. ; Cohen, L. ; Marinovic, N. ; Nelson, D.

  • Author_Institution
    Hunter Coll., City Univ. of New York, NY, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    3-6 Oct 1996
  • Firstpage
    414
  • Abstract
    We present results that indicate that the formant frequencies between different speakers scale differently at different frequencies. Based on our experiments on speech data, we then numerically compute a universal frequency-warping function, to make the scale-factor independent of frequency in the warped domain. The proposed warping function is found to be similar to the mel-scale, which has previously been derived from purely psycho-acoustic experiments. The motivation for the present experiments stems from our proposed use of scale-transform based cepstral coefficients (Umesh et al., 1996) as acoustic features, since they provide superior separability of vowels than mel-cepstral coefficients
  • Keywords
    Fourier transforms; cepstral analysis; feature extraction; parameter estimation; speech recognition; acoustic features; cepstral coefficients; formant frequencies; mel-scale; parameter estimation; psycho-acoustic experiments; scale-factor; scale-transform; speakers; speech data; speech frequency-warping; speech recognition; universal frequency-warping function; vowel separability; Acoustic scattering; Cepstral analysis; Cepstrum; Databases; Educational institutions; Fourier transforms; Frequency estimation; Loudspeakers; Psychology; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
  • Conference_Location
    Philadelphia, PA
  • Print_ISBN
    0-7803-3555-4
  • Type

    conf

  • DOI
    10.1109/ICSLP.1996.607142
  • Filename
    607142