• DocumentCode
    454754
  • Title

    Study Of Non-Linear Frequency Warping Functions For Speaker Normalization

  • Author

    Kumar, Bharath S V ; Umesh, S. ; Sinha, R.

  • Author_Institution
    Dept. of ECE, California Univ.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    In this paper, we study non-linear frequency-warping functions that are commonly used in speaker normalization. This study is motivated by our recently proposed affine transformation model for speaker normalization which has provided improved recognition performance when compared to uniform scaling model. In this work, using formant data from Peterson & Barney and Hillenbrand vowel databases, we analyze the behavior of scale factor as a function of frequency. The empirical observation shows that while uniform scaling assumption may be valid at higher frequencies, there are significant deviations at low frequencies. We show that while our recently proposed model has behavior similar to the empirical result, the behavior of many of the commonly used non-linear models (including that of Eide-Gish, power law and bilinear transformation) differ significantly from the empirical result. This difference in behavior from the empirical observation may explain the limited improvement in recognition performance provided by these non-linear models when compared to conventional uniform-scaling model. We also show that our proposed model does better fitting to the formant data than these non-linear models. We, therefore, conclude that the affine-transformation model may be a more appropriate non-linear model for speaker normalization
  • Keywords
    affine transforms; speech recognition; Hillenbrand vowel databases; Peterson & Barney vowel databases; affine transformation model; formant data; nonlinear frequency warping functions; speaker normalization; uniform scaling model; Data analysis; Databases; Fitting; Frequency estimation; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660253
  • Filename
    1660253