• DocumentCode
    394306
  • Title

    A method for compensation of Jacobian in speaker normalization

  • Author

    Sinha, Rohit ; Umesh, S.

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, India
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    In the conventional maximum likelihood based speaker normalization approach, the optimal frequency warping factors are estimated by maximizing the likelihood of warped features in a grid search. The conventional method of likelihood computation for warped features does not account for the Jacobian of the transformation. This fact is pointed out by some researchers who have also shown that frequency warping is equivalent to the transformation in the cepstral domain. As an approximation, variance normalization of cepstral features is used before likelihood computation to account for the Jacobian. In this paper, we suggest an alternate method to avoid the Jacobian problem. Our preliminary investigation shows that our proposed method provides improvement in normalization performance compared to the conventional method of warping factor estimation for a digit recognition task.
  • Keywords
    compensation; maximum likelihood estimation; speech recognition; Jacobian; cepstral domain; digit recognition; grid search; likelihood computation; maximum likelihood; optimal frequency warping factors; speaker normalization; variance normalization; warped features; warping factor; Cepstral analysis; Frequency estimation; Hidden Markov models; Jacobian matrices; Maximum likelihood estimation; Performance evaluation; Testing; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198842
  • Filename
    1198842