• DocumentCode
    417168
  • Title

    An investigation into front-end signal processing for speaker normalization

  • Author

    Umesh, S. ; Sinha, Rohit ; Kumar, S.V.B.

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, India
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Our investigation into the front-end signal processing for maximum likelihood based speaker normalization reveals that, in the linear scaling model, it is more appropriate (and evidently more correct) to assume that the spectral envelopes of any two speakers for the same sound are linearly scaled versions of one another, rather than assuming that the magnitudes of the whole spectra (including pitch harmonics) are scaled. The use of the proposed model and its implementation results in about 4% and 7% relative improvement for adults and children respectively on a digit recognition task.
  • Keywords
    signal processing; spectral analysis; speech recognition; digit recognition; front-end signal processing; linear scaling model; maximum likelihood based speaker normalization; spectral envelopes; speech recognition; Appropriate technology; Frequency estimation; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Mel frequency cepstral coefficient; Power harmonic filters; Signal processing; Speech; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1325993
  • Filename
    1325993