Title :
An investigation into front-end signal processing for speaker normalization
Author :
Umesh, S. ; Sinha, Rohit ; Kumar, S.V.B.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, India
Abstract :
Our investigation into the front-end signal processing for maximum likelihood based speaker normalization reveals that, in the linear scaling model, it is more appropriate (and evidently more correct) to assume that the spectral envelopes of any two speakers for the same sound are linearly scaled versions of one another, rather than assuming that the magnitudes of the whole spectra (including pitch harmonics) are scaled. The use of the proposed model and its implementation results in about 4% and 7% relative improvement for adults and children respectively on a digit recognition task.
Keywords :
signal processing; spectral analysis; speech recognition; digit recognition; front-end signal processing; linear scaling model; maximum likelihood based speaker normalization; spectral envelopes; speech recognition; Appropriate technology; Frequency estimation; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Mel frequency cepstral coefficient; Power harmonic filters; Signal processing; Speech; Testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1325993