DocumentCode :
417168
Title :
An investigation into front-end signal processing for speaker normalization
Author :
Umesh, S. ; Sinha, Rohit ; Kumar, S.V.B.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, India
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
Our investigation into the front-end signal processing for maximum likelihood based speaker normalization reveals that, in the linear scaling model, it is more appropriate (and evidently more correct) to assume that the spectral envelopes of any two speakers for the same sound are linearly scaled versions of one another, rather than assuming that the magnitudes of the whole spectra (including pitch harmonics) are scaled. The use of the proposed model and its implementation results in about 4% and 7% relative improvement for adults and children respectively on a digit recognition task.
Keywords :
signal processing; spectral analysis; speech recognition; digit recognition; front-end signal processing; linear scaling model; maximum likelihood based speaker normalization; spectral envelopes; speech recognition; Appropriate technology; Frequency estimation; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Mel frequency cepstral coefficient; Power harmonic filters; Signal processing; Speech; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325993
Filename :
1325993
Link To Document :
بازگشت