DocumentCode
417168
Title
An investigation into front-end signal processing for speaker normalization
Author
Umesh, S. ; Sinha, Rohit ; Kumar, S.V.B.
Author_Institution
Dept. of Electr. Eng., Indian Inst. of Technol., Kanpur, India
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Our investigation into the front-end signal processing for maximum likelihood based speaker normalization reveals that, in the linear scaling model, it is more appropriate (and evidently more correct) to assume that the spectral envelopes of any two speakers for the same sound are linearly scaled versions of one another, rather than assuming that the magnitudes of the whole spectra (including pitch harmonics) are scaled. The use of the proposed model and its implementation results in about 4% and 7% relative improvement for adults and children respectively on a digit recognition task.
Keywords
signal processing; spectral analysis; speech recognition; digit recognition; front-end signal processing; linear scaling model; maximum likelihood based speaker normalization; spectral envelopes; speech recognition; Appropriate technology; Frequency estimation; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Mel frequency cepstral coefficient; Power harmonic filters; Signal processing; Speech; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325993
Filename
1325993
Link To Document