DocumentCode :
3423383
Title :
Fast speaker adaptation using non-negative matrix factorization
Author :
Duchateau, Jacques ; Leroy, Tobias ; Demuynck, Kris ; Hamme, Hugo Van
Author_Institution :
ESAT, Katholieke Univ. Leuven, Leuven
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4269
Lastpage :
4272
Abstract :
This paper describes a new method for fast speaker adaptation in large vocabulary recognition systems. As in most HMM-based recognizers, the observation densities are modeled as a weighted sum of Gaussian densities. Instead of adapting the means of the Gaussian densities, which is typically done, the weights for the Gaussian densities in the states are adapted. By applying non-negative matrix factorization (NMF) in the proposed method, very fast adaptation was achieved. Experiments on the Wall Street Journal benchmark recognition task show relative improvements between 5% and 15%, while the adaptation converges within 0.2 seconds. Analysis of the latent speakers found by NMF learns that these latent speakers reflect the gender of the speaker most prominently, even when vocal tract length normalization is used, and that they reflect the speaker´s age more clearly than the speaker´s regional influences or dialect.
Keywords :
Gaussian processes; matrix decomposition; speaker recognition; Gaussian densities; Wall Street Journal benchmark recognition; fast speaker adaptation; nonnegative matrix factorization; vocabulary recognition; Adaptation model; Adaptive systems; Databases; Hidden Markov models; Loudspeakers; Matrix decomposition; Speech recognition; Training data; Vocabulary; Working environment noise; Speech recognition; adaptive systems; matrix decomposition; non-negative matrix factorization; speaker adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518598
Filename :
4518598
Link To Document :
بازگشت