DocumentCode :
3200398
Title :
Rapid Speaker Adaptation using Maximum Likelihood Neural Regression
Author :
Bahari, Mohamad Hasan ; Van hamme, Hugo
Author_Institution :
Dept. of Electr. Eng. (ESAT), Katholieke Univ. Leuven, Leuven, Belgium
fYear :
2011
fDate :
11-15 July 2011
Firstpage :
1
Lastpage :
6
Abstract :
In this paper, a new method called Maximum Likelihood Neural Regression (MLNR) is introduced for Rapid Speaker Adaptation (RSA). MLNR, which is conceptually simple, adapts the Gaussian means of a speaker independent (SI) model to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. It performs a non linear regression between maximum likelihood (ML) estimates of the means and the speaker independent means using General Regression Neural Networks (GRNN). Evaluation on the Wall Street Journal benchmark shows that the suggested scheme outperforms different conventional approaches.
Keywords :
maximum likelihood estimation; neural nets; regression analysis; speaker recognition; Gaussian means; Wall Street Journal benchmark; general regression neural networks; maximum likelihood estimation; maximum likelihood neural regression; nonlinear mapping; nonlinear regression; rapid speaker adaptation; speaker independent model; Adaptation models; Data models; Hidden Markov models; Kernel; Maximum likelihood estimation; Silicon; Training; General regression neural networks; maximum likelihood; non-linear speaker adaptation; rapid speaker adaptation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location :
Barcelona
ISSN :
1945-7871
Print_ISBN :
978-1-61284-348-3
Electronic_ISBN :
1945-7871
Type :
conf
DOI :
10.1109/ICME.2011.6012192
Filename :
6012192
Link To Document :
بازگشت