Title :
Rapid Speaker Adaptation using Maximum Likelihood Neural Regression
Author :
Bahari, Mohamad Hasan ; Van hamme, Hugo
Author_Institution :
Dept. of Electr. Eng. (ESAT), Katholieke Univ. Leuven, Leuven, Belgium
Abstract :
In this paper, a new method called Maximum Likelihood Neural Regression (MLNR) is introduced for Rapid Speaker Adaptation (RSA). MLNR, which is conceptually simple, adapts the Gaussian means of a speaker independent (SI) model to the data of a new speaker by assuming a non-linear mapping from the SI Gaussian means to the adapted Gaussian means. It performs a non linear regression between maximum likelihood (ML) estimates of the means and the speaker independent means using General Regression Neural Networks (GRNN). Evaluation on the Wall Street Journal benchmark shows that the suggested scheme outperforms different conventional approaches.
Keywords :
maximum likelihood estimation; neural nets; regression analysis; speaker recognition; Gaussian means; Wall Street Journal benchmark; general regression neural networks; maximum likelihood estimation; maximum likelihood neural regression; nonlinear mapping; nonlinear regression; rapid speaker adaptation; speaker independent model; Adaptation models; Data models; Hidden Markov models; Kernel; Maximum likelihood estimation; Silicon; Training; General regression neural networks; maximum likelihood; non-linear speaker adaptation; rapid speaker adaptation;
Conference_Titel :
Multimedia and Expo (ICME), 2011 IEEE International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-61284-348-3
Electronic_ISBN :
1945-7871
DOI :
10.1109/ICME.2011.6012192