Title :
Fast Speaker Adaption Via Maximum Penalized Likelihood Kernel Regression
Author :
Tsang, Ivor W. ; Kwok, James T. ; Mak, Brian ; Zhang, Kai ; Pan, Jeffrey J.
Author_Institution :
Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., Kowloon
Abstract :
Maximum likelihood linear regression (MLLR) has been a popular speaker adaptation method for many years. In this paper, we investigate a generalization of MLLR using nonlinear regression. Specifically, kernel regression is applied with appropriate regularization to determine the transformation matrix in MLLR for fast speaker adaptation. The proposed method, called maximum penalized likelihood kernel regression adaptation (MPLKR), is computationally simple and the mean vectors of the speaker adapted acoustic model can be obtained analytically by simply solving a linear system. Since no nonlinear optimization is involved, the obtained solution is always guaranteed to be globally optimal. The new adaptation method was evaluated on the resource management task with 5s and 10s of adaptation speech. Results show that MPLKR outperforms the standard MLLR method
Keywords :
matrix algebra; maximum likelihood estimation; regression analysis; speech processing; maximum penalized likelihood kernel regression; nonlinear optimization; nonlinear regression; resource management; speaker adapted acoustic model; speaker adaption; transformation matrix; Computer science; Hidden Markov models; Kernel; Linear systems; Loudspeakers; Maximum likelihood estimation; Maximum likelihood linear regression; Principal component analysis; Speech recognition; Vectors;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660191