DocumentCode
417163
Title
A study of various composite kernels for kernel eigenvoice speaker adaptation
Author
Mak, Brian ; Kwok, James T. ; Ho, Simon
Author_Institution
Dept. of Comput. Sci., Hong Kong Univ. of Sci. & Technol., China
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Eigenvoice-based methods have been shown to be effective for fast speaker adaptation when the amount of adaptation data is small, say, less than 10 seconds. In traditional eigenvoice (EV) speaker adaptation, linear principal component analysis (PCA) is used to derive the eigenvoices. Recently, we proposed that eigenvoices found by nonlinear kernel PCA could be more effective, and the eigenvoices thus derived were called kernel eigenvoices (KEV). One of our novelties is the use of composite kernel that makes it possible to compute state observation likelihoods via kernel functions. We investigate two different composite kernels: direct sum kernel and tensor product kernel for KEV adaptation. In an evaluation on the TIDIGITS task, it is found that KEV speaker adaptations using either form of composite kernel are equally effective, and they outperform a speaker-independent model and the adapted models from EV, MAP, or MLLR adaptation using 2.1s and 4.1s of speech. For example, with 2.1s of adaptation data, KEV adaptation outperforms the speaker-independent model by 27.5%, whereas EV, MAP, and MLLR adaptations are not effective at all.
Keywords
eigenvalues and eigenfunctions; principal component analysis; speech recognition; composite kernels; direct sum kernel; kernel eigenvoice speaker adaptation; linear principal component analysis; nonlinear kernel PCA; speech recognition; tensor product kernel; Bayesian methods; Computer science; Error analysis; Face recognition; Kernel; Maximum likelihood linear regression; Principal component analysis; Speech analysis; Speech recognition; Tensile stress;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325988
Filename
1325988
Link To Document