DocumentCode
394310
Title
Speaker adaptation by hierarchical EigenVoice
Author
Onishi, Yoshijiumi ; Iso, Ken-ichi
Author_Institution
Multimedia Res. Labs., NEC Corp., Kawasaki, Japan
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
We propose a novel speaker adaptation method, hierarchical EigenVoice (HEV). This method extends the eigenvoice through clustering the Gaussian components of HMMs into a hierarchical tree structure. It enables one to autonomously control a number of adaptation parameters (model complexity) depending on the amount of adaptation utterances from a new speaker. The experimental results of Japanese large vocabulary continuous speech recognition confirmed the significant performance increase in all range of the adaptation utterance amounts compared with the conventional speaker adaptation methods.
Keywords
Gaussian processes; eigenvalues and eigenfunctions; hidden Markov models; pattern clustering; speech recognition; Gaussian components clustering; Japan; adaptation parameters; adaptation utterance; continuous density mixture Gaussian HMM; hidden Markov models; hierarchical EigenVoice; hierarchical tree structure; large vocabulary continuous speech recognition; model complexity; speaker adaptation; speaker adaptation method; Adaptation model; Databases; Hidden Markov models; Hybrid electric vehicles; Laboratories; Loudspeakers; Maximum likelihood linear regression; Principal component analysis; Speech recognition; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198846
Filename
1198846
Link To Document