DocumentCode :
3164930
Title :
Combining eigenvoice speaker modeling and VTS-based environment compensation for robust speech recognition
Author :
Ou, Zhijian ; Deng, Kan
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4673
Lastpage :
4676
Abstract :
Eigenvoice and vector Taylor series (VTS) are good models for speaker differences and environmental variations separately. However, speaker and environmental variation always coexist in real-world speech. In this paper, we propose to combine eigenvoice and VTS. Specifically, we introduce eigenvoice speaker modeling for the clean speech into VTS´s nonlinear mismatch function. In contrast, the standard VTS uses speaker-independent modeling to represent the clean speech, regardless of speaker differences. The eigenvoice coefficients and the noise model parameters are jointly estimated in the new approach. Experimental results on the Aurora2 task show the improved performances of combining eigenvoice and VTS and demonstrate its ability for speaker and noise factorization.
Keywords :
speaker recognition; Aurora2 task; VTS-based environment compensation; clean speech; eigenvoice coefficients; eigenvoice speaker modeling; environmental variation; noise factorization; noise model parameters; robust speech recognition; speaker factorization; speaker-independent modeling; vector Taylor series; Accuracy; Adaptation models; Hidden Markov models; Noise; Noise measurement; Speech; Speech recognition; eigenvoice; robust speech recognition; speaker adaptation; vector Taylor series;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288961
Filename :
6288961
Link To Document :
بازگشت