Title :
Combining eigenvoice speaker modeling and VTS-based environment compensation for robust speech recognition
Author :
Ou, Zhijian ; Deng, Kan
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
Eigenvoice and vector Taylor series (VTS) are good models for speaker differences and environmental variations separately. However, speaker and environmental variation always coexist in real-world speech. In this paper, we propose to combine eigenvoice and VTS. Specifically, we introduce eigenvoice speaker modeling for the clean speech into VTS´s nonlinear mismatch function. In contrast, the standard VTS uses speaker-independent modeling to represent the clean speech, regardless of speaker differences. The eigenvoice coefficients and the noise model parameters are jointly estimated in the new approach. Experimental results on the Aurora2 task show the improved performances of combining eigenvoice and VTS and demonstrate its ability for speaker and noise factorization.
Keywords :
speaker recognition; Aurora2 task; VTS-based environment compensation; clean speech; eigenvoice coefficients; eigenvoice speaker modeling; environmental variation; noise factorization; noise model parameters; robust speech recognition; speaker factorization; speaker-independent modeling; vector Taylor series; Accuracy; Adaptation models; Hidden Markov models; Noise; Noise measurement; Speech; Speech recognition; eigenvoice; robust speech recognition; speaker adaptation; vector Taylor series;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288961