Regional Information Center for Science and Technology - Combining eigenvoice speaker modeling and VTS-based environment compensation for robust speech recognition

DocumentCode :

3164930

Title :

Combining eigenvoice speaker modeling and VTS-based environment compensation for robust speech recognition

Author :

Ou, Zhijian ; Deng, Kan

Author_Institution :

Dept. of Electron. Eng., Tsinghua Univ., Beijing, China

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4673

Lastpage :

4676

Abstract :

Eigenvoice and vector Taylor series (VTS) are good models for speaker differences and environmental variations, respectively. However, speaker and environmental variations always coexist in real-world speech. In this paper, we propose to combine eigenvoice and VTS. Specifically, we introduce eigenvoice speaker modeling for the clean speech into VTS's nonlinear mismatch function. In contrast, the standard VTS uses speaker-independent modeling to represent the clean speech, regardless of speaker differences. The eigenvoice coefficients and the noise model parameters are jointly estimated in the new approach. Experimental results on the Aurora2 task show that combining eigenvoice and VTS improves performance and demonstrate the method's ability to factorize speaker and noise effects.
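
The combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the commonly used log-spectral form of the VTS mismatch function, y = x + h + log(1 + exp(n - x - h)), applied per dimension, and a zeroth-order expansion at the means. The function names (eigenvoice_mean, vts_mismatch, compensated_mean) and all parameters are hypothetical, and the joint estimation of eigenvoice coefficients and noise parameters is not shown.

```python
import math

def eigenvoice_mean(mean_voice, eigenvoices, weights):
    """Speaker-dependent clean-speech mean: mean_voice + sum_k weights[k] * eigenvoices[k]."""
    mu = list(mean_voice)
    for w, e in zip(weights, eigenvoices):
        for d in range(len(mu)):
            mu[d] += w * e[d]
    return mu

def vts_mismatch(x, h, n):
    """Assumed log-spectral mismatch: y = x + h + log(1 + exp(n - x - h)) per dimension.

    x is clean speech, h the channel, n the additive noise (all log-spectral vectors).
    Not numerically guarded: exp() can overflow if n - x - h is very large.
    """
    return [xd + hd + math.log1p(math.exp(nd - xd - hd))
            for xd, hd, nd in zip(x, h, n)]

def compensated_mean(mean_voice, eigenvoices, weights, mu_h, mu_n):
    """Zeroth-order VTS: evaluate the mismatch function at the eigenvoice clean-speech mean."""
    mu_x = eigenvoice_mean(mean_voice, eigenvoices, weights)
    return vts_mismatch(mu_x, mu_h, mu_n)
```

Under these assumptions, replacing the speaker-independent clean-speech mean with eigenvoice_mean(...) is the only change relative to standard VTS mean compensation; when the noise mean is far below the speech level, the compensated mean reduces to the clean mean plus the channel term.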

Keywords :

speaker recognition; Aurora2 task; VTS-based environment compensation; clean speech; eigenvoice coefficients; eigenvoice speaker modeling; environmental variation; noise factorization; noise model parameters; robust speech recognition; speaker factorization; speaker-independent modeling; vector Taylor series; Accuracy; Adaptation models; Hidden Markov models; Noise; Noise measurement; Speech; Speech recognition; eigenvoice; robust speech recognition; speaker adaptation; vector Taylor series;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288961

Filename :

6288961

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3164930