DocumentCode :
1096966
Title :
Compensation of Nuisance Factors for Speaker and Language Recognition
Author :
Castaldo, Fabio ; Colibro, Daniele ; Dalmasso, Emanuele ; Laface, Pietro ; Vair, Claudio
Author_Institution :
Politecnico di Torino, Turin
Volume :
15
Issue :
7
fYear :
2007
Firstpage :
1969
Lastpage :
1978
Abstract :
The variability of the channel and environment is one of the most important factors affecting the performance of text-independent speaker verification systems. The best techniques for channel compensation are model based. Most of them have been proposed for Gaussian mixture models, while in the feature domain blind channel compensation is usually performed. The aim of this work is to explore techniques that allow more accurate intersession compensation in the feature domain. Compensating the features rather than the models has the advantage that the transformed parameters can be used with models of a different nature and complexity and for different tasks. In this paper, we evaluate the effects of the compensation of the intersession variability obtained by means of the channel factors approach. In particular, we compare channel variability modeling in the usual Gaussian mixture model domain, and our proposed feature domain compensation technique. We show that the two approaches lead to similar results on the NIST 2005 Speaker Recognition Evaluation data with a reduced computation cost. We also report the results of a system, based on the intersession compensation technique in the feature space that was among the best participants in the NIST 2006 Speaker Recognition Evaluation. Moreover, we show how we obtained significant performance improvement in language recognition by estimating and compensating, in the feature domain, the distortions due to interspeaker variability within the same language.
Keywords :
Gaussian processes; speaker recognition; Gaussian mixture model domain; NIST 2005 speaker recognition evaluation data; channel compensation; channel factors approach; feature domain blind channel compensation; intersession compensation; interspeaker variability; language recognition; nuisance factor compensation; text-independent speaker verification systems; Acoustic distortion; Acoustic testing; Automatic speech recognition; Computational efficiency; Gaussian distribution; Loudspeakers; NIST; Natural languages; Speaker recognition; System testing; Factor analysis; feature compensation; language recognition; speaker recognition;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.901823
Filename :
4291593
Link To Document :
بازگشت