DocumentCode :
2174215
Title :
Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation
Author :
Vaquero, Carlos ; Ortega, Alfonso ; Lleida, Eduardo
Author_Institution :
Commun. Technol. Group (GTC), Univ. of Zaragoza, Zaragoza, Spain
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
4532
Lastpage :
4535
Abstract :
This paper addresses the problem of speaker segmentation in two-speaker telephone conversations using an eigenvoice-based factor analysis approach. We present a set of improvements to the speaker segmentation system. First, we study two methods to compensate for intra-session variability, that is, the variability a speaker exhibits within a single session. Second, we propose a method to generate segmentation hypotheses that, combined with a given confidence measure, enables the selection of correct hypotheses, improving overall segmentation performance. The proposed improvements are evaluated on the NIST Speaker Recognition Evaluation 2008 summed-channel test condition, obtaining a 28% relative improvement in speaker segmentation error.
Keywords :
speaker recognition; NIST speaker recognition evaluation; eigenvoice based factor analysis approach; hypothesis generation; intrasession variability compensation; selection strategy; speaker segmentation; two-speaker telephone conversations; Adaptation models; Covariance matrix; Diversity reception; Mel frequency cepstral coefficient; NIST; Robustness; Speaker recognition; Speaker segmentation; hypothesis generation and selection; intra-session variability;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947362
Filename :
5947362