Title :
Intra-session variability compensation and a hypothesis generation and selection strategy for speaker segmentation
Author :
Vaquero, Carlos ; Ortega, Alfonso ; Lleida, Eduardo
Author_Institution :
Commun. Technol. Group (GTC), Univ. of Zaragoza, Zaragoza, Spain
Abstract :
This paper addresses the problem of speaker segmentation in two-speaker telephone conversations, using an eigenvoice based factor analysis approach. We present a set of improvements in the speaker segmentation system. First, we study two methods to compensate for intra-session variability, that is the variability present in a speaker during a single session. Secondly we propose a method to generate segmentation hypotheses that combined with a given confidence measure, enables the selection of correct hypotheses improving the overall segmentation performance. The proposed improvements are evaluated on the NIST Speaker Recognition Evaluation 2008 summed channel test condition, obtaining 28% relative improvement in terms of speaker segmentation error.
Keywords :
speaker recognition; NIST speaker recognition evaluation; eigenvoice based factor analysis approach; hypothesis generation; intrasession variability compensation; selection strategy; speaker segmentation; two-speaker telephone conversations; Adaptation models; Covariance matrix; Diversity reception; Mel frequency cepstral coefficient; NIST; Robustness; Speaker recognition; Speaker segmentation; hypothesis generation and selection; intra-session variability;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947362