DocumentCode :
3585063
Title :
Speaker diarization with plda i-vector scoring and unsupervised calibration
Author :
Sell, Gregory ; Garcia-Romero, Daniel
Author_Institution :
Human Language Technol. Center of Excellence, Johns Hopkins Univ., Baltimore, MD, USA
fYear :
2014
Firstpage :
413
Lastpage :
417
Abstract :
Speaker diarization via unsupervised i-vector clustering has gained popularity in recent years. In this approach, i-vectors are extracted from short clips of speech segmented from a larger multi-speaker conversation and organized into speaker clusters, typically according to their cosine score. In this paper, we propose a system that incorporates probabilistic linear discriminant analysis (PLDA) for i-vector scoring, a method already frequently utilized in speaker recognition tasks, and uses unsupervised calibration of the PLDA scores to determine the clustering stopping criterion. We also demonstrate that denser sampling in the i-vector space with overlapping temporal segments provides a gain in the diarization task. We test our system on the CALLHOME conversational telephone speech corpus, which includes multiple languages and a varying number of speakers, and we show that PLDA scoring outperforms the same system with cosine scoring, and that overlapping segments reduce diarization error rate (DER) as well.
Keywords :
calibration; pattern clustering; sampling methods; speaker recognition; CALLHOME conversational telephone speech corpus; DER; PLDA i-vector scoring; cosine score; denser sampling; diarization error rate; multispeaker conversation; probabilistic linear discriminant analysis; segmented speech; speaker clusters; speaker diarization; speaker recognition tasks; unsupervised calibration; unsupervised i-vector clustering; Calibration; Density estimation robust algorithm; Principal component analysis; Speaker recognition; Speech; Speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
Type :
conf
DOI :
10.1109/SLT.2014.7078610
Filename :
7078610
Link To Document :
بازگشت