Title :
Robust speaker change detection
Author :
Ajmera, Jitendra ; McCowan, Iain ; Bourlard, Hervé
Author_Institution :
IDIAP, Martigny, Switzerland
Abstract :
Most commonly used criteria for speaker change detection like log likelihood ratio (LLR) and Bayesian information criterion (BIC) have an adjustable threshold/penalty parameter to make speaker change decisions. These parameters are not always robust to different acoustic conditions and have to be tuned. In this letter, we present a criterion which can be used to identify speaker changes in an audio stream without such tuning. The criterion consists of calculating the LLR of two models with the same number of parameters. Results on the Hub4 1997 evaluation set indicate that we achieve a performance comparable to using BIC with optimal penalty term.
Keywords :
audio systems; LLR; audio stream; log likelihood ratio calculation; robust speaker change detection; Acoustic signal detection; Bayesian methods; Data mining; Feature extraction; Loudspeakers; Maximum likelihood detection; Maximum likelihood estimation; Robustness; Streaming media; Testing; BIC; Bayesian Information Criterion; LLR; Log Likelihood Ratio; speaker change detection;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2004.831666