DocumentCode :
1041593
Title :
Robust speaker change detection
Author :
Ajmera, Jitendra ; McCowan, Iain ; Bourlard, Hervé
Author_Institution :
IDIAP, Martigny, Switzerland
Volume :
11
Issue :
8
fYear :
2004
Firstpage :
649
Lastpage :
651
Abstract :
Most commonly used criteria for speaker change detection like log likelihood ratio (LLR) and Bayesian information criterion (BIC) have an adjustable threshold/penalty parameter to make speaker change decisions. These parameters are not always robust to different acoustic conditions and have to be tuned. In this letter, we present a criterion which can be used to identify speaker changes in an audio stream without such tuning. The criterion consists of calculating the LLR of two models with the same number of parameters. Results on the Hub4 1997 evaluation set indicate that we achieve a performance comparable to using BIC with optimal penalty term.
Keywords :
audio systems; LLR; audio stream; log likelihood ratio calculation; robust speaker change detection; Acoustic signal detection; Bayesian methods; Data mining; Feature extraction; Loudspeakers; Maximum likelihood detection; Maximum likelihood estimation; Robustness; Streaming media; Testing; BIC; Bayesian Information Criterion; LLR; Log Likelihood Ratio; speaker change detection;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2004.831666
Filename :
1316876
Link To Document :
بازگشت