DocumentCode :
2221859
Title :
Unsupervised speaker change detection for broadcast news segmentation
Author :
Jorgensen, Kasper ; Molgaard, Lasse ; Hansen, Lars Kai
Author_Institution :
Inf. & Math. Modelling, Tech. Univ. of Denmark, Lyngby, Denmark
fYear :
2006
fDate :
4-8 Sept. 2006
Firstpage :
1
Lastpage :
5
Abstract :
This paper presents a speaker change detection system for broadcast news segmentation based on a vector quantization (VQ) approach. The system does not make any assumption about the number of speakers or speaker identity. The system uses mel frequency cepstral coefficients and change detection is done using the VQ distortion measure and is evaluated against two other statistics, namely the symmetric Kullback-Leibler (KL2) distance and the so-called `divergence shape distance´. First level alarms are further tested using the VQ distortion. We find that the false alarm rate can be reduced without significant losses in the detection of correct changes. We furthermore evaluate the generalizability of the approach by testing the complete system on an independent set of broadcasts, including a channel not present in the training set.
Keywords :
cepstral analysis; speaker recognition; statistical analysis; vector quantisation; VQ distortion measure; broadcast news segmentation; divergence shape distance; false alarm rate; first level alarms; mel frequency cepstral coefficients; speaker identity; symmetric Kullback-Leibler distance; unsupervised speaker change detection; vector quantization; Abstracts; Atmospheric modeling; Measurement; Robustness; Speech; Speech processing; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2006 14th European
Conference_Location :
Florence
ISSN :
2219-5491
Type :
conf
Filename :
7071485
Link To Document :
بازگشت