Title :
Novel strategies for reducing the false alarm rate in a speaker segmentation system
Author :
Lopez-Otero, Paula ; Docio-Fernandez, Laura ; Garcia-Mateo, Carmen
Author_Institution :
Dept. of Signal Theor. & Commun., Univ. of Vigo, Vigo, Spain
Abstract :
Reliable speaker segmentation is critical in many applications in the speech processing domain. In this paper, we extend our earlier formulation for false alarm reduction in a typical state-of-art speaker segmentation system. Specifically, we present two novel strategies for reducing the false alarm rate with a minimal impact on the true speaker change detection rate. One of the new strategies rejects, given a discard probability, those changes that are suspicious of being false alarms because of their low ΔBIC value; and the other one assumes that the occurrence of changes constitute a Poisson process, so changes will be discarded with a probability that follows a Poisson cumulative density function. Our experiments show the improvements obtained with each false alarm reduction approach using the Spanish Parliament Sessions defined for the 2006 TC-STAR Automatic Speech Recognition evaluation campaign.
Keywords :
speaker recognition; speech processing; stochastic processes; Poisson cumulative density function; false alarm rate reduction; speaker change detection rate; speaker segmentation system; speech processing; Audio recording; Automatic speech recognition; Broadcasting; Density functional theory; Navigation; Reliability theory; Signal processing; Speech processing; Speech recognition; Telecommunications; Audio segmentation; speaker change detection; speaker diarization; speaker segmentation;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495091