Title :
A regularized kernel-based approach to unsupervised audio segmentation
Author :
Harchaoui, Zaïd ; Vallet, Félicien ; Lung-Yut-Fong, Alexandre ; Cappé, Olivier
Author_Institution :
CNRS, LTCI, TELECOM ParisTech, Paris
Abstract :
We introduce a regularized kernel-based rule for unsupervised change detection based on a simpler version of the recently proposed kernel fisher discriminant ratio. Compared to other kernel-based change detectors found in the literature, the proposed test statistic is easier to compute and has a known asymptotic distribution which can effectively be used to set the false alarm rate a priori. This technique is applied for segmenting tracks from TV shows, both for segmentation into semantically homogeneous sections (applause, movie, music, etc.) and for speaker diarization within the speech sections. On these tasks, the proposed approach outperforms other kernel-based tests and is competitive with a standard HMM-based supervised alternative.
Keywords :
audio signal processing; hidden Markov models; signal detection; hidden Markov model; kernel fisher discriminant ratio; regularized kernel-based approach; speaker diarization; unsupervised audio segmentation; unsupervised change detection; Detectors; Kernel; Motion pictures; Speech; Statistical analysis; Statistical distributions; Streaming media; TV; Telecommunications; Testing; Change detection; audio segmentation; kernel methods;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959921