DocumentCode :
3423862
Title :
Speaker diarization of French broadcast news
Author :
Gupta, Vishwa ; Boulianne, Gilles ; Kenny, Patrick ; Ouellet, Pierre ; Dumouchel, Pierre
Author_Institution :
Centre de Rech. Inf. de Montreal (CRIM), Montreal, QC
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4365
Lastpage :
4368
Abstract :
We report results on speaker diarization of French broadcast news and talk shows on current affairs. This speaker diarization process is a multistage segmentation and clustering system. One of the stages is agglomerative clustering using state-of-the-art speaker identification methods (SID). For the QMMs used in this stage, we tried many different feature parameters, including MFCCs, Gaussianized MFCCs, Gaussianized MFCCs with cepstral mean subtraction, and Gaussianized MFCCs with cepstral mean substraction containing only frames with high energy. We found that this last set of feature parameters gave the best results. Compared to Gaussianized MFCCs, these features reduced the diarization error rate (DER) by 12% on a development set and by 19% on a test set. We also combined clusters resulting from Gaussianized and non-Gaussianized feature sets. This cluster combination resulted in another 4% reduction in DER for both the development and the test sets. The best DER we have achieved is 15.4% on the development set, and 14.5% on the test set.
Keywords :
Gaussian processes; cepstral analysis; pattern clustering; speaker recognition; French broadcast news; GMM; Gaussianized MFCC; agglomerative clustering; cepstral mean subtraction; multistage speaker segmentation; speaker diarization; speaker identification; Acoustic noise; Acoustic signal detection; Cepstral analysis; Density estimation robust algorithm; Error analysis; Gaussian processes; Multimedia systems; Radio broadcasting; TV broadcasting; Testing; BIC clustering; SID clustering; speaker diarization; speaker segmentation and clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518622
Filename :
4518622
Link To Document :
بازگشت