DocumentCode :
3488120
Title :
Audio segmentation, classification and clustering in a broadcast news task
Author :
Meinedo, Hugo ; Neto, João
Author_Institution :
Spoken Language Syst. Lab., Instituto Superior Tecnico, Lisboa, Portugal
Volume :
2
fYear :
2003
fDate :
6-10 April 2003
Abstract :
The paper describes our work on the development of an audio segmentation, classification and clustering system applied to a broadcast news task for the European Portuguese language. We developed a new algorithm for audio segmentation that is both accurate and uses fewer computational resources than other approaches. Our speaker clustering module uses a modified BIC (Bayesian information criterion) algorithm which performs substantially better than the standard symmetric Kullback-Liebler, KL2, and is much faster than the full BIC. Finally, we developed a scheme for tagging certain speaker clusters (anchors) using trained cluster models. A series of tests were conducted showing the advantage of the new algorithms. This system is part of a prototype system that is daily processing the main news show of the national Portuguese broadcaster.
Keywords :
Bayes methods; pattern classification; pattern clustering; signal classification; speaker recognition; speech recognition; Bayesian information criterion; Portuguese language; audio classification; audio segmentation; broadcast news; news anchor detection; speaker clustering; symmetric Kullback-Liebler algorithm; Acoustic signal detection; Broadcasting; Clustering algorithms; Detection algorithms; Laboratories; Loudspeakers; Natural languages; Speech recognition; Tagging; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1202280
Filename :
1202280
Link To Document :
بازگشت