Title :
Speaker diarization system for RT07 and RT09 meeting room audio
Author :
Sun, Hanwu ; Ma, Bin ; Khine, Swe Zin Kalayar ; Li, Haizhou
Author_Institution :
Inst. for Infocomm Res. (I2R), A*STAR, Singapore, Singapore
Abstract :
This paper describes an improved speaker diarization system for the Single Distant Microphone (SDM) task in the 2007 and 2009 NIST Rich Transcription Meeting Recognition Evaluations. The system includes three main modules: front-end processing, initial speaker clustering and cluster purification/merging. The front-end processing involves the Wiener filtering for the targeted audio channels and a self-adaptation speech activity detection algorithm. A simple but effective energy based segmentation is applied to chunk the meeting data into small segments to construct the initial clusters. An enhanced purification algorithm is proposed to further improve the performance after the preliminary purification, and the BIC criterion is adopted for the cluster merging. The system achieves competitive overall DERs of 15.67% for RT07 SDM speaker diarization task and 17.34% for RT09 SDM speaker diarization task.
Keywords :
Wiener filters; audio streaming; microphones; pattern clustering; speaker recognition; BIC criterion; RT07; RT09; Wiener filtering; audio channels; front-end processing; meeting room audio; self- adaptation speech activity detection algorithm; single distant microphone; speaker clustering; speaker diarization system; speech segmentation; transcription meeting recognition evaluations; Computer science; Detection algorithms; Histograms; Merging; Microphones; NIST; Purification; Speech; Statistics; Sun; Single Distant Microphone; speaker clustering; speaker diarization; speech activity detection;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495077