DocumentCode :
3528154
Title :
Speaker diarization in meeting audio
Author :
Nwe, Tin Lay ; Sun, Hanwu ; Li, Haizhou ; Rahardja, Susanto
Author_Institution :
Inst. for Infocomm Res. (I2R), A*STAR, Singapore
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4073
Lastpage :
4076
Abstract :
This paper describes speaker diarization system on a NIST Rich Transcription 2007 (RT-07) meeting recognition evaluation data set for the task of multiple distant microphone (MDM). Our implementation includes three components: initial clustering, non-speech removal and cluster purification. Initial clusters are generated using directional of arrival (DOA) information and bootstrap clustering. Multiple GMM modeling for speech/non-speech classification is employed for non-speech removal component. In addition, a novel system fusion strategy using information from receiver operating curve (ROC) is proposed for non-speech removal component. Finally, consensus clustering approach together with iterative GMM clustering method is employed for speaker cluster purification. The system achieves the overall DER of 10.81%.
Keywords :
direction-of-arrival estimation; pattern classification; pattern clustering; speaker recognition; GMM modeling; NIST Rich Transcription 2007 meeting recognition evaluation data set; bootstrap clustering; consensus clustering approach; directional of arrival; meeting audio; multiple distant microphone; nonspeech classification; nonspeech removal component; receiver operating curve; speaker cluster purification; speaker diarization system; system fusion strategy; Adaptive filters; Conferences; Direction of arrival estimation; Erbium; Machine learning; Natural languages; Purification; Speech processing; Sun; Tin; Meetings; clustering methods; modeling; pattern classification; speech processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960523
Filename :
4960523
Link To Document :
بازگشت