DocumentCode
177425
Title
Information bottleneck based speaker diarization of meetings using non-speech as side information
Author
Yella, Sree Harsha ; Bourlard, Herve
Author_Institution
Idiap Res. Inst., Martigny, Switzerland
fYear
2014
fDate
4-9 May 2014
Firstpage
96
Lastpage
100
Abstract
Background noise and errors in speech/non-speech detection cause significant degradation to the output of a speaker diarization system. In a typical speaker diarization system, non-speech segments are excluded prior to unsupervised clustering. In the current study, we exploit the information present in the non-speech segments of a recording to improve the output of the speaker diarization system based on information bottleneck framework. This is achieved by providing information from non-speech segments as side (irrelevant) information to information bottleneck based clustering. Experiments on meeting recordings from RT 06, 07, 09, evaluation sets have shown that the proposed method decreases the diarization error rate by around 18% relative to the baseline speaker diarization system based on information bottleneck framework. Comparison with a state of the art system based on HMM/GMM framework shows that the proposed method significantly decreases the gap in performance between the information bottleneck system and HMM/GMM system.
Keywords
hidden Markov models; speaker recognition; GMM; HMM; baseline speaker diarization system; diarization error rate; information bottleneck based clustering; information bottleneck based speaker diarization; nonspeech segments; side information; speaker diarization system; Hidden Markov models; Meetings; Mutual information; NIST; Noise measurement; Speech; Speech processing; clustering; information bottleneck; side information; speaker diarization; spontaneous meeting recordings;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6853565
Filename
6853565
Link To Document