DocumentCode
3413495
Title
Estimating the dominant person in multi-party conversations using speaker diarization strategies
Author
Hung, Hayley ; Huang, Yan ; Friedland, Gerald ; Gatica-Perez, Daniel
Author_Institution
lDIAP, Martigny
fYear
2008
fDate
March 31 2008-April 4 2008
Firstpage
2197
Lastpage
2200
Abstract
In this paper, we apply speaker diarization strategies from a single source to the task of estimating the dominant person in a group meeting. Previous work has shown that speaking length is strongly correlated with perceived dominance. Here we investigate this in more depth by considering two dominance tasks where there is full and majority agreement amongst ground-truth annotators. In addition, we investigate how 24 different speed-up and algorithmic strategies, and source types lead to interesting outcomes when applied to dominance estimation. We obtained the best performance of 77% using our slowest scheme and a single distant microphone (SDM). Within the top 3 out of 24 performing experiments in both dominance tasks, we show that we can use the furthest SDM, with no prior knowledge of the number of speakers and the fastest diarization scheme, which performs 1.3 times faster than real-time.
Keywords
blind source separation; speaker recognition; multi-party conversations; single distant microphone; speaker diarization strategies; Computational complexity; Computer science; Feature extraction; Humans; Length measurement; Microphone arrays; Particle measurements; Performance evaluation; Speech enhancement; Testing; dominance modelling; speaker diarization;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location
Las Vegas, NV
ISSN
1520-6149
Print_ISBN
978-1-4244-1483-3
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2008.4518080
Filename
4518080
Link To Document