DocumentCode
3716206
Title
Adaptive and online speaker diarization for meeting data
Author
Giovanni Soldi;Christophe Beaugeant;Nicholas Evans
Author_Institution
Multimedia Communications Department, EURECOM, Sophia Antipolis, France
fYear
2015
Firstpage
2112
Lastpage
2116
Abstract
Speaker diarization aims to determine `who spoke when´ in a given audio stream. Different applications, such as document structuring or information retrieval have led to the exploration of speaker diarization in many different domains, from broadcast news to lectures, phone conversations and meetings. Almost all current diarization systems are offline and ill-suited to the growing need for online or real-time diarization, stemming from the increasing popularity of powerful, mobile smart devices. While a small number of such systems have been reported, truly online diarization systems for challenging and highly spontaneous meeting data are lacking. This paper reports our work to develop an adaptive and online diarization system using the NIST Rich Transcription meetings corpora. While not dissimilar to those previously reported for less challenging domains, high diarization error rates illustrate the challenge ahead and lead to some ideas to improve performance through future research.
Keywords
"Speech","Adaptation models","Density estimation robust algorithm","NIST","Acoustics","Data models","Computational modeling"
Publisher
ieee
Conference_Titel
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN
2076-1465
Type
conf
DOI
10.1109/EUSIPCO.2015.7362757
Filename
7362757
Link To Document