DocumentCode :
1121237
Title :
Speakers Role Recognition in Multiparty Audio Recordings Using Social Network Analysis and Duration Distribution Modeling
Author :
Vinciarelli, Alessandro
Author_Institution :
IDIAP Res. Inst., Martigny
Volume :
9
Issue :
6
fYear :
2007
Firstpage :
1215
Lastpage :
1226
Abstract :
This paper presents two approaches for speaker role recognition in multiparty audio recordings. The experiments are performed over a corpus of 96 radio bulletins corresponding to roughly 19 h of material. Each recording involves, on average, 11 speakers, each playing one of six predefined roles. Both approaches start by automatically segmenting the recordings into single-speaker segments, but perform role recognition using different techniques. The first approach is based on Social Network Analysis; the second relies on the distribution of intervention durations across speakers. The two approaches are used both separately and in combination, and the results show that around 85% of the recording time can be labeled correctly in terms of role.
Keywords :
audio signal processing; hidden Markov models; pattern clustering; source separation; speech recognition; audio indexing; automatic recording segmentation; duration distribution modeling; intervention duration distribution; multiparty audio recording; radio bulletins; single speaker segment; social network analysis; sociometry; speaker clustering; speaker role recognition; speaker segmentation; stochastic processes
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
IEEE
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2007.902882
Filename :
4303037