Title :
Automatic speaker role labeling in AMI meetings: Recognition of formal and social roles
Author :
Sapru, Ashtosh ; Valente, Fabio
Author_Institution :
Idiap Res. Inst., Martigny, Switzerland
Abstract :
This work aims at investigating the automatic recognition of speaker role in meeting conversations from the AMI corpus. Two types of roles are considered: formal roles, fixed over the meeting duration and recognized at recording level, and social roles related to the way participants interact between themselves, recognized at speaker turn level. Various structural, lexical and prosodic features as well as Dialog Act tags are exhaustively investigated and combined for this purpose. Results reveal an accuracy of 74% in recognizing the speakers formal roles and an accuracy of 66% (percentage of time) in correctly labeling the social roles. Feature analysis reveals that lexical features provide the higher performances in formal/functional role recognition while prosodic features provide the higher performances in social role recognition. Furthermore results reveal that social role recognition in case of rare roles in the corpus can be improved through the use of lexical and Dialog Act information combined over short time windows.
Keywords :
feature extraction; social sciences; speaker recognition; AMI corpus; AMI meetings; automatic speaker role labeling; automatic speaker role recognition; dialog act information; formal recognition; formal roles; meeting conversations; meeting duration; recording level; short time windows; social role recognition; social roles; Accuracy; Boosting; Encoding; Feature extraction; Labeling; Logic gates; Speech; AMI Meetings; Formal and Social Roles; Lexical and Prosodic feature analysis; Speaker Role Labeling; Structural;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289057