DocumentCode
3530857
Title
Contrasting emotion-bearing laughter types in multiparticipant vocal activity detection for meetings
Author
Laskowski, Kornel
Author_Institution
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA
fYear
2009
fDate
19-24 April 2009
Firstpage
4765
Lastpage
4768
Abstract
The detection of laughter in conversational interaction presents an important challenge in meeting understanding, important primarily because laughter is predictive of the emotional state of participants. We present evidence which suggests that ignoring unvoiced laughter improves the prediction of emotional involvement in collocated speech, making a case for the distinction between voiced and unvoiced laughter during laughter detection. Our experiments show that the exclusion of unvoiced laughter during laughter model training as well as its explicit modeling lead to detection scores for voiced laughter which are much higher than those otherwise obtained for all laughter. Furthermore, duration modeling is shown to be a more effective means of improving precision than interaction modeling through joint-participant decoding. Taken together, the final detection F-scores we present for voiced laughter on our development set comprise a 20% reduction of error, relative to F-scores for all laughter reported in previous work, and 6% and 22% relative reductions in error on two larger datasets unseen during development.
Keywords
decoding; emotion recognition; human computer interaction; speech coding; speech recognition; collocated speech; conversational interaction modeling; duration modeling; emotion-bearing laughter type contrasting; joint-participant decoding; laughter model training; meeting multiparticipant vocal activity detection; meeting understanding; nonvoiced laughter detection; voiced laughter detection; Decoding; Detectors; Error analysis; Microphones; Performance analysis; Rain; Speech; Laughter detection; Meetings; Speech detection; Vocal interaction;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4960696
Filename
4960696
Link To Document