DocumentCode
652774
Title
Stress Detection from Audio on Multiple Window Analysis Size in a Public Speaking Task
Author
Soury, Mariette ; Devillers, L.
Author_Institution
LIMSI, Orsay, France
fYear
2013
fDate
2-5 Sept. 2013
Firstpage
529
Lastpage
533
Abstract
Speech production modifications are one of the many indications of stress in humans. A job interview simulation task permitted the collection of a multimodal corpus, including physiological data. Physiological cues of stress are reliable on long periods, and require invasive sensors. Human voice variations have been proved to be a non-invasive stress cue. In this paper, we focus on a frame-wise detection of stress on several window analysis sizes and analyze the behavior of different audio features classes. We trained our system on 19 subjects, and test it on 10 other subjects. Our best system obtains a 71.9 percent Unweighted Average Recall on 5s windows.
Keywords
behavioural sciences computing; speech processing; 5s windows; audio features classes; frame-wise stress detection; human voice variations; invasive sensors; job interview simulation task; multimodal corpus; multiple window analysis size; noninvasive stress cue; physiological data; physiological stress cues; public speaking task; speech production modifications; stress detection; unweighted average recall; Feature extraction; Jitter; Physiology; Protocols; Public speaking; Speech; Stress; prosodic cues; public speaking task; stress;
fLanguage
English
Publisher
ieee
Conference_Titel
Affective Computing and Intelligent Interaction (ACII), 2013 Humaine Association Conference on
Conference_Location
Geneva
ISSN
2156-8103
Type
conf
DOI
10.1109/ACII.2013.93
Filename
6681484
Link To Document