DocumentCode
259504
Title
Predicting Evoked Emotions in Video
Author
Ellis, Joseph G. ; Lin, W. Sabrina ; Lin, Ching-Yung ; Chang, Shih-Fu
Author_Institution
Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
fYear
2014
fDate
10-12 Dec. 2014
Firstpage
287
Lastpage
294
Abstract
Understanding how human emotion is evoked from visual content is a task that people perform every day but that machines have not yet mastered. In this work we address the problem of predicting the intended evoked emotion at given points within movie trailers. Movie trailers are carefully curated to elicit distinct and specific emotional responses from viewers, and are therefore well-suited for emotion prediction. However, current emotion recognition systems struggle to bridge the "affective gap", which refers to the difficulty of modeling high-level human emotions with low-level audio and visual features. To address this problem, we propose a mid-level concept feature based on detectable movie shot concepts that we believe to be closely tied to emotions. Examples of these concepts are "Fight", "Rock Music", and "Kiss". We also create two datasets: the first with shot-level concept annotations for learning our concept detectors, and a separate second dataset with emotion annotations taken throughout the trailers using the two-dimensional arousal and valence model of emotion. We report the performance of our concept detectors, and show that by using the output of these detectors as a mid-level representation for the movie shots we are able to predict the evoked emotion throughout a trailer more accurately than by using low-level features.
Keywords
emotion recognition; feature extraction; humanities; video signal processing; concept detectors; emotion recognition systems; emotional responses; evoked emotion prediction; high-level human emotion modeling; low-level audio features; low-level visual features; movie trailers; shot-level concept annotations; visual content; Detectors; Feature extraction; Histograms; Image color analysis; Motion pictures; Pipelines; Visualization; affective computing; audio processing; computer vision; emotion analysis; movie; movie analysis; multimedia; multimodal; signal processing; video processing
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia (ISM), 2014 IEEE International Symposium on
Conference_Location
Taichung
Print_ISBN
978-1-4799-4312-8
Type
conf
DOI
10.1109/ISM.2014.69
Filename
7033041