• DocumentCode
    259504
  • Title

    Predicting Evoked Emotions in Video

  • Author

    Ellis, Joseph G. ; Lin, W. Sabrina ; Ching-Yung Lin ; Shih-Fu Chang

  • Author_Institution
    Dept. of Electr. Eng., Columbia Univ., New York, NY, USA
  • fYear
    2014
  • fDate
    10-12 Dec. 2014
  • Firstpage
    287
  • Lastpage
    294
  • Abstract
    Understanding how human emotion is evoked from visual content is a task that we as people do every day, but machines have not yet mastered. In this work we address the problem of predicting the intended evoked emotion at given points within movie trailers. Movie Trailers are carefully curated to elicit distinct and specific emotional responses from viewers, and are therefore well-suited for emotion prediction. However, current emotion recognition systems struggle to bridge the "affective gap", which refers to the difficulty in modeling high-level human emotions with low-level audio and visual features. To address this problem, we propose a mid-level concept feature, which is based on detectable movie shot concepts which we believe to be tied closely to emotions. Examples of these concepts are "Fight", "Rock Music", and "Kiss". We also create 2 datasets, the first with shot-level concept annotations for learning our concept detectors, and a separate, second dataset with emotion annotations taken throughout the trailers using the two dimensional arousal and valence model for emotion annotation. We report the performance of our concept detectors, and show that by using the output of these detectors as a mid-level representation for the movie shots we are able to more accurately predict the evoked emotion throughout a trailer than by using low-level features.
  • Keywords
    emotion recognition; feature extraction; humanities; video signal processing; concept detectors; emotion recognition systems; emotional responses; evoked emotion prediction; high-level human emotion modeling; low-level audio features; low-level visual features; movie trailers; shot-level concept annotations; visual content; Detectors; Feature extraction; Histograms; Image color analysis; Motion pictures; Pipelines; Visualization; affective computing; audio processing; computer vision; emotion analysis; movie; movie analysis; mulitmedia; multimodal; signal processing; video processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia (ISM), 2014 IEEE International Symposium on
  • Conference_Location
    Taichung
  • Print_ISBN
    978-1-4799-4312-8
  • Type

    conf

  • DOI
    10.1109/ISM.2014.69
  • Filename
    7033041