• DocumentCode
    1324549
  • Title

    Affective Audio-Visual Words and Latent Topic Driving Model for Realizing Movie Affective Scene Classification

  • Author

    Irie, Go ; Satou, Takashi ; Kojima, Akira ; Yamasaki, Toshihiko ; Aizawa, Kiyoharu

  • Author_Institution
    NTT Cyber Solutions Labs., NTT Corp., Yokosuka, Japan
  • Volume
    12
  • Issue
    6
  • fYear
    2010
  • Firstpage
    523
  • Lastpage
    535
  • Abstract
    This paper presents a novel method for movie affective scene classification that outputs the emotion (in the form of labels) that the scene is likely to arouse in viewers. Since the affective preferences of users play an important role in movie selection, affective scene classification has the potential to develop more attractive user-centric movie search and browsing applications. Two main issues in designing movie affective scene classification are considered. One is “how to extract features that are strongly related to the viewer´s emotions”, and the other is “how to map the extracted features to the emotion categories”. For the former, we propose a method to extract emotion-category-specific audio-visual features named affective audio-visual words (AAVWs). For the latter issue, we propose a classification model named latent topic driving model (LTDM). Assuming that viewers´ emotions are dynamically changed by the movie scene sequences, LTDM models emotions as Markovian dynamic systems driven by the sequential stimuli of the movie content. Experiments on 206 movie scenes extracted from 24 movie titles and the corresponding labels of eight emotion categories given by 16 subjects show that our method outperforms conventional approaches in terms of the subject agreement rate.
  • Keywords
    audio-visual systems; emotion recognition; feature extraction; humanities; image classification; image sequences; video signal processing; affective audio-visual words; attractive user-centric movie search; emotion categories; feature extraction; latent topic driving model; movie affective scene classification; movie scene sequences; Feature extraction; Hidden Markov models; Humans; Image color analysis; Motion pictures; Music; Visualization; Affective audio-visual word; Plutchik´s basic emotions; affective scene classification; latent topic driving model;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2010.2051871
  • Filename
    5571819