DocumentCode :
1324549
Title :
Affective Audio-Visual Words and Latent Topic Driving Model for Realizing Movie Affective Scene Classification
Author :
Irie, Go ; Satou, Takashi ; Kojima, Akira ; Yamasaki, Toshihiko ; Aizawa, Kiyoharu
Author_Institution :
NTT Cyber Solutions Labs., NTT Corp., Yokosuka, Japan
Volume :
12
Issue :
6
fYear :
2010
Firstpage :
523
Lastpage :
535
Abstract :
This paper presents a novel method for movie affective scene classification that outputs the emotion (in the form of labels) that the scene is likely to arouse in viewers. Since the affective preferences of users play an important role in movie selection, affective scene classification has the potential to develop more attractive user-centric movie search and browsing applications. Two main issues in designing movie affective scene classification are considered. One is “how to extract features that are strongly related to the viewer´s emotions”, and the other is “how to map the extracted features to the emotion categories”. For the former, we propose a method to extract emotion-category-specific audio-visual features named affective audio-visual words (AAVWs). For the latter issue, we propose a classification model named latent topic driving model (LTDM). Assuming that viewers´ emotions are dynamically changed by the movie scene sequences, LTDM models emotions as Markovian dynamic systems driven by the sequential stimuli of the movie content. Experiments on 206 movie scenes extracted from 24 movie titles and the corresponding labels of eight emotion categories given by 16 subjects show that our method outperforms conventional approaches in terms of the subject agreement rate.
Keywords :
audio-visual systems; emotion recognition; feature extraction; humanities; image classification; image sequences; video signal processing; affective audio-visual words; attractive user-centric movie search; emotion categories; feature extraction; latent topic driving model; movie affective scene classification; movie scene sequences; Feature extraction; Hidden Markov models; Humans; Image color analysis; Motion pictures; Music; Visualization; Affective audio-visual word; Plutchik´s basic emotions; affective scene classification; latent topic driving model;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2010.2051871
Filename :
5571819
Link To Document :
بازگشت