DocumentCode :
45697
Title :
Modeling the Affective Content of Music with a Gaussian Mixture Model
Author :
Ju-Chiang Wang ; Yi-Hsuan Yang ; Hsin-Min Wang ; Shyh-Kang Jeng
Author_Institution :
Inst. of Inf. Sci., Taipei, Taiwan
Volume :
6
Issue :
1
fYear :
2015
fDate :
Jan.-March 1 2015
Firstpage :
56
Lastpage :
68
Abstract :
Modeling the association between music and emotion has been considered important for music information retrieval and affective human computer interaction. This paper presents a novel generative model called acoustic emotion Gaussians (AEG) for computational modeling of emotion. Instead of assigning a music excerpt with a deterministic (hard) emotion label, AEG treats the affective content of music as a (soft) probability distribution in the valence-arousal space and parameterizes it with a Gaussian mixture model (GMM). In this way, the subjective nature of emotion perception is explicitly modeled. Specifically, AEG employs two GMMs to characterize the audio and emotion data. The fitting algorithm of the GMM parameters makes the model learning process transparent and interpretable. Based on AEG, a probabilistic graphical structure for predicting the emotion distribution from music audio data is also developed. A comprehensive performance study over two emotion-labeled datasets demonstrates that AEG offers new insights into the relationship between music and emotion (e.g., to assess the “affective diversity” of a corpus) and represents an effective means of emotion modeling. Readers can easily implement AEG via the publicly available codes. As the AEG model is generic, it holds the promise of analyzing any signal that carries affective or other highly subjective information.
Keywords :
Gaussian processes; acoustic signal processing; audio signal processing; emotion recognition; human computer interaction; information retrieval; learning (artificial intelligence); mixture models; music; probability; AEG; Gaussian mixture model; acoustic emotion Gaussians; affective diversity assessment; affective human computer interaction; audio data characterization; emotion computational modeling; emotion data characterization; emotion distribution prediction; emotion perception; emotion-labeled datasets; fitting algorithm; generative model; model learning process; music affective content modeling; music information retrieval; probabilistic graphical structure; probability distribution; valence-arousal space; Adaptation models; Computational modeling; Data models; Music; Predictive models; Probabilistic logic; Gaussian mixture model; Music information retrieval; arousal; music emotion recognition; subjectivity; valence;
fLanguage :
English
Journal_Title :
Affective Computing, IEEE Transactions on
Publisher :
ieee
ISSN :
1949-3045
Type :
jour
DOI :
10.1109/TAFFC.2015.2397457
Filename :
7029060
Link To Document :
بازگشت