مرکز منطقه ای اطلاع رساني علوم و فناوري - Modeling the Affective Content of Music with a Gaussian Mixture Model

DocumentCode :

45697

Title :

Modeling the Affective Content of Music with a Gaussian Mixture Model

Author :

Ju-Chiang Wang ; Yi-Hsuan Yang ; Hsin-Min Wang ; Shyh-Kang Jeng

Author_Institution :

Inst. of Inf. Sci., Taipei, Taiwan

Volume :

Issue :

fYear :

2015

fDate :

Jan.-March 1 2015

Firstpage :

Lastpage :

Abstract :

Modeling the association between music and emotion has been considered important for music information retrieval and affective human computer interaction. This paper presents a novel generative model called acoustic emotion Gaussians (AEG) for computational modeling of emotion. Instead of assigning a music excerpt with a deterministic (hard) emotion label, AEG treats the affective content of music as a (soft) probability distribution in the valence-arousal space and parameterizes it with a Gaussian mixture model (GMM). In this way, the subjective nature of emotion perception is explicitly modeled. Specifically, AEG employs two GMMs to characterize the audio and emotion data. The fitting algorithm of the GMM parameters makes the model learning process transparent and interpretable. Based on AEG, a probabilistic graphical structure for predicting the emotion distribution from music audio data is also developed. A comprehensive performance study over two emotion-labeled datasets demonstrates that AEG offers new insights into the relationship between music and emotion (e.g., to assess the “affective diversity” of a corpus) and represents an effective means of emotion modeling. Readers can easily implement AEG via the publicly available codes. As the AEG model is generic, it holds the promise of analyzing any signal that carries affective or other highly subjective information.

Keywords :

Gaussian processes; acoustic signal processing; audio signal processing; emotion recognition; human computer interaction; information retrieval; learning (artificial intelligence); mixture models; music; probability; AEG; Gaussian mixture model; acoustic emotion Gaussians; affective diversity assessment; affective human computer interaction; audio data characterization; emotion computational modeling; emotion data characterization; emotion distribution prediction; emotion perception; emotion-labeled datasets; fitting algorithm; generative model; model learning process; music affective content modeling; music information retrieval; probabilistic graphical structure; probability distribution; valence-arousal space; Adaptation models; Computational modeling; Data models; Music; Predictive models; Probabilistic logic; Gaussian mixture model; Music information retrieval; arousal; music emotion recognition; subjectivity; valence;

fLanguage :

English

Journal_Title :

Affective Computing, IEEE Transactions on

Publisher :

ieee

ISSN :

1949-3045

Type :

jour

DOI :

10.1109/TAFFC.2015.2397457

Filename :

7029060

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=45697