DocumentCode :
1458197
Title :
Prediction of the Distribution of Perceived Music Emotions Using Discrete Samples
Author :
Yang, Yi-Hsuan ; Chen, Homer H.
Author_Institution :
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
19
Issue :
7
fYear :
2011
Firstpage :
2184
Lastpage :
2196
Abstract :
Typically, a machine learning model of automatic music emotion recognition is trained to learn the relationship between music features and perceived emotion values. However, simply assigning an emotion value to a clip in the training phase does not work well because the perceived emotion of a clip varies from person to person. To resolve this problem, we propose a novel approach that represents the perceived emotion of a clip as a probability distribution in the emotion plane. In addition, we develop a methodology that predicts the emotion distribution of a clip by estimating the emotion mass at discrete samples of the emotion plane. We also develop model fusion algorithms to integrate different perceptual dimensions of music listening and to enhance the modeling of emotion perception. The effectiveness of the proposed approach is validated through an extensive performance study. An average R2 statistics of 0.5439 for emotion prediction is achieved. We also show how this approach can be applied to enhance our understanding of music emotion.
Keywords :
emotion recognition; information retrieval; learning (artificial intelligence); music; probability; statistical analysis; R2 statistics; automatic music emotion recognition; discrete samples; emotion distribution; emotion perception; emotion plane; machine learning model; model fusion algorithms; music features; music listening; perceived emotion values; perceived music emotions; perceptual dimensions; probability distribution; training phase; Accuracy; Computational modeling; Feature extraction; Kernel; Machine learning; Predictive models; Training; Arousal; emotion distribution prediction; music emotion recognition; regression; subjectivity; valence;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2118752
Filename :
5719548
Link To Document :
بازگشت