DocumentCode :
763345
Title :
Automatic mood detection and tracking of music audio signals
Author :
Lu, Lie ; Liu, Dan ; Zhang, Hong-Jiang
Author_Institution :
Microsoft Res. Asia, Beijing, China
Volume :
14
Issue :
1
fYear :
2006
Firstpage :
5
Lastpage :
18
Abstract :
Music mood describes the inherent emotional expression of a music clip. It is helpful in music understanding, music retrieval, and some other music-related applications. In this paper, a hierarchical framework is presented to automate the task of mood detection from acoustic music data, by following some music psychological theories in western cultures. The hierarchical framework has the advantage of emphasizing the most suitable features in different detection tasks. Three feature sets, including intensity, timbre, and rhythm are extracted to represent the characteristics of a music clip. The intensity feature set is represented by the energy in each subband, the timbre feature set is composed of the spectral shape features and spectral contrast features, and the rhythm feature set indicates three aspects that are closely related with an individual´s mood response, including rhythm strength, rhythm regularity, and tempo. Furthermore, since mood is usually changeable in an entire piece of classical music, the approach to mood detection is extended to mood tracking for a music piece, by dividing the music into several independent segments, each of which contains a homogeneous emotional expression. Preliminary evaluations indicate that the proposed algorithms produce satisfactory results. On our testing database composed of 800 representative music clips, the average accuracy of mood detection achieves up to 86.3%. We can also on average recall 84.1% of the mood boundaries from nine testing music pieces.
Keywords :
acoustic signal detection; audio signal processing; feature extraction; music; automatic music mood detection; intensity feature set; music audio signal tracking; music clip emotional expression; music retrieval; music understanding; rhythm feature set; rhythm regularity; rhythm strength; spectral contrast features; spectral shape features; tempo; timbre feature set; Acoustic signal detection; Computer vision; Data mining; Mood; Multiple signal classification; Music information retrieval; Psychology; Rhythm; Testing; Timbre; Affective computing; hierarchical framework; mood detection; mood tracking; music emotion; music information retrieval; music mood;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.860344
Filename :
1561259
Link To Document :
بازگشت