DocumentCode
763345
Title
Automatic mood detection and tracking of music audio signals
Author
Lu, Lie ; Liu, Dan ; Zhang, Hong-Jiang
Author_Institution
Microsoft Res. Asia, Beijing, China
Volume
14
Issue
1
fYear
2006
Firstpage
5
Lastpage
18
Abstract
Music mood describes the inherent emotional expression of a music clip. It is helpful in music understanding, music retrieval, and some other music-related applications. In this paper, a hierarchical framework is presented to automate the task of mood detection from acoustic music data, by following some music psychological theories in western cultures. The hierarchical framework has the advantage of emphasizing the most suitable features in different detection tasks. Three feature sets, including intensity, timbre, and rhythm are extracted to represent the characteristics of a music clip. The intensity feature set is represented by the energy in each subband, the timbre feature set is composed of the spectral shape features and spectral contrast features, and the rhythm feature set indicates three aspects that are closely related with an individual´s mood response, including rhythm strength, rhythm regularity, and tempo. Furthermore, since mood is usually changeable in an entire piece of classical music, the approach to mood detection is extended to mood tracking for a music piece, by dividing the music into several independent segments, each of which contains a homogeneous emotional expression. Preliminary evaluations indicate that the proposed algorithms produce satisfactory results. On our testing database composed of 800 representative music clips, the average accuracy of mood detection achieves up to 86.3%. We can also on average recall 84.1% of the mood boundaries from nine testing music pieces.
Keywords
acoustic signal detection; audio signal processing; feature extraction; music; automatic music mood detection; intensity feature set; music audio signal tracking; music clip emotional expression; music retrieval; music understanding; rhythm feature set; rhythm regularity; rhythm strength; spectral contrast features; spectral shape features; tempo; timbre feature set; Acoustic signal detection; Computer vision; Data mining; Mood; Multiple signal classification; Music information retrieval; Psychology; Rhythm; Testing; Timbre; Affective computing; hierarchical framework; mood detection; mood tracking; music emotion; music information retrieval; music mood;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TSA.2005.860344
Filename
1561259
Link To Document