• DocumentCode
    763345
  • Title
    Automatic mood detection and tracking of music audio signals
  • Author
    Lu, Lie ; Liu, Dan ; Zhang, Hong-Jiang
  • Author_Institution
    Microsoft Res. Asia, Beijing, China
  • Volume
    14
  • Issue
    1
  • fYear
    2006
  • Firstpage
    5
  • Lastpage
    18
  • Abstract
    Music mood describes the inherent emotional expression of a music clip. It is helpful in music understanding, music retrieval, and some other music-related applications. In this paper, a hierarchical framework is presented to automate the task of mood detection from acoustic music data, by following some music psychological theories in western cultures. The hierarchical framework has the advantage of emphasizing the most suitable features in different detection tasks. Three feature sets, including intensity, timbre, and rhythm are extracted to represent the characteristics of a music clip. The intensity feature set is represented by the energy in each subband, the timbre feature set is composed of the spectral shape features and spectral contrast features, and the rhythm feature set indicates three aspects that are closely related with an individual's mood response, including rhythm strength, rhythm regularity, and tempo. Furthermore, since mood is usually changeable in an entire piece of classical music, the approach to mood detection is extended to mood tracking for a music piece, by dividing the music into several independent segments, each of which contains a homogeneous emotional expression. Preliminary evaluations indicate that the proposed algorithms produce satisfactory results. On our testing database composed of 800 representative music clips, the average accuracy of mood detection achieves up to 86.3%. We can also on average recall 84.1% of the mood boundaries from nine testing music pieces.
  • Keywords
    acoustic signal detection; audio signal processing; feature extraction; music; automatic music mood detection; intensity feature set; music audio signal tracking; music clip emotional expression; music retrieval; music understanding; rhythm feature set; rhythm regularity; rhythm strength; spectral contrast features; spectral shape features; tempo; timbre feature set; Acoustic signal detection; Computer vision; Data mining; Mood; Multiple signal classification; Music information retrieval; Psychology; Rhythm; Testing; Timbre; Affective computing; hierarchical framework; mood detection; mood tracking; music emotion; music information retrieval; music mood;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TSA.2005.860344
  • Filename
    1561259