DocumentCode
68427
Title
Dynamic Time Warping for Music Retrieval Using Time Series Modeling of Musical Emotions
Author
Deng, James J. ; Leung, Clement H. C.
Author_Institution
Dept. of Comput. Sci., Hong Kong Baptist Univ., Hong Kong, China
Volume
6
Issue
2
fYear
2015
fDate
April-June 1 2015
Firstpage
137
Lastpage
151
Abstract
Musical signals have rich temporal information not only at the physical level but at the emotion level. The listeners may wish to find music excerpts that have similar sequence patterns of musical emotions with given excerpts. Most state-of-the-art systems for emotion-based music retrieval concentrate on static analysis of musical emotions, and ignore dynamic analysis and modeling of musical emotions overtime. This paper presents a novel approach to perform music retrieval based on time-varying musical emotion dynamics. A three-dimensional musical emotion model-Resonance-Arousal-Valence (RAV)-is used, and emotions of a piece of music are represented by musical emotion dynamics in a time series. A multiple dynamic textures (MDT) model is proposed to model music and emotion dynamics overtime, and expectation maximization (EM) algorithm along with Kalman filtering and smoothing is used to estimate model parameters. Two smoothing methods-Rauch-Tung-Striebel (RTS) and minimum-variance smoothing (MVS)-to robust model are investigated and compared to find an optimal solution to enhance prediction. To find similar sequence patterns of musical emotions, subsequence dynamic time warping (DTW) for emotion dynamics matching is presented. Experimental results demonstrate the benefits of MDT to predict time-varying musical emotions, and our proposed method for music retrieval based on emotion dynamics outperforms retrieval methods based on acoustic features.
Keywords
Kalman filters; acoustic signal processing; emotion recognition; expectation-maximisation algorithm; information retrieval; music; parameter estimation; smoothing methods; time series; DTW; EM algorithm; Kalman filtering; MDT; MVS; RAV; RTS smoothing methods; Rauch-Tung-Striebel smoothing methods; dynamic analysis; emotion dynamics matching; emotion level; emotion-based music retrieval; expectation maximization; minimum-variance smoothing; multiple dynamic textures model; musical emotions; musical signals; parameters estimation; physical level; resonance-arousal-valence; sequence patterns; static analysis; subsequence dynamic time warping; three-dimensional musical emotion model; time series modeling; time-varying musical emotion dynamics; Analytical models; Heuristic algorithms; Kalman filters; Multiple signal classification; Music; Smoothing methods; Time series analysis; EM algorithm; Kalman filter and smoother; Musical emotion; dynamic time warping; multiple dynamic textures;
fLanguage
English
Journal_Title
Affective Computing, IEEE Transactions on
Publisher
ieee
ISSN
1949-3045
Type
jour
DOI
10.1109/TAFFC.2015.2404352
Filename
7042773
Link To Document