Title :
Dynamic Time Warping for Music Retrieval Using Time Series Modeling of Musical Emotions
Author :
Deng, James J. ; Leung, Clement H. C.
Author_Institution :
Dept. of Comput. Sci., Hong Kong Baptist Univ., Hong Kong, China
fDate :
April-June 1 2015
Abstract :
Musical signals have rich temporal information not only at the physical level but at the emotion level. The listeners may wish to find music excerpts that have similar sequence patterns of musical emotions with given excerpts. Most state-of-the-art systems for emotion-based music retrieval concentrate on static analysis of musical emotions, and ignore dynamic analysis and modeling of musical emotions overtime. This paper presents a novel approach to perform music retrieval based on time-varying musical emotion dynamics. A three-dimensional musical emotion model-Resonance-Arousal-Valence (RAV)-is used, and emotions of a piece of music are represented by musical emotion dynamics in a time series. A multiple dynamic textures (MDT) model is proposed to model music and emotion dynamics overtime, and expectation maximization (EM) algorithm along with Kalman filtering and smoothing is used to estimate model parameters. Two smoothing methods-Rauch-Tung-Striebel (RTS) and minimum-variance smoothing (MVS)-to robust model are investigated and compared to find an optimal solution to enhance prediction. To find similar sequence patterns of musical emotions, subsequence dynamic time warping (DTW) for emotion dynamics matching is presented. Experimental results demonstrate the benefits of MDT to predict time-varying musical emotions, and our proposed method for music retrieval based on emotion dynamics outperforms retrieval methods based on acoustic features.
Keywords :
Kalman filters; acoustic signal processing; emotion recognition; expectation-maximisation algorithm; information retrieval; music; parameter estimation; smoothing methods; time series; DTW; EM algorithm; Kalman filtering; MDT; MVS; RAV; RTS smoothing methods; Rauch-Tung-Striebel smoothing methods; dynamic analysis; emotion dynamics matching; emotion level; emotion-based music retrieval; expectation maximization; minimum-variance smoothing; multiple dynamic textures model; musical emotions; musical signals; parameters estimation; physical level; resonance-arousal-valence; sequence patterns; static analysis; subsequence dynamic time warping; three-dimensional musical emotion model; time series modeling; time-varying musical emotion dynamics; Analytical models; Heuristic algorithms; Kalman filters; Multiple signal classification; Music; Smoothing methods; Time series analysis; EM algorithm; Kalman filter and smoother; Musical emotion; dynamic time warping; multiple dynamic textures;
Journal_Title :
Affective Computing, IEEE Transactions on
DOI :
10.1109/TAFFC.2015.2404352