DocumentCode :
1469236
Title :
Unifying Low-Level and High-Level Music Similarity Measures
Author :
Bogdanov, Dmitry ; Serrà, Joan ; Wack, Nicolas ; Herrera, Perfecto ; Serra, Xavier
Author_Institution :
Music Technol. Group, Univ. Pompeu Fabra, Barcelona, Spain
Volume :
13
Issue :
4
fYear :
2011
Firstpage :
687
Lastpage :
701
Abstract :
Measuring music similarity is essential for multimedia retrieval. For music items, this task can be regarded as obtaining a suitable distance measurement between songs defined on a certain feature space. In this paper, we propose three of such distance measures based on the audio content: first, a low-level measure based on tempo-related description; second, a high-level semantic measure based on the inference of different musical dimensions by support vector machines. These dimensions include genre, culture, moods, instruments, rhythm, and tempo annotations. Third, a hybrid measure which combines the above-mentioned distance measures with two existing low-level measures: a Euclidean distance based on principal component analysis of timbral, temporal, and tonal descriptors, and a timbral distance based on single Gaussian Mel-frequency cepstral coefficient (MFCC) modeling. We evaluate our proposed measures against a number of baseline measures. We do this objectively based on a comprehensive set of music collections, and subjectively based on listeners´ ratings. Results show that the proposed methods achieve accuracies comparable to the baseline approaches in the case of the tempo and classifier-based measures. The highest accuracies are obtained by the hybrid distance. Furthermore, the proposed classifier-based approach opens up the possibility to explore distance measures that are based on semantic notions.
Keywords :
cepstral analysis; distance measurement; geometry; information retrieval; multimedia computing; music; principal component analysis; support vector machines; Euclidean distance; Gaussian mel-frequency cepstral coefficient modeling; audio content:; culture; distance measurement; genre; instruments; moods; multimedia retrieval; music similarity measures; musical dimensions; principal component analysis; rhythm; support vector machines; tempo annotations; tempo-related description; temporal; timbral distance; tonal descriptors; Euclidean distance; Mel frequency cepstral coefficient; Mood; Principal component analysis; Semantics; Support vector machines; Distance measurement; information retrieval; knowledge acquisition; multimedia computing; multimedia databases; music;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2011.2125784
Filename :
5728926
Link To Document :
بازگشت