DocumentCode
63489
Title
Music Annotation and Retrieval using Unlabeled Exemplars: Correlation and Sparse Codes
Author
Ping-Keng Jao ; Yi-Hsuan Yang
Author_Institution
Res. Center for Inf. Technol. Innovation, Taipei, Taiwan
Volume
22
Issue
10
fYear
2015
fDate
Oct. 2015
Firstpage
1771
Lastpage
1775
Abstract
Tagging music signals with semantic labels such as genres, moods and instruments is important for content-based music retrieval and recommendation. While considerable effort has been made, automatic music annotation is still considered challenging due to the difficulty of extracting good audio features that capture the characteristics of different tags. To address this issue, we present in this letter two exemplar-based approaches that represent the content of a music clip by referring to a large set of unlabeled audio exemplars. The first approach represents a music clip by the set of audio exemplars that is highly correlated with the short-time feature vectors of the clip, whereas the second approach represents a music clip as sparse linear combinations of its short-time feature vectors over the audio exemplars. Music annotation is then performed by learning the relevance of the audio examples to different tags using labeled data. These two approaches effectively capitalize the availability of unlabeled data to explore the commonality of music signals to find out tag-specific acoustic patterns, without domain knowledge and feature design. Evaluation on the CAL10k music genre tagging dataset for tag-based music retrieval shows that, with thousands of unlabeled audio examples randomly drawn from the Million Song Dataset, the proposed approaches lead to remarkably higher precision rates than existing approaches.
Keywords
content-based retrieval; feature extraction; music; CAL10k music genre tagging dataset; Million Song Dataset; audio feature extraction; automatic music annotation; content-based music recommendation; content-based music retrieval; correlation codes; exemplar-based approach; genre labels; instrument labels; mood labels; music clip; music signal tagging; semantic labels; short-time feature vectors; sparse codes; tag-based music retrieval; tag-specific acoustic patterns; Correlation; Dictionaries; Feature extraction; Multiple signal classification; Signal processing algorithms; Tagging; Training; Music tagging; retrieval; sparse representation;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2015.2433061
Filename
7106493
Link To Document