DocumentCode :
1351826
Title :
A Generative Context Model for Semantic Music Annotation and Retrieval
Author :
Miotto, Riccardo ; Lanckriet, Gert
Author_Institution :
Dept. of Inf. Eng., Univ. of Padova, Padova, Italy
Volume :
20
Issue :
4
fYear :
2012
fDate :
5/1/2012 12:00:00 AM
Firstpage :
1096
Lastpage :
1108
Abstract :
While a listener may derive semantic associations for audio clips from direct auditory cues (e.g., hearing “bass guitar”) as well as from “context” (e.g., inferring “bass guitar” in the context of a “rock” song), most state-of-the-art systems for automatic music annotation ignore this context. Indeed, although contextual relationships correlate tags, many auto-taggers model tags independently. This paper presents a novel, generative approach to improve automatic music annotation by modeling contextual relationships between tags. A Dirichlet mixture model (DMM) is proposed as a second, additional stage in the modeling process, to supplement any auto-tagging system that generates a semantic multinomial (SMN) over a vocabulary of tags when annotating a song. For each tag in the vocabulary, a DMM captures the broader context the tag defines by modeling tag co-occurrence patterns in the SMNs of songs associated with the tag. When annotating songs, the DMMs refine SMN annotations by leveraging contextual evidence. Experimental results demonstrate the benefits of combining a variety of auto-taggers with this generative context model. It generally outperforms other approaches to modeling context as well.
Keywords :
audio signal processing; information retrieval; music; polynomials; Dirichlet mixture model; audio clips; auto-taggers model; automatic music annotation; contextual relationship modeling; direct auditory cues; generative context model; semantic associations; semantic multinomial; semantic music annotation; semantic music retrieval; song annotation; tag vocabulary; Acoustics; Context; Context modeling; Correlation; Rocks; Semantics; Vocabulary; Audio annotation and retrieval; Dirichlet mixture models; context modeling; music information retrieval;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2011.2172423
Filename :
6047567
Link To Document :
بازگشت