مرکز منطقه ای اطلاع رساني علوم و فناوري - A Generative Context Model for Semantic Music Annotation and Retrieval

DocumentCode :

1351826

Title :

A Generative Context Model for Semantic Music Annotation and Retrieval

Author :

Miotto, Riccardo ; Lanckriet, Gert

Author_Institution :

Dept. of Inf. Eng., Univ. of Padova, Padova, Italy

Volume :

Issue :

fYear :

2012

fDate :

5/1/2012 12:00:00 AM

Firstpage :

1096

Lastpage :

1108

Abstract :

While a listener may derive semantic associations for audio clips from direct auditory cues (e.g., hearing “bass guitar”) as well as from “context” (e.g., inferring “bass guitar” in the context of a “rock” song), most state-of-the-art systems for automatic music annotation ignore this context. Indeed, although contextual relationships correlate tags, many auto-taggers model tags independently. This paper presents a novel, generative approach to improve automatic music annotation by modeling contextual relationships between tags. A Dirichlet mixture model (DMM) is proposed as a second, additional stage in the modeling process, to supplement any auto-tagging system that generates a semantic multinomial (SMN) over a vocabulary of tags when annotating a song. For each tag in the vocabulary, a DMM captures the broader context the tag defines by modeling tag co-occurrence patterns in the SMNs of songs associated with the tag. When annotating songs, the DMMs refine SMN annotations by leveraging contextual evidence. Experimental results demonstrate the benefits of combining a variety of auto-taggers with this generative context model. It generally outperforms other approaches to modeling context as well.

Keywords :

audio signal processing; information retrieval; music; polynomials; Dirichlet mixture model; audio clips; auto-taggers model; automatic music annotation; contextual relationship modeling; direct auditory cues; generative context model; semantic associations; semantic multinomial; semantic music annotation; semantic music retrieval; song annotation; tag vocabulary; Acoustics; Context; Context modeling; Correlation; Rocks; Semantics; Vocabulary; Audio annotation and retrieval; Dirichlet mixture models; context modeling; music information retrieval;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2011.2172423

Filename :

6047567

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1351826