• DocumentCode
    1351826
  • Title

    A Generative Context Model for Semantic Music Annotation and Retrieval

  • Author

    Miotto, Riccardo ; Lanckriet, Gert

  • Author_Institution
    Dept. of Inf. Eng., Univ. of Padova, Padova, Italy
  • Volume
    20
  • Issue
    4
  • fYear
    2012
  • fDate
    5/1/2012 12:00:00 AM
  • Firstpage
    1096
  • Lastpage
    1108
  • Abstract
    While a listener may derive semantic associations for audio clips from direct auditory cues (e.g., hearing “bass guitar”) as well as from “context” (e.g., inferring “bass guitar” in the context of a “rock” song), most state-of-the-art systems for automatic music annotation ignore this context. Indeed, although contextual relationships correlate tags, many auto-taggers model tags independently. This paper presents a novel, generative approach to improve automatic music annotation by modeling contextual relationships between tags. A Dirichlet mixture model (DMM) is proposed as a second, additional stage in the modeling process, to supplement any auto-tagging system that generates a semantic multinomial (SMN) over a vocabulary of tags when annotating a song. For each tag in the vocabulary, a DMM captures the broader context the tag defines by modeling tag co-occurrence patterns in the SMNs of songs associated with the tag. When annotating songs, the DMMs refine SMN annotations by leveraging contextual evidence. Experimental results demonstrate the benefits of combining a variety of auto-taggers with this generative context model. It generally outperforms other approaches to modeling context as well.
  • Keywords
    audio signal processing; information retrieval; music; polynomials; Dirichlet mixture model; audio clips; auto-taggers model; automatic music annotation; contextual relationship modeling; direct auditory cues; generative context model; semantic associations; semantic multinomial; semantic music annotation; semantic music retrieval; song annotation; tag vocabulary; Acoustics; Context; Context modeling; Correlation; Rocks; Semantics; Vocabulary; Audio annotation and retrieval; Dirichlet mixture models; context modeling; music information retrieval;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2011.2172423
  • Filename
    6047567