• DocumentCode
    835987
  • Title

    A generative model for music transcription

  • Author

    Cemgil, A. Taylan ; Kappen, Hilbert J. ; Barber, David

  • Author_Institution
    Stichfing Neurale Netwerken, Nimegen, Netherlands
  • Volume
    14
  • Issue
    2
  • fYear
    2006
  • fDate
    3/1/2006 12:00:00 AM
  • Firstpage
    679
  • Lastpage
    694
  • Abstract
    In this paper, we present a graphical model for polyphonic music transcription. Our model, formulated as a dynamical Bayesian network, embodies a transparent and computationally tractable approach to this acoustic analysis problem. An advantage of our approach is that it places emphasis on explicitly modeling the sound generation procedure. It provides a clear framework in which both high level (cognitive) prior information on music structure can be coupled with low level (acoustic physical) information in a principled manner to perform the analysis. The model is a special case of the, generally intractable, switching Kalman filter model. Where possible, we derive, exact polynomial time inference procedures, and otherwise efficient approximations. We argue that our generative model based approach is computationally feasible for many music applications and is readily extensible to more general auditory scene analysis scenarios.
  • Keywords
    Bayes methods; Kalman filters; acoustic signal processing; music; polynomials; acoustic analysis problem; dynamical Bayesian network; polynomial time inference; polyphonic music transcription; sound generation procedure; switching Kalman filter; Audio databases; Content based retrieval; Graphical models; Humans; Image analysis; Information analysis; Multiple signal classification; Music information retrieval; Performance analysis; Speech processing; Bayesian signal processing; music transcription; polyphonic pitch tracking; switching Kalman filters;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TSA.2005.852985
  • Filename
    1597270