• DocumentCode
    3414014
  • Title

    On automatic drum transcription using non-negative matrix deconvolution and Itakura Saito divergence

  • Author

    Roebel, Axel ; Pons, Jordi ; Liuni, Marco ; Lagrange, Mathieu

  • Author_Institution
    IRCAM, UPMC, Paris, France
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    414
  • Lastpage
    418
  • Abstract
    This paper presents an investigation into the detection and classification of drum sounds in polyphonic music and drum loops using non-negative matrix deconvolution (NMD) and the Itakura Saito divergence. The Itakura Saito divergence has recently been proposed as especially appropriate for decomposing audio spectra because it is scale invariant, but it has not yet been widely adopted. The article studies new contributions to audio event detection methods using the Itakura Saito divergence that improve efficiency and numerical stability, and simplify the generation of target pattern sets. A new approach for handling background sounds is proposed, and a new detection criterion based on estimating the perceptual presence of the target class sources is introduced. Experimental results obtained for drum detection in polyphonic music and drum soli demonstrate the beneficial effects of the proposed extensions.
  • Keywords
    audio signal processing; deconvolution; information retrieval; matrix decomposition; music; musical instruments; numerical stability; signal classification; signal detection; source separation; Itakura Saito divergence; NMD; audio event detection methods; audio spectra decomposition; automatic drum transcription; background sound handling; drum loops; drum sound classification; drum sound detection; efficiency improvement; music information retrieval; nonnegative matrix deconvolution; numerical stability improvement; polyphonic music; source separation; target pattern set generation; Algorithm design and analysis; Art; Convergence; Databases; Noise; Training; Training data; Source separation; audio event detection; drum transcription; music information retrieval; non-negative matrix deconvolution;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178002
  • Filename
    7178002
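
The scale invariance that the abstract attributes to the Itakura Saito divergence can be illustrated with a minimal sketch. It is not taken from the paper: the function names `itakura_saito` and `squared_euclidean`, the random test vectors, and the scale factor are assumptions chosen for illustration. The sketch uses the standard elementwise definition d(x|y) = x/y - log(x/y) - 1 for positive values and contrasts it with a squared Euclidean cost.

```python
import numpy as np

def itakura_saito(x, y):
    """Itakura Saito divergence summed over all elements (requires x, y > 0)."""
    ratio = np.asarray(x, dtype=float) / np.asarray(y, dtype=float)
    return float(np.sum(ratio - np.log(ratio) - 1.0))

def squared_euclidean(x, y):
    """Squared Euclidean distance, shown only as a non-invariant contrast."""
    diff = np.asarray(x, dtype=float) - np.asarray(y, dtype=float)
    return float(np.sum(diff ** 2))

# Hypothetical positive "spectra"; the values are arbitrary.
rng = np.random.default_rng(0)
x = rng.uniform(0.1, 1.0, size=8)
y = rng.uniform(0.1, 1.0, size=8)
scale = 1000.0  # assumed gain applied to both the observation and the model

is_original = itakura_saito(x, y)
is_scaled = itakura_saito(scale * x, scale * y)
eu_original = squared_euclidean(x, y)
eu_scaled = squared_euclidean(scale * x, scale * y)

print("IS divergence:", is_original, "after scaling:", is_scaled)
print("Squared Euclidean:", eu_original, "after scaling:", eu_scaled)
```

Because the divergence depends only on the ratio x/y, multiplying both arguments by the same gain leaves it unchanged, whereas the squared Euclidean cost grows with the square of the gain. This is the property that makes low-energy spectral components count as much as high-energy ones in the decomposition cost.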