DocumentCode
3414014
Title
On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence
Author
Roebel, Axel ; Pons, Jordi ; Liuni, Marco ; Lagrangey, Mathieu
Author_Institution
IRCAM, UPMC, Paris, France
fYear
2015
fDate
19-24 April 2015
Firstpage
414
Lastpage
418
Abstract
This paper presents an investigation into the detection and classification of drum sounds in polyphonic music and drum loops using non-negative matrix deconvolution (NMD) and the Itakura Saito divergence. The Itakura Saito divergence has recently been proposed as especially appropriate for decomposing audio spectra due to the fact that it is scale invariant, but it has not yet been widely adopted. The article studies new contributions for audio event detection methods using the Itakura Saito divergence that improve efficiency and numerical stability, and simplify the generation of target pattern sets. A new approach for handling background sounds is proposed and moreover, a new detection criteria based on estimating the perceptual presence of the target class sources is introduced. Experimental results obtained for drum detection in polyphonic music and drum soli demonstrate the beneficial effects of the proposed extensions.
Keywords
audio signal processing; deconvolution; information retrieval; matrix decomposition; music; musical instruments; numerical stability; signal classification; signal detection; source separation; Itakura Saito divergence; NMD; audio event detection methods; audio spectra decomposition; automatic drum transcription; background sound handling; drum loops; drum sound classification; drum sound detection; efficiency improvement; music information retrieval; nonnegative matrix deconvolution; numerical stability improvement; polyphonic music; source separation; target pattern set generation; Algorithm design and analysis; Art; Convergence; Databases; Noise; Training; Training data; Source separation; audio event detection; drum transcription; music information retrieval; non-negative matrix deconvolution;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178002
Filename
7178002
Link To Document