DocumentCode :
3716043
Title :
Music boundary detection using neural networks on spectrograms and self-similarity lag matrices
Author :
Thomas Grill;Jan Schluter
Author_Institution :
Austrian Research Institute for Artificial Intelligence (OFAI), Vienna, Austria
fYear :
2015
Firstpage :
1296
Lastpage :
1300
Abstract :
The first step of understanding the structure of a music piece is to segment it into formative parts. A recently successful method for finding segment boundaries employs a Convolutional Neural Network (CNN) trained on spectrogram excerpts. While setting a new state of the art, it often misses boundaries defined by non-local musical cues, such as segment repetitions. To account for this, we propose a refined variant of self-similarity lag matrices representing long-term relationships. We then demonstrate different ways of fusing this feature with spectrogram excerpts within a CNN, resulting in a boundary recognition performance superior to the previous state of the art. We assume that the integration of more features in a similar fashion would improve the performance even further.
Keywords :
"Spectrogram","Context","Convolution","Neural networks","Europe","Kernel"
Publisher :
ieee
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN :
2076-1465
Type :
conf
DOI :
10.1109/EUSIPCO.2015.7362593
Filename :
7362593
Link To Document :
بازگشت