DocumentCode :
3605631
Title :
Deep Learning and Music Adversaries
Author :
Kereliuk, Corey ; Sturm, Bob L. ; Larsen, Jan
Author_Institution :
DTU Compute, Tech. Univ. of Denmark, Frederiksberg, Denmark
Volume :
17
Issue :
11
fYear :
2015
Firstpage :
2059
Lastpage :
2071
Abstract :
An adversary is an agent designed to make a classification system perform in some particular way, e.g., increase the probability of a false negative. Recent work builds adversaries for deep learning systems applied to image object recognition, exploiting the parameters of the system to find the minimal perturbation of the input image such that the system misclassifies it with high confidence. We adapt this approach to construct and deploy an adversary of deep learning systems applied to music content analysis. In our case, however, the system inputs are magnitude spectral frames, which require special care in order to produce valid input audio signals from network- derived perturbations . For two different train-test partitionings of two benchmark datasets, and two different architectures , we find that this adversary is very effective. We find that convolutional architectures are more robust compared to systems based on a majority vote over individually classified audio frames. Furthermore , we experiment with a new system that integrates an adversary into the training loop, but do not find that this improves the resilience of the system to new adversaries.
Keywords :
audio signal processing; content management; convolution; image classification; information retrieval; learning (artificial intelligence); music; object recognition; audio frame classification; audio signal; classification system; convolutional architecture; deep learning system; image object recognition; magnitude spectral frame; music adversary; music content analysis; network derived perturbation; training loop; Benchmark testing; Computer architecture; Machine learning; Neural networks; Rhythm; Training; AEA-MIR content-based processing and music information retrieval; deep learning;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2015.2478068
Filename :
7254179
Link To Document :
بازگشت