Title :
Multi-label vs. combined single-label sound event detection with deep neural networks
Author :
Emre Cakir;Toni Heittola;Heikki Huttunen;Tuomas Virtanen
Author_Institution :
Department of Signal Processing, Tampere University of Technology, Finland
Abstract :
In real-life audio scenes, many sound events from different sources are simultaneously active, which makes the automatic sound event detection challenging. In this paper, we compare two different deep learning methods for the detection of environmental sound events: combined single-label classification and multi-label classification. We investigate the accuracy of both methods on the audio with different levels of polyphony. Multi-label classification achieves an overall 62.8% accuracy, whereas combined single-label classification achieves a very close 61.9% accuracy. The latter approach offers more flexibility on real-world applications by gathering the relevant group of sound events in a single classifier with various combinations.
Keywords :
"Training","Feature extraction","Signal processing","Europe","Event detection","Databases","Cost function"
Conference_Titel :
Signal Processing Conference (EUSIPCO), 2015 23rd European
Electronic_ISBN :
2076-1465
DOI :
10.1109/EUSIPCO.2015.7362845