• DocumentCode
    3716294
  • Title

    Multi-label vs. combined single-label sound event detection with deep neural networks

  • Author

    Emre Cakir;Toni Heittola;Heikki Huttunen;Tuomas Virtanen

  • Author_Institution
    Department of Signal Processing, Tampere University of Technology, Finland
  • fYear
    2015
  • Firstpage
    2551
  • Lastpage
    2555
  • Abstract
    In real-life audio scenes, many sound events from different sources are simultaneously active, which makes the automatic sound event detection challenging. In this paper, we compare two different deep learning methods for the detection of environmental sound events: combined single-label classification and multi-label classification. We investigate the accuracy of both methods on the audio with different levels of polyphony. Multi-label classification achieves an overall 62.8% accuracy, whereas combined single-label classification achieves a very close 61.9% accuracy. The latter approach offers more flexibility on real-world applications by gathering the relevant group of sound events in a single classifier with various combinations.
  • Keywords
    "Training","Feature extraction","Signal processing","Europe","Event detection","Databases","Cost function"
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2015 23rd European
  • Electronic_ISBN
    2076-1465
  • Type

    conf

  • DOI
    10.1109/EUSIPCO.2015.7362845
  • Filename
    7362845