• DocumentCode
    741355
  • Title

    Detection and Classification of Acoustic Scenes and Events

  • Author

    Stowell, Dan ; Giannoulis, Dimitrios ; Benetos, Emmanouil ; Lagrange, Mathieu ; Plumbley, Mark D.

  • Author_Institution
    Centre for Digital Music, Queen Mary University of London, London, United Kingdom
  • Volume
    17
  • Issue
    10
  • fYear
    2015
  • Firstpage
    1733
  • Lastpage
    1746
  • Abstract
    For intelligent systems to make best use of the audio modality, it is important that they can recognize not just speech and music, which have been researched as specific tasks, but also general sounds in everyday environments. To stimulate research in this field we conducted a public research challenge: the IEEE Audio and Acoustic Signal Processing Technical Committee challenge on Detection and Classification of Acoustic Scenes and Events (DCASE). In this paper, we report on the state of the art in automatically classifying audio scenes, and automatically detecting and classifying audio events. We survey prior work as well as the state of the art represented by the submissions to the challenge from various research groups. We also provide detail on the organization of the challenge, so that our experience as challenge hosts may be useful to those organizing challenges in similar domains. We created new audio datasets and baseline systems for the challenge; these, as well as some submitted systems, are publicly available under open licenses, to serve as benchmarks for further research in general-purpose machine listening.
  • Keywords
    Event detection; Licenses; Microphones; Music; Speech; Speech recognition; Audio databases; event detection; machine intelligence; pattern recognition;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2015.2428998
  • Filename
    7100934