• DocumentCode
    667501
  • Title

    An exemplar-based NMF approach to audio event detection

  • Author

    Gemmeke, Jort F. ; Vuegen, Lode ; Karsmakers, Peter ; Vanrumste, Bart ; Van hamme, Hugo

  • Author_Institution
    ESAT-PSI, KU Leuven, Leuven, Belgium
  • fYear
    2013
  • fDate
    20-23 Oct. 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    We present a novel, exemplar-based method for audio event detection based on non-negative matrix factorisation. Building on recent work in noise robust automatic speech recognition, we model events as a linear combination of dictionary atoms, and mixtures as a linear combination of overlapping events. The weights of activated atoms in an observation serve directly as evidence for the underlying event classes. The atoms in the dictionary span multiple frames and are created by extracting all possible fixed-length exemplars from the training data. To combat data scarcity of small training datasets, we propose to artificially augment the amount of training data by linear time warping in the feature domain at multiple rates. The method is evaluated on the Office Live and Office Synthetic datasets released by the AASP Challenge on Detection and Classification of Acoustic Scenes and Events.
  • Keywords
    acoustic signal detection; acoustic signal processing; audio signal processing; matrix decomposition; signal classification; speech recognition; AASP Challenge; Office Live datasets; Office Synthetic datasets; acoustic event classification; acoustic event detection; acoustic scene classification; acoustic scene detection; audio event detection; data scarcity; dictionary atoms; dictionary span multiple frames; exemplar-based NMF approach; linear overlapping event combination; linear time warping; noise robust automatic speech recognition; nonnegative matrix factorisation; possible fixed-length exemplar extraction; Acoustics; Dictionaries; Event detection; Hidden Markov models; Measurement; Noise; Training data; Audio event detection; NMF; exemplars;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
  • Conference_Location
    New Paltz, NY
  • ISSN
    1931-1168
  • Type

    conf

  • DOI
    10.1109/WASPAA.2013.6701847
  • Filename
    6701847