• DocumentCode
    730072
  • Title

    Sound event detection in real life recordings using coupled matrix factorization of spectral representations and class activity annotations

  • Author

    Mesaros, Annamaria ; Heittola, Toni ; Dikmen, Onur ; Virtanen, Tuomas

  • Author_Institution
    Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    151
  • Lastpage
    155
  • Abstract
    Methods for detection of overlapping sound events in audio involve matrix factorization approaches, often assigning separated components to event classes. We present a method that bypasses the supervised construction of class models. The method learns the components as a non-negative dictionary in a coupled matrix factorization problem, where the spectral representation and the class activity annotation of the audio signal share the activation matrix. In testing, the dictionaries are used to estimate directly the class activations. For dealing with large amount of training data, two methods are proposed for reducing the size of the dictionary. The methods were tested on a database of real life recordings, and outperformed previous approaches by over 10%.
  • Keywords
    acoustic signal detection; audio recording; audio signal processing; dictionaries; matrix decomposition; signal representation; spectral analysis; activation matrix; audio signal; class activity annotation; coupled matrix factorization; non-negative dictionary; overlapping sound event detection; real life recordings; spectral representation; Accuracy; Acoustics; Context; Dictionaries; Event detection; Testing; Training; coupled non-negative matrix factorization; non-negative dictionaries; sound event detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7177950
  • Filename
    7177950