• DocumentCode
    2268417
  • Title

    Discriminative Acoustic Event Recognition in multimedia recordings

  • Author

    Khanwalkar, Saurabh ; Saikumar, Guruprasad ; Srivastava, Amit ; Natarajan, Premkumar

  • Author_Institution
    Raytheon BBN Technol., Cambridge, MA, USA
  • fYear
    2011
  • fDate
    Aug. 29 2011-Sept. 2 2011
  • Firstpage
    1312
  • Lastpage
    1316
  • Abstract
    In this paper, we describe an Acoustic Event Recognition (AER) system for locating events of interest in the audio stream of multimedia recordings. We focus on two non-speech acoustic events; bomb explosions and gunfire, which typically exist in surveillance videos and are of importance in monitoring and alerting applications. Recognition is performed using a discriminative approach based on Support Vector Machines (SVM). We compare the new approach to a baseline system that utilizes a Hidden Markov Model (HMM)-based classification approach. We performed experiments on a corpus of publicly available video files containing gunfire and explosion events. Our results show that the new discriminative approach, when configured to use a rich combination of acoustic features, achieves a high retrieval precision at a notable recall under noisy conditions. As compared to HMM-based system, we achieved 54% relative improvement in F-score for explosion recognition with 1.5% relative improvement in F-score for gunfire recognition.
  • Keywords
    audio streaming; hidden Markov models; image classification; image recognition; multimedia communication; multimedia computing; support vector machines; video surveillance; AER system; F-score recognition; HMM-based classification approach; SVM; audio streaming; bomb explosion; discriminative acoustic event recognition system; gunfire recognition; hidden Markov model-based classification approach; multimedia recording; nonspeech acoustic event; support vector machine; video surveillance; Explosions; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2011 19th European
  • Conference_Location
    Barcelona
  • ISSN
    2076-1465
  • Type

    conf

  • Filename
    7074061