• DocumentCode
    629066
  • Title

    Detecting violent content in Hollywood movies by mid-level audio representations

  • Author

    Acar, Esra ; Hopfgartner, Frank ; Albayrak, Sahin

  • Author_Institution
    DAI Lab., Tech. Univ. Berlin, Berlin, Germany
  • fYear
    2013
  • fDate
    17-19 June 2013
  • Firstpage
    73
  • Lastpage
    78
  • Abstract
    Movie violent content detection e.g., for providing automated youth protection services is a valuable video content analysis functionality. Choosing discriminative features for the representation of video segments is a key issue in designing violence detection algorithms. In this paper, we employ mid-level audio features which are based on a Bag-of-Audio Words (BoAW) method using Mel-Frequency Cepstral Coefficients (MFCCs). BoAW representations are constructed with two different methods, namely the vector quantization-based (VQ-based) method and the sparse coding-based (SC-based) method. We choose two-class support vector machines (SVMs) for classifying video shots as (non-)violent. Our experiments on detecting violent video shots in Hollywood movies show that the mid-level audio features provide promising results. Additionally, we establish that the SC-based method outperforms the VQ-based one. More importantly, the SC-based method outperforms the unimodal submissions in the MediaEval Violent Scenes Detection (VSD) task, except one vision-based method in terms of average precision.
  • Keywords
    audio signal processing; cinematography; vector quantisation; video coding; BoAW; Hollywood movies; MFCC; MediaEval violent scenes detection; Mel-frequency cepstral coefficients; SC; VQ; VSD; automated youth protection services; bag-of-audio words method; midlevel audio representations; sparse coding-based method; unimodal submissions; vector quantization-based method; video content analysis functionality; video segments representation; violent content detection; violent video shots; Dictionaries; Feature extraction; Mel frequency cepstral coefficient; Motion pictures; Support vector machines; Training; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Content-Based Multimedia Indexing (CBMI), 2013 11th International Workshop on
  • Conference_Location
    Veszprem
  • ISSN
    1949-3983
  • Print_ISBN
    978-1-4799-0955-1
  • Type

    conf

  • DOI
    10.1109/CBMI.2013.6576556
  • Filename
    6576556