• DocumentCode
    3569623
  • Title

    A saliency-based approach to audio event detection and summarization

  • Author

    Zlatintsi, A. ; Maragos, P. ; Potamianos, A. ; Evangelopoulos, G.

  • Author_Institution
    Sch. of ECE, Nat. Tech. Univ. of Athens, Athens, Greece
  • fYear
    2012
  • Firstpage
    1294
  • Lastpage
    1298
  • Abstract
    In this paper, we approach the problem of audio summarization by saliency computation of audio streams, exploring the potential of a modulation model for the detection of perceptually important audio events based on saliency models, along with various fusion schemes for their combination. The fusion schemes include linear, adaptive and nonlinear methods. A machine learning approach, where training of the features is performed, was also applied for the purpose of comparison with the proposed technique. For the evaluation of the algorithm we use audio data taken from movies and we show that nonlinear fusion schemes perform best. The results are reported on the MovSum database, using objective evaluations (against ground-truth denoting the perceptually important audio events). Analysis of the selected audio segments is also performed against a labeled database in respect to audio categories, while a method for fine-tuning of the selected audio events is proposed.
  • Keywords
    audio databases; audio streaming; classification; information retrieval; learning (artificial intelligence); modulation; sensor fusion; MovSum database; adaptive method; audio categories; audio data; audio event detection; audio segment selection; audio streams; audio summarization problem; labeled database; linear method; machine learning; modulation model; nonlinear fusion schemes; nonlinear methods; saliency-based approach; Computational modeling; Databases; Event detection; Feature extraction; Frequency modulation; Motion pictures; Speech; audio summarization; modulation model; monomodal audio saliency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
  • ISSN
    2219-5491
  • Print_ISBN
    978-1-4673-1068-0
  • Type

    conf

  • Filename
    6334317