• DocumentCode
    270509
  • Title

    The importance of audio descriptors in automatic soccer highlights generation

  • Author

    Raventós, Arnau ; Quijada, Raul ; Torres, L. ; Tarrés, Francesc ; Carasusán, Eusebio ; Giribet, Daniel

  • Author_Institution
    Signal Theor. & Commun. Dept., UPC - Barcelona Tech., Barcelona, Spain
  • fYear
    2014
  • fDate
    11-14 Feb. 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Automatic generation of sports highlights from recorded audiovisual content has been object of great interest in recent years. The problem is indeed important in the production of second and third division leagues highlights videos where the quantity of raw material is significant and does not contain manual annotations. Many approaches are mostly based on the analysis of the video and disregard the important information provided by the audio track. In this paper, a new approach that combines audio and video descriptors for automatic soccer highlights generation is proposed. The approach is based on the segmentation of the video contents into shots that are further analyzed in order to determine its relevance and interest. These video-shots are scored taking into account the fusion between different audio and video features. The paper is mainly focused to emphasize the importance of audio detectors that play a key role in the analysis and scoring of the video-shots. Specifically, a new algorithm for referee´s whistle detection is proposed. The algorithm has been proven to be very robust and efficiently discriminates professional whistles against other types of noises such as public cheering-up, music instruments, etc. Several results have been produced using real soccer video sequences that prove the validity of the proposed audio and video fusion scheme.
  • Keywords
    acoustic signal detection; audio signal processing; feature extraction; image segmentation; sport; video signal processing; audio descriptors; audio detectors; audio features; audio track; automatic soccer highlights generation; leagues highlights videos; manual annotations; music instruments; professional whistles; public cheering-up; recorded audiovisual content; referee whistle detection; soccer video sequences; sports highlights; video descriptors; video features; video segmentation; video shots; System-on-chip; XML; audio descriptors; content analysis; multimodal processing and fusion; semantic detection; video highlights; whistle detector;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multi-Conference on Systems, Signals & Devices (SSD), 2014 11th International
  • Conference_Location
    Barcelona
  • Type

    conf

  • DOI
    10.1109/SSD.2014.6808845
  • Filename
    6808845