• DocumentCode
    270537
  • Title
    Sport Type Classification of Mobile Videos
  • Author
    Cricri, Francesco ; Roininen, Mikko J. ; Leppänen, Jussi ; Mate, Sujeet ; Curcio, Igor D. D. ; Uhlmann, Stefan ; Gabbouj, Moncef
  • Author_Institution
    Dept. of Signal Process., Tampere Univ. of Technol. (TUT), Tampere, Finland
  • Volume
    16
  • Issue
    4
  • fYear
    2014
  • fDate
    June 2014
  • Firstpage
    917
  • Lastpage
    932
  • Abstract
    The recent proliferation of mobile video content has emphasized the need for applications such as automatic organization and automatic editing of videos. These applications could greatly benefit from domain knowledge about the content. However, extracting semantic information from mobile videos is a challenging task, due to their unconstrained nature. We extract domain knowledge about sport events recorded by multiple users, by classifying the sport type into soccer, American football, basketball, tennis, ice-hockey, or volleyball. We adopt a multi-user and multimodal approach, where each user simultaneously captures audio-visual content and auxiliary sensor data (from magnetometers and accelerometers). Firstly, each modality is separately analyzed; then, analysis results are fused for obtaining the sport type. The auxiliary sensor data is used for extracting more discriminative spatio-temporal visual features and efficient camera motion features. The contribution of each modality to the fusion process is adapted according to the quality of the input data. We performed extensive experiments on data collected at public sport events, showing the merits of using different combinations of modalities and fusion methods. The results indicate that analyzing multimodal and multi-user data, coupled with adaptive fusion, improves classification accuracies in most tested cases, up to 95.45%.
  • Keywords
    feature extraction; image classification; mobile computing; sport; video signal processing; American football; adaptive fusion; audiovisual content; automatic video editing; automatic video organization; auxiliary sensor data; basketball; camera motion features; discriminative spatio-temporal visual feature extraction; fusion process; ice hockey; mobile video content; modality; multimodal data; multiuser data; public sport events; soccer; sport type classification; tennis; volleyball; Accelerometers; Cameras; Data mining; Feature extraction; Magnetometers; Videos; Visualization; Fusion; mobile; sport; video
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1520-9210
  • Type
    jour
  • DOI
    10.1109/TMM.2014.2307552
  • Filename
    6746214