• DocumentCode
    3648262
  • Title

    A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure

  • Author

    Taufiq Hasan;Hynek Bořil;Abhijeet Sangwan;John H. L. Hansen

  • Author_Institution
    Center for Robust Speech Systems (CRSS), University of Texas at Dallas, Richardson, 75080, USA
  • fYear
    2012
  • fDate
    3/1/2012 12:00:00 AM
  • Firstpage
    2381
  • Lastpage
    2384
  • Abstract
    A generic method for sports video highlight selection is presented in this study. Processing begins where the video is divided into short segments and several multi-modal features are extracted from each video segment. Excitability is computed based on the likelihood of the features lying in certain regions of their probability density functions that are exciting and rare. The proposed measure is used to rank order the partitioned segment stream to compress the overall video sequence and produce a contiguous set of highlights. Experiments are performed on baseball videos using excitement in the commentators' speech, audio energy, slow motion replay, scene cut density, and motion activity as features. Subjective evaluation of excitability and ranking of video segments yield a higher correlation with the proposed measure compared to well-established techniques, indicating the effectiveness of the approach.
  • Keywords
    "Videos","Feature extraction","Speech","Games","Sports equipment","Motion segmentation","Correlation"
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288394
  • Filename
    6288394