DocumentCode
3648262
Title
A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure
Author
Taufiq Hasan;Hynek Bořil;Abhijeet Sangwan;John H. L. Hansen
Author_Institution
Center for Robust Speech Systems (CRSS), University of Texas at Dallas, Richardson, 75080, USA
fYear
2012
fDate
3/1/2012 12:00:00 AM
Firstpage
2381
Lastpage
2384
Abstract
A generic method for sports video highlight selection is presented in this study. Processing begins where the video is divided into short segments and several multi-modal features are extracted from each video segment. Excitability is computed based on the likelihood of the features lying in certain regions of their probability density functions that are exciting and rare. The proposed measure is used to rank order the partitioned segment stream to compress the overall video sequence and produce a contiguous set of highlights. Experiments are performed on baseball videos using excitement in the commentators´ speech, audio energy, slow motion replay, scene cut density, and motion activity as features. Subjective evaluation of excitability and ranking of video segments yield a higher correlation with the proposed measure compared to well-established techniques indicating the effectiveness of the approach.
Keywords
"Videos","Feature extraction","Speech","Games","Sports equipment","Motion segmentation","Correlation"
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Type
conf
DOI
10.1109/ICASSP.2012.6288394
Filename
6288394
Link To Document