DocumentCode :
419883
Title :
Efficient multimodal features for automatic soccer highlight generation
Author :
Wan, Kongwah ; Xu, Changsheng
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Volume :
3
fYear :
2004
fDate :
23-26 Aug. 2004
Firstpage :
973
Abstract :
We describe efficient audio/visual features and their multimodal combination to detect highlights in soccer video. A novel audio feature first detects dominant speech portions in the commentary coincident with segments of high excitement in the game. Verification is then performed in the visual domain by detecting the presence of goal-mouth in the current shot and a high frequency of camera shot change in the subsequent shots. The cascaded process filters spurious candidate highlights from the noisy audio. The impressive results obtained on a large video test-set belie the technical simplicity in the system, which may now enable rapid generation of highlights on low-cost devices such as household set-top-boxes.
Keywords :
audio-visual systems; feature extraction; filtering theory; speech recognition; video signal processing; audio-visual features; automatic soccer highlight generation; camera shots; cascaded process filters; dominant speech portion detection; goal mouth detection; household set top boxes; large video test set; multimodal features; soccer video; Cameras; Computer vision; Filters; Frequency; Games; Gunshot detection systems; Logic; Speech; System testing; Wide area networks;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
ISSN :
1051-4651
Print_ISBN :
0-7695-2128-2
Type :
conf
DOI :
10.1109/ICPR.2004.1334691
Filename :
1334691
Link To Document :
بازگشت