Title :
Efficient multimodal features for automatic soccer highlight generation
Author :
Wan, Kongwah ; Xu, Changsheng
Author_Institution :
Inst. for Infocomm Res., Singapore, Singapore
Abstract :
We describe efficient audio/visual features and their multimodal combination for detecting highlights in soccer video. A novel audio feature first detects dominant speech portions in the commentary that coincide with segments of high excitement in the game. Verification is then performed in the visual domain by detecting the presence of the goal-mouth in the current shot and a high frequency of camera shot changes in the subsequent shots. This cascaded process filters out spurious candidate highlights arising from the noisy audio. The impressive results obtained on a large video test set belie the technical simplicity of the system, which may enable rapid generation of highlights on low-cost devices such as household set-top boxes.
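The cascade outlined in the abstract (audio candidate first, then visual verification) might be sketched roughly as below. This is only an illustrative sketch, not the authors' implementation: the `Shot` fields, the function names, and the `lookahead` and `min_rate` thresholds are all hypothetical, and the per-shot audio and goal-mouth flags are assumed to come from upstream detectors that the abstract does not specify.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Shot:
    start: float           # shot start time in seconds
    end: float             # shot end time in seconds
    excited_speech: bool   # hypothetical audio flag: dominant, excited commentary
    goal_mouth: bool       # hypothetical visual flag: goal-mouth visible in the shot


def shot_change_rate(shots: List[Shot], i: int, lookahead: int) -> float:
    """Shots per second over the `lookahead` shots that follow shot i."""
    following = shots[i + 1 : i + 1 + lookahead]
    if not following:
        return 0.0
    duration = following[-1].end - following[0].start
    return len(following) / duration if duration > 0 else 0.0


def detect_highlights(shots: List[Shot], lookahead: int = 3,
                      min_rate: float = 0.2) -> List[int]:
    """Return indices of shots that pass every stage of the cascade."""
    highlights = []
    for i, shot in enumerate(shots):
        # Stage 1 (audio): keep only shots with excited commentary.
        if not shot.excited_speech:
            continue
        # Stage 2 (visual): the candidate shot must show the goal-mouth.
        if not shot.goal_mouth:
            continue
        # Stage 3 (visual): the following shots must change rapidly.
        if shot_change_rate(shots, i, lookahead) >= min_rate:
            highlights.append(i)
    return highlights
```

Ordering the stages so that the audio test runs first means the visual checks are evaluated only for audio candidates, which is one plausible reading of why such a cascade would suit low-cost devices.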
Keywords :
audio-visual systems; feature extraction; filtering theory; speech recognition; video signal processing; audio-visual features; automatic soccer highlight generation; camera shots; cascaded process filters; dominant speech portion detection; goal mouth detection; household set top boxes; large video test set; multimodal features; soccer video; Cameras; Computer vision; Filters; Frequency; Games; Gunshot detection systems; Logic; Speech; System testing; Wide area networks;
Conference_Title :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
Print_ISBN :
0-7695-2128-2
DOI :
10.1109/ICPR.2004.1334691