Title :
Temporal confusion network for speech-based soccer event retrieval
Author :
Pham, Nhut M. ; Vu, Quan H.
Author_Institution :
Artificial Intell. Lab., Univ. of Sci., Ho Chi Minh City, Vietnam
Abstract :
This paper introduces temporal confusion network and its application for speech-based soccer event retrieval, where an event is remarked by the announcer´s spoken words. A temporal confusion network is a confusion network in which each cluster is marked with temporal information. Since the purpose of soccer event retrieval is to retrieve only the interesting highlights - not the whole video clip, temporal information is crucial in tracking them. By expanding the indexing model from 1-best transcriptions to temporal confusion networks, recall rates for event retrieval can be improved. Experiments are conducted on the first round of World Cup 2010 and the Vietnamese AFF Suzuki-cup 2008 databases. In the best case, an average improvement of 7.1% recall rate is achieved.
Keywords :
content-based retrieval; speech recognition; video retrieval; 1-best transcriptions; announcer spoken words; indexing model; speech-based soccer event retrieval; temporal confusion network; temporal information; video retrieval; Indexing; Lattices; Semantics; Speech; Speech recognition; Timing; content-based multimedia retrieval; event detection; soccer video; temporal confusion network;
Conference_Titel :
Advanced Technologies for Communications (ATC), 2013 International Conference on
Conference_Location :
Ho Chi Minh City
Print_ISBN :
978-1-4799-1086-1
DOI :
10.1109/ATC.2013.6698176