DocumentCode :
665016
Title :
Temporal confusion network for speech-based soccer event retrieval
Author :
Pham, Nhut M. ; Vu, Quan H.
Author_Institution :
Artificial Intell. Lab., Univ. of Sci., Ho Chi Minh City, Vietnam
fYear :
2013
fDate :
16-18 Oct. 2013
Firstpage :
549
Lastpage :
553
Abstract :
This paper introduces the temporal confusion network and its application to speech-based soccer event retrieval, where an event is marked by the announcer's spoken words. A temporal confusion network is a confusion network in which each cluster is annotated with temporal information. Because the purpose of soccer event retrieval is to retrieve only the interesting highlights, not the whole video clip, temporal information is crucial for locating them. Expanding the indexing model from 1-best transcriptions to temporal confusion networks can improve recall rates for event retrieval. Experiments are conducted on the first round of the World Cup 2010 and the Vietnamese AFF Suzuki Cup 2008 databases. In the best case, an average improvement of 7.1% in recall rate is achieved.
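The abstract's central idea, indexing every time-stamped cluster hypothesis rather than only the top one, can be illustrated with a minimal sketch. This is not the authors' implementation: the `Cluster` class, the function names, the `min_posterior` threshold, and the example words and posteriors are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Cluster:
    """One confusion-network cluster with its time span (assumed layout)."""
    start: float                  # start time in seconds
    end: float                    # end time in seconds
    hypotheses: Dict[str, float]  # competing word -> posterior probability


def index_one_best(clusters: List[Cluster]) -> Dict[str, List[Tuple[float, float]]]:
    """Index only the highest-posterior word of each cluster."""
    index: Dict[str, List[Tuple[float, float]]] = {}
    for c in clusters:
        best = max(c.hypotheses, key=c.hypotheses.get)
        index.setdefault(best, []).append((c.start, c.end))
    return index


def index_tcn(
    clusters: List[Cluster], min_posterior: float = 0.1
) -> Dict[str, List[Tuple[float, float, float]]]:
    """Index every hypothesis above a (hypothetical) posterior threshold,
    keeping the cluster's time span so a hit can be located in the video."""
    index: Dict[str, List[Tuple[float, float, float]]] = {}
    for c in clusters:
        for word, p in c.hypotheses.items():
            if p >= min_posterior:
                index.setdefault(word, []).append((c.start, c.end, p))
    return index


# Made-up example: the recognizer slightly prefers "go" over "goal".
clusters = [
    Cluster(12.0, 12.4, {"go": 0.55, "goal": 0.45}),
    Cluster(12.4, 12.9, {"by": 0.70, "bye": 0.30}),
]

one_best = index_one_best(clusters)
tcn = index_tcn(clusters)
```

In this toy case a query for "goal" misses under the 1-best index but is found, with its time span, under the temporal confusion network index, which is the kind of recall gain the abstract describes.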
Keywords :
content-based retrieval; speech recognition; video retrieval; 1-best transcriptions; announcer spoken words; indexing model; speech-based soccer event retrieval; temporal confusion network; temporal information; Indexing; Lattices; Semantics; Speech; Timing; content-based multimedia retrieval; event detection; soccer video
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Advanced Technologies for Communications (ATC), 2013 International Conference on
Conference_Location :
Ho Chi Minh City
ISSN :
2162-1020
Print_ISBN :
978-1-4799-1086-1
Type :
conf
DOI :
10.1109/ATC.2013.6698176
Filename :
6698176