DocumentCode :
800149
Title :
ARGOS: automatically extracting repeating objects from multimedia streams
Author :
Herley, Cormac
Author_Institution :
Microsoft Res., Redmond, WA, USA
Volume :
8
Issue :
1
fYear :
2006
Firstpage :
115
Lastpage :
129
Abstract :
Many media streams consist of distinct objects that repeat. For example, broadcast television and radio signals contain advertisements, call sign jingles, songs, and even whole programs that repeat. The problem we address is to explicitly identify the underlying structure in repetitive streams and de-construct them into their component objects. Our algorithm exploits dimension reduction techniques on the audio portion of a multimedia stream to make search and buffering feasible. Our architecture assumes no a priori knowledge of the streams, and does not require that the repeating objects (ROs) be known. Everything the system needs, including the position and duration of the ROs, is learned on the fly. We demonstrate that it is perfectly feasible to identify in realtime ROs that occur days or even weeks apart in audio or video streams. Both the compute and buffering requirements are comfortably within reach for a basic desktop computer. We outline the algorithms, enumerate several applications and present results from real broadcast streams.
Keywords :
audio signal processing; digital video broadcasting; media streaming; object detection; audio dimension reduction; buffering requirements; multimedia stream; object search; real broadcast stream; repeating objects extraction; Application software; Fingerprint recognition; Layout; Libraries; Multimedia communication; Radio broadcasting; Speech; Streaming media; TV broadcasting; Technological innovation; Audio fingerprint; low-dimension representation; multimedia; repeats; segmentation;
fLanguage :
English
Journal_Title :
Multimedia, IEEE Transactions on
Publisher :
ieee
ISSN :
1520-9210
Type :
jour
DOI :
10.1109/TMM.2005.861286
Filename :
1580531
Link To Document :
بازگشت