DocumentCode
800149
Title
ARGOS: automatically extracting repeating objects from multimedia streams
Author
Herley, Cormac
Author_Institution
Microsoft Res., Redmond, WA, USA
Volume
8
Issue
1
fYear
2006
Firstpage
115
Lastpage
129
Abstract
Many media streams consist of distinct objects that repeat. For example, broadcast television and radio signals contain advertisements, call sign jingles, songs, and even whole programs that repeat. The problem we address is to explicitly identify the underlying structure in repetitive streams and de-construct them into their component objects. Our algorithm exploits dimension reduction techniques on the audio portion of a multimedia stream to make search and buffering feasible. Our architecture assumes no a priori knowledge of the streams, and does not require that the repeating objects (ROs) be known. Everything the system needs, including the position and duration of the ROs, is learned on the fly. We demonstrate that it is perfectly feasible to identify in realtime ROs that occur days or even weeks apart in audio or video streams. Both the compute and buffering requirements are comfortably within reach for a basic desktop computer. We outline the algorithms, enumerate several applications and present results from real broadcast streams.
Keywords
audio signal processing; digital video broadcasting; media streaming; object detection; audio dimension reduction; buffering requirements; multimedia stream; object search; real broadcast stream; repeating objects extraction; Application software; Fingerprint recognition; Layout; Libraries; Multimedia communication; Radio broadcasting; Speech; Streaming media; TV broadcasting; Technological innovation; Audio fingerprint; low-dimension representation; multimedia; repeats; segmentation;
fLanguage
English
Journal_Title
Multimedia, IEEE Transactions on
Publisher
ieee
ISSN
1520-9210
Type
jour
DOI
10.1109/TMM.2005.861286
Filename
1580531
Link To Document