DocumentCode :
254610
Title :
Multi-source Multi-modal Activity Recognition in Aerial Video Surveillance
Author :
Hammoud, Riad I. ; Sahin, Cem S. ; Blasch, Erik P. ; Rhodes, Bradley J.
Author_Institution :
BAE Syst., Burlington, MA, USA
fYear :
2014
fDate :
23-28 June 2014
Firstpage :
237
Lastpage :
244
Abstract :
Recognizing activities in wide aerial/overhead imagery remains a challenging problem due in part to low-resolution video and cluttered scenes with a large number of moving objects. In the context of this research, we deal with two un-synchronized data sources collected in real-world operating scenarios: full-motion videos (FMV) and analyst call-outs (ACO) in the form of chat messages (voice-to-text) made by a human watching the streamed FMV from an aerial platform. We present a multi-source multi-modal activity/event recognition system for surveillance applications, consisting of: (1) detecting and tracking multiple dynamic targets from a moving platform, (2) representing FMV target tracks and chat messages as graphs of attributes, (3) associating FMV tracks and chat messages using a probabilistic graph-based matching approach, and (4) detecting spatial-temporal activity boundaries. We also present an activity pattern learning framework which uses the multi-source associated data as training to index a large archive of FMV videos. Finally, we describe a multi-intelligence user interface for querying an index of activities of interest (AOIs) by movement type and geo-location, and for playing-back a summary of associated text (ACO) and activity video segments of targets-of-interest (TOIs) (in both pixel and geo-coordinates). Such tools help the end-user to quickly search, browse, and prepare mission reports from multi-source data.
Keywords :
image matching; image motion analysis; image representation; indexing; learning (artificial intelligence); object detection; query processing; target tracking; user interfaces; video streaming; video surveillance; ACO; FMV streaming; FMV target track representation; FMV videos; activities of interest; activity pattern learning framework; activity video segments; aerial imagery; aerial video surveillance; analyst call-outs; associated text; full-motion video; geolocation; index query; multi-intelligence user interface; multiple dynamic target detection; multiple dynamic target tracking; multisource associated data; multisource multimodal activity recognition; multisource multimodal event recognition; overhead imagery; probabilistic graph-based matching approach; spatial-temporal activity boundary detection; targets-of-interest; unsynchronized data sources; voice-to-text chat messages; Pattern recognition; Radar tracking; Semantics; Streaming media; Target tracking; Vehicles; FMV exploitation; MINER; activity recognition; chat and video fusion; event recognition; fusion; graph matching; graph representation; surveillance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition Workshops (CVPRW), 2014 IEEE Conference on
Conference_Location :
Columbus, OH
Type :
conf
DOI :
10.1109/CVPRW.2014.44
Filename :
6909989
Link To Document :
بازگشت