DocumentCode :
1758572
Title :
Learning to Track Multiple Targets
Author :
Xiao Liu ; Dacheng Tao ; Mingli Song ; Luming Zhang ; Jiajun Bu ; Chun Chen
Author_Institution :
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou, China
Volume :
26
Issue :
5
fYear :
2015
fDate :
42125
Firstpage :
1060
Lastpage :
1073
Abstract :
Monocular multiple-object tracking is a fundamental yet under-addressed computer vision problem. In this paper, we propose a novel learning framework for tracking multiple objects by detection. First, instead of heuristically defining a tracking algorithm, we learn that a discriminative structure prediction model from labeled video data captures the interdependence of multiple influence factors. Given the joint targets state from the last time step and the observation at the current frame, the joint targets state at the current time step can then be inferred by maximizing the joint probability score. Second, our detection results benefit from tracking cues. The traditional detection algorithms need a nonmaximal suppression postprocessing to select a subset from the total detection responses as the final output and a large number of selection mistakes are induced, especially under a congested circumstance. Our method integrates both detection and tracking cues. This integration helps to decrease the postprocessing mistake risk and to improve performance in tracking. Finally, we formulate the entire model training into a convex optimization problem and estimate its parameters using the cutting plane optimization. Experiments show that our method performs effectively in a large variety of scenarios, including pedestrian tracking in crowd scenes and vehicle tracking in congested traffic.
Keywords :
computer vision; convex programming; learning (artificial intelligence); object detection; probability; target tracking; video signal processing; computer vision problem; congested traffic; convex optimization problem; crowd scenes; cutting plane optimization; detection cues; discriminative structure prediction model; joint probability score maximization; joint target state; labeled video data; learning framework; model training; monocular multiple-object tracking; multiple influence factor interdependence; multiple target tracking; nonmaximal suppression postprocessing; object detection; parameter estimation; pedestrian tracking; performance improvement; postprocessing mistake risk; subset selection; time step; total detection responses; tracking cues; vehicle tracking; Cameras; Detectors; Joints; Prediction algorithms; Target tracking; Trajectory; Cutting plane; discriminative model; interdependence; learning to track; multiple-object tracking; structure prediction; tracking-by-detection; tracking-by-detection.;
fLanguage :
English
Journal_Title :
Neural Networks and Learning Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
2162-237X
Type :
jour
DOI :
10.1109/TNNLS.2014.2333751
Filename :
6855348
Link To Document :
بازگشت