Title :
Multihuman Tracking Based on a Spatial–Temporal Appearance Match
Author :
Yuan Shen ; Zhenjiang Miao
Author_Institution :
Inst. of Inf. Sci., Beijing Jiaotong Univ., Beijing, China
Abstract :
This paper improves appearance representation for multihuman tracking. Many previous methods extract low-level appearance features, such as color histograms and texture, sometimes combined with spatial information, independently for each frame. These methods ignore the temporal distribution of features, and per-frame features may be unstable under illumination changes, human pose variation, and image noise. To address this, we propose a novel appearance representation, the spatial-temporal appearance model, based on the statistical distribution of a Gaussian mixture model (GMM). It represents the appearance of a tracklet as a whole using dynamic spatial and temporal information: the spatial information consists of dynamic subregions, and the temporal information is the dynamic duration of each subregion. Each subregion is modeled as a weighted Gaussian component of the GMM, and the online expectation-maximization (online EM) algorithm is used to estimate the GMM parameters. We then propose a tracklet association method that uses Bayesian prediction and the Jensen-Shannon divergence. Bayesian prediction predicts the locations of targets, and the Jensen-Shannon divergence measures the distance between the spatial-temporal appearance distributions of two tracklets. Finally, we test our approach on four challenging datasets (TRECVID, CAVIAR, ETH, and EPFL Terrace) and achieve good results.
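Illustrative_Sketch :
The appearance distance described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each tracklet's appearance is summarized by a one-dimensional Gaussian mixture (the paper's model is built over appearance features of dynamic subregions), and, because the Jensen-Shannon divergence between two mixtures has no closed form, it approximates the integral on a uniform grid. The function names, mixture parameters, and grid range are hypothetical choices made for illustration.

```python
import numpy as np


def gmm_pdf(x, weights, means, stds):
    """Density of a 1-D Gaussian mixture evaluated at the points x."""
    x = np.asarray(x, dtype=float)[:, None]          # shape (n_points, 1)
    w = np.asarray(weights, dtype=float)             # shape (n_components,)
    mu = np.asarray(means, dtype=float)
    sd = np.asarray(stds, dtype=float)
    comp = np.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))
    return comp @ w                                  # weighted sum of components


def js_divergence(p, q, dx):
    """Jensen-Shannon divergence (in nats) of two densities sampled on a uniform grid.

    JS(p, q) = 0.5 * KL(p || m) + 0.5 * KL(q || m), with m = 0.5 * (p + q).
    The integrals are approximated by Riemann sums with spacing dx.
    """
    eps = 1e-300                                     # avoids log(0) where a density underflows
    m = 0.5 * (p + q)
    kl_pm = np.sum(p * np.log((p + eps) / (m + eps))) * dx
    kl_qm = np.sum(q * np.log((q + eps) / (m + eps))) * dx
    return 0.5 * (kl_pm + kl_qm)


# Hypothetical appearance mixtures for three tracklets (illustrative values only).
grid = np.linspace(-10.0, 20.0, 6001)
dx = grid[1] - grid[0]
tracklet_a = gmm_pdf(grid, [0.6, 0.4], [0.0, 3.0], [1.0, 1.0])
tracklet_b = gmm_pdf(grid, [0.5, 0.5], [0.3, 3.2], [1.0, 1.0])    # similar to a
tracklet_c = gmm_pdf(grid, [0.6, 0.4], [9.0, 12.0], [1.0, 1.0])   # far from a

d_ab = js_divergence(tracklet_a, tracklet_b, dx)
d_ac = js_divergence(tracklet_a, tracklet_c, dx)
print("JS(a, b) =", d_ab)
print("JS(a, c) =", d_ac)
```

Under these assumptions, a smaller divergence indicates more similar appearance distributions, so tracklet b would be a better association candidate for tracklet a than tracklet c. The divergence is symmetric and, in nats, bounded above by ln 2, which makes it convenient as a matching distance.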
Keywords :
Bayes methods; Gaussian distribution; expectation-maximisation algorithm; image colour analysis; image representation; image texture; tracking; Bayesian prediction; CAVIAR; EPFL Terrace; ETH; Gaussian mixture model; Jensen-Shannon divergence; TRECVID; appearance representation; color histogram; dynamic duration time; dynamic spatial information; dynamic subregions; human pose variation; image noise; low-level appearance features; multihuman tracking; online expectation-maximization algorithm; spatial-temporal appearance distribution; spatial-temporal appearance match; spatial-temporal appearance model; statistical distribution; temporal distribution; temporal information; tracklet association method; weighted Gaussian distribution; Computational modeling; Feature extraction; Gaussian distribution; Histograms; Image color analysis; Target tracking; Jensen–Shannon divergence; multihuman tracking; online EM; spatial–temporal appearance;
Journal_Title :
Circuits and Systems for Video Technology, IEEE Transactions on
DOI :
10.1109/TCSVT.2013.2280073