DocumentCode :
1143242
Title :
PPCA-Based Missing Data Imputation for Traffic Flow Volume: A Systematical Approach
Author :
Qu, Li ; Li, Li ; Zhang, Yi ; Hu, Jianming
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing, China
Volume :
10
Issue :
3
fYear :
2009
Firstpage :
512
Lastpage :
522
Abstract :
The missing data problem greatly affects traffic analysis. In this paper, we put forward a new reliable method called probabilistic principal component analysis (PPCA) to impute the missing flow volume data based on historical data mining. First, we review the current missing data-imputation method and why it may fail to yield acceptable results in many traffic flow applications. Second, we examine the statistical properties of traffic flow volume time series. We show that the fluctuations of traffic flow are Gaussian type and that principal component analysis (PCA) can be used to retrieve the features of traffic flow. Third, we discuss how to use a robust PCA to filter out the abnormal traffic flow data that disturb the imputation process. Finally, we recall the theories of PPCA/Bayesian PCA-based imputation algorithms and compare their performance with some conventional methods, including the nearest/mean historical imputation methods and the local interpolation/regression methods. The experiments prove that the PPCA method provides significantly better performance than the conventional methods, reducing the root-mean-square imputation error by at least 25%.
Keywords :
Bayes methods; Gaussian processes; data mining; interpolation; principal component analysis; probability; regression analysis; road traffic; time series; traffic engineering computing; Bayesian PCA; Gaussian type; PPCA; historical data mining; local interpolation/regression method; missing data imputation; nearest/mean historical imputation method; probabilistic principal component analysis; statistical property; traffic flow volume; traffic flow volume time series; transportation systems; Missing data; probabilistic principal component analysis (PPCA); traffic flow volume;
fLanguage :
English
Journal_Title :
Intelligent Transportation Systems, IEEE Transactions on
Publisher :
ieee
ISSN :
1524-9050
Type :
jour
DOI :
10.1109/TITS.2009.2026312
Filename :
5169998
Link To Document :
بازگشت