Title :
Parallel clustering of big data of spatio-temporal trajectory
Author :
Chunchun Hu; Xionghua Kang; Nianxue Luo; Qiansheng Zhao
Author_Institution :
School of Geodesy and Geomatics, Wuhan University, China
Abstract :
The computing efficiency of many spatial data analysis algorithms declines sharply as data size increases. Introducing a distributed parallel computing model is therefore a meaningful way to extend spatial data analysis methods and improve their computational efficiency. Considering the characteristics of spatio-temporal trajectory data, which are massive, time-dependent, and dynamic, we propose a fast method for calculating trajectory similarity based on coarse-grained Dynamic Time Warping. The algorithm greatly reduces computation time when trajectory sequences are very long. We also propose a parallel trajectory clustering strategy for big data under the Hadoop MapReduce model. The trajectory data are sliced, and the trajectory similarity computation and the iterative computation of cluster centers are handled by multiple worker nodes simultaneously. Experimental results of the parallel trajectory clustering, which is based on the open source project Mahout and was run on vehicle trajectory data, show that the clustering results are valid. The computing performance of the parallel clustering improves markedly as the trajectory data size increases, and the new parallel clustering method outperforms traditional algorithms such as k-means.
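The abstract does not say how the trajectories are coarsened, so the following is only a minimal sketch of one plausible reading of "coarse-grained Dynamic Time Warping": each trajectory is first reduced by averaging consecutive blocks of points, and classic DTW is then run on the shorter sequences. The function names (coarsen, dtw, coarse_dtw), the block parameter, and the block-averaging step are assumptions made for illustration, not the authors' method.

```python
import math


def coarsen(traj, block):
    """Average each run of `block` consecutive (x, y) points into one point.

    Assumption: block averaging stands in for the paper's unspecified
    coarse-graining step.
    """
    out = []
    for i in range(0, len(traj), block):
        chunk = traj[i:i + block]
        n = len(chunk)
        out.append((sum(p[0] for p in chunk) / n,
                    sum(p[1] for p in chunk) / n))
    return out


def dtw(a, b):
    """Classic DTW distance with Euclidean point cost, O(len(a) * len(b))."""
    inf = math.inf
    # prev[j] holds the best cost of aligning the rows seen so far with b[:j].
    prev = [inf] * (len(b) + 1)
    prev[0] = 0.0
    for p in a:
        curr = [inf] * (len(b) + 1)
        for j, q in enumerate(b, start=1):
            cost = math.dist(p, q)
            curr[j] = cost + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[len(b)]


def coarse_dtw(a, b, block=4):
    """DTW on block-averaged trajectories (hypothetical helper)."""
    return dtw(coarsen(a, block), coarsen(b, block))
```

With a block size of k, each sequence shrinks by roughly a factor of k, so the dynamic-programming table has about k² fewer cells. That is the kind of saving the abstract describes for very long trajectory sequences, at the cost of an approximate distance.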
Keywords :
"Trajectory","Clustering algorithms","Big data","Algorithm design and analysis","Computational modeling","Clustering methods","Data mining"
Conference_Title :
Natural Computation (ICNC), 2015 11th International Conference on
Electronic_ISBN :
2157-9563
DOI :
10.1109/ICNC.2015.7378088