Title :
An efficient PAM spatial clustering algorithm based on MapReduce
Author :
Jun Yue ; Shanjun Mao ; Mei Li ; Xuesen Zou
Author_Institution :
Inst. of Remote Sensing & Geographic Inf. Syst., Peking Univ., Beijing, China
Abstract :
Clustering analysis has been a hot area of spatial data mining for several years. With the rapid development of the spatial information technology, the amount of spatial data is growing exponentially and it makes spatial clustering of massive spatial data a challenging task. Aiming to improve the efficiency of the clustering process on massive spatial data, an implementation of parallel Partitioning Around Medoids (PAM) spatial clustering algorithm based on MapReduce is proposed. The experiments on Hadoop and HBase demonstrate that the proposed algorithm can process massive spatial data efficiently and scale well on commodity hardware.
Keywords :
geographic information systems; geophysics computing; HBase; Hadoop; MapReduce; Partitioning Around Medoids; clustering analysis; clustering process; commodity hardware; parallel PAM spatial clustering algorithm; spatial data; spatial information technology; Joints; HBase; Hadoop; K-Medoids; MapReduce; PAM; Spatial Clustering;
Conference_Titel :
Geoinformatics (GeoInformatics), 2014 22nd International Conference on
Conference_Location :
Kaohsiung
DOI :
10.1109/GEOINFORMATICS.2014.6950803