Title :
Coercion: A Distributed Clustering Algorithm for Categorical Data
Author :
Bin Wang ; Yang Zhou ; Xinhong Hei
Author_Institution :
Sch. of Comput. Sci. & Eng., Xi´an Univ. of Technol., Xi´an, China
Abstract :
Clustering is an important technology in data mining. Squeezer is one such clustering algorithm for categorical data and it is more efficient than most existing algorithms for categorical data. But Squeezer is time consuming for very large datasets which are distributed in different servers. Thus, we employ the distributed thinking to improve Squeezer and a distributed algorithm for categorical data called Coercion is proposed in this paper. In order to present detailed complexity results for Coercion, we also conduct an experimental study with standard as well as synthetic data sets to demonstrate the effectiveness of the new algorithm.
Keywords :
data mining; pattern clustering; Coercion; Squeezer; categorical data; data mining; distributed clustering algorithm; Algorithm design and analysis; Clustering algorithms; Distributed databases; Partitioning algorithms; Presses; Rocks; Servers; Sqeezer; categorical data; clustering; data mining; distributed;
Conference_Titel :
Computational Intelligence and Security (CIS), 2013 9th International Conference on
Conference_Location :
Leshan
Print_ISBN :
978-1-4799-2548-3
DOI :
10.1109/CIS.2013.149