DocumentCode :
3226405
Title :
Efficient fusion of cluster ensembles using inherent voting
Author :
Anandhi, R.J. ; Subramanyam, Natarajan
Author_Institution :
Dept. of CSE, Dr MGR Univ., Chennai, India
fYear :
2009
fDate :
22-24 July 2009
Firstpage :
1
Lastpage :
5
Abstract :
Discovering interesting, implicit knowledge and general relationships in geographic information databases is very important to understand and to use the spatial data. Spatial clustering has been recognized as a primary data mining method for knowledge discovery in spatial databases. In this paper, we have analyzed an efficient method for the fusion of the outputs of the various clusterers, with less computing. We have discussed our proposed slice and dice cluster ensemble merging technique (SDEM) for spatial datasets and used it in our three-phase clustering combination technique in this paper. Voting procedure is normally used to assign labels for the clusters and resolving the correspondence problem, but we have eliminated by usage of degree of agreement vector. Another common problem in any cluster ensembles is the computation of voting matrix which is in the order of n2, where n is the number of data points, which is very expensive with respect to spatial datasets. In our method, as we travel down the layered merge, we calculate degree of agreement (DOA) factor, based on the count of agreed clusterers. Using the updated DOA at every layer, the movement of unresolved, unsettled data elements will be handled at much reduced the computational cost. Added advantage of this approach is the reuse of the gained knowledge in previous layers, thereby yielding better cluster accuracy and robustness.
Keywords :
data mining; geographic information systems; pattern clustering; visual databases; cluster ensemble fusion; degree of agreement factor; degree of agreement vector; geographic information databases; implicit knowledge; inherent voting; knowledge discovery; primary data mining method; slice and dice cluster ensemble merging technique; spatial clustering; spatial databases; three-phase clustering combination technique; Clustering algorithms; Data mining; Data visualization; Iterative algorithms; Merging; Partitioning algorithms; Spatial databases; Spatial resolution; Visual databases; Voting; Clustering ensembles; Consensus function; Data mining; Degree of Agreement; Spatial data mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Agent & Multi-Agent Systems, 2009. IAMA 2009. International Conference on
Conference_Location :
Chennai
Print_ISBN :
978-1-4244-4710-7
Type :
conf
DOI :
10.1109/IAMA.2009.5228053
Filename :
5228053
Link To Document :
بازگشت