DocumentCode :
480590
Title :
The Algorithm of Deviation Measure for Cluster Models Based on the FOCUS Framework and BIRCH
Author :
Feng, Xing-jie ; Pan, Qi
Author_Institution :
Coll. of Comput. Sci. & Technol., Civil Aviation Univ. of China, Tianjin
Volume :
1
fYear :
2008
fDate :
20-22 Dec. 2008
Firstpage :
44
Lastpage :
49
Abstract :
The work of how to measure the difference, or deviation, between two models is an important studied problem. Venkatesh Ganti has developed the FOCUS framework for computing an interpretable, qualifiable deviation measure between two datasets in terms of the models they induce. In this paper, we propose a novel method to measure the deviation for cluster models making use of CF-tree based on the FOCUS framework and BIRCH algorithm. First, for the different comparable datasets, we build the CF-tree as the GCR of different structural components. Second, we use geometrical distance that we defined to quantify the difference between datasets. An experiment of the algorithm on the 2-dimensional synthetic datasets is presented, demonstrating the applicability of the proposed algorithm.
Keywords :
data mining; 2-dimensional synthetic datasets; BIRCH; FOCUS framework; cluster models; data mining; Application software; Clustering algorithms; Computer science; Data mining; Databases; Decision trees; Educational institutions; Information technology; Predictive models; Tree data structures;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Information Technology Application, 2008. IITA '08. Second International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3497-8
Type :
conf
DOI :
10.1109/IITA.2008.427
Filename :
4739532
Link To Document :
بازگشت