مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2832158

Title :

Using two-stage Approach to Clustering

Author :

Yue, Shihong ; Song, Kai ; Li, Yi

Author_Institution :

Tianjin Univ., Tianjin

fYear :

2006

fDate :

15-17 Dec. 2006

Firstpage :

419

Lastpage :

423

Abstract :

A grid-based approach to clustering is presented. Each grid is a hypercube in data space, and Appriori algorithm is used to find the representing subsets of each cluster. The two-stage procedure -first finding all representing subsets then clustering in the second stage -is found to perform well when compared with direct clustering of data. The use of the representing subsets can efficiently find the data structure of the give dataset. Consequently, the new approach can effectively overcome the parameter-sensitive problem that is encountered in most of the conventional grid-based approaches to clustering. At the same time, if a proper threshold in the new approach is chosen, the computation time to cluster a large dataset will further decrease greatly. Two experiments are used to illustrate the performances of the new proposed approach and verify its merits.

Keywords :

data handling; data mining; data structures; grid computing; Appriori algorithm; data clustering; data structure; data-mining; grid-based approach; knowledge discovery; Association rules; Chemical engineering; Clustering algorithms; Computational complexity; Data mining; Data structures; Databases; Hypercubes; Itemsets; Partitioning algorithms;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Industrial Technology, 2006. ICIT 2006. IEEE International Conference on

Conference_Location :

Mumbai

Print_ISBN :

1-4244-0726-5

Electronic_ISBN :

1-4244-0726-5

Type :

conf

DOI :

10.1109/ICIT.2006.372238

Filename :

4237560

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2832158