DocumentCode
2396284
Title
A hierarchical clustering algorithm based on density for data stratification
Author
Sun, Zhiwei
Author_Institution
Coll. of Comput. Sci. & Inf. Eng., TianJin Univ. of Sci. & Technol., Tianjin, China
fYear
2012
fDate
19-20 May 2012
Firstpage
2208
Lastpage
2211
Abstract
Cluster analysis is a primary method for data mining. Existing cluster approaches use global input parameters. But the real data can´t be described by them, and each input parameter will have a significant effect to the result. A new algorithm named HDBSCAN will be introduced for the purpose of cluster analysis which can produce nature stratification. The algorithm have a preprocess procedure which use graph to express the structure of neighborhood, thus the parameters can be easily set, this is important for determination of input parameters; and then a hierarchical approach based on density clustering algorithm is used to analysis the data with the different Eps-neighborhood. At last the relationship among the cluster results will be got by scanning the cluster results above. We show how to get the intrinsic clustering structure and show the results. Both theory analysis and experimental results confirm the approach can cluster data with automatic setting different parameters in different partitions.
Keywords
data analysis; data mining; graph theory; pattern clustering; Eps-neighborhood; cluster analysis; data mining; data stratification; density clustering algorithm; global input parameters; hierarchical clustering algorithm; intrinsic clustering structure; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Complexity theory; Partitioning algorithms; Spatial databases; Clustering; Density; hierarchical; stratification; visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems and Informatics (ICSAI), 2012 International Conference on
Conference_Location
Yantai
Print_ISBN
978-1-4673-0198-5
Type
conf
DOI
10.1109/ICSAI.2012.6223489
Filename
6223489
Link To Document