مرکز منطقه ای اطلاع رساني علوم و فناوري - Sequential extraction of clusters for imbalanced data

DocumentCode :

3302012

Title :

Sequential extraction of clusters for imbalanced data

Author :

Hengjin Tang ; Miyamoto, Sadaaki

Author_Institution :

Grad. Sch. of Syst. & Inf. Eng., Univ. of Tsukuba, Tsukuba, Japan

fYear :

2013

fDate :

13-15 Dec. 2013

Firstpage :

281

Lastpage :

285

Abstract :

K-means type clustering has a central role in various clustering algorithms. In spite of its usefulness, there is a well-known drawback, the number of clusters should be determined beforehand, and clustering results are strongly depends of this number. Many researchers study on how to estimate this number and one algorithm is using sequential extraction of clusters. However, the clustering results by this algorithm is severely affected by the initial parameter setting. Additionally, if the dataset consists of imbalanced clusters and shapes, the results also can be worse. To overcome such problems, we propose automatic estimation of parameter values during the clustering process. We show the effectiveness of the proposed algorithm by using numerical examples.

Keywords :

data analysis; parameter estimation; pattern clustering; K-means type clustering; automatic parameter value estimation; clusters sequential extraction; imbalanced data; sequential clustering; Algorithm design and analysis; Clustering algorithms; Data mining; Educational institutions; Noise; Optimization; Shape; clustering; imbalanced data; sequential extraction of clusters;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Granular Computing (GrC), 2013 IEEE International Conference on

Conference_Location :

Beijing

Type :

conf

DOI :

10.1109/GrC.2013.6740422

Filename :

6740422

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3302012