Title :
A clustering method for very large mixed data sets
Author :
Sánchez-Díaz, Guillermo ; Ruiz-Shulcloper, José
Author_Institution :
Autonomous Univ., Hidalgo State, Mexico
Abstract :
In developed countries, especially over the last decade, there has been an explosive growth in the capability to generate, collect and use very large data sets. The objects of these data sets could be simultaneously described by quantitative and qualitative attributes. At present, algorithms able to process either very large data sets (in metric spaces) or mixed (qualitative and quantitative) incomplete data (missing value) sets have been developed, but not for very large mixed incomplete data sets. In this paper we introduce a new clustering method named GLC+ to process very large mixed incomplete data sets in order to obtain a partition in connected sets
Keywords :
data mining; pattern clustering; very large databases; GLC+; clustering method; connected set partition; very large mixed incomplete data sets; Art; Clustering algorithms; Clustering methods; Cybernetics; Explosives; Extraterrestrial measurements; Finance; Mathematics; Physics; Skeleton;
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
DOI :
10.1109/ICDM.2001.989590