DocumentCode :
2335932
Title :
A clustering method for very large mixed data sets
Author :
Sánchez-Díaz, Guillermo ; Ruiz-Shulcloper, José
Author_Institution :
Autonomous Univ., Hidalgo State, Mexico
fYear :
2001
fDate :
2001
Firstpage :
643
Lastpage :
644
Abstract :
In developed countries, especially over the last decade, there has been an explosive growth in the capability to generate, collect and use very large data sets. The objects of these data sets could be simultaneously described by quantitative and qualitative attributes. At present, algorithms able to process either very large data sets (in metric spaces) or mixed (qualitative and quantitative) incomplete data (missing value) sets have been developed, but not for very large mixed incomplete data sets. In this paper we introduce a new clustering method named GLC+ to process very large mixed incomplete data sets in order to obtain a partition in connected sets
Keywords :
data mining; pattern clustering; very large databases; GLC+; clustering method; connected set partition; very large mixed incomplete data sets; Art; Clustering algorithms; Clustering methods; Cybernetics; Explosives; Extraterrestrial measurements; Finance; Mathematics; Physics; Skeleton;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location :
San Jose, CA
Print_ISBN :
0-7695-1119-8
Type :
conf
DOI :
10.1109/ICDM.2001.989590
Filename :
989590
Link To Document :
بازگشت