DocumentCode
2335932
Title
A clustering method for very large mixed data sets
Author
Sánchez-Díaz, Guillermo ; Ruiz-Shulcloper, José
Author_Institution
Autonomous Univ., Hidalgo State, Mexico
fYear
2001
fDate
2001
Firstpage
643
Lastpage
644
Abstract
In developed countries, especially over the last decade, there has been an explosive growth in the capability to generate, collect and use very large data sets. The objects of these data sets could be simultaneously described by quantitative and qualitative attributes. At present, algorithms able to process either very large data sets (in metric spaces) or mixed (qualitative and quantitative) incomplete data (missing value) sets have been developed, but not for very large mixed incomplete data sets. In this paper we introduce a new clustering method named GLC+ to process very large mixed incomplete data sets in order to obtain a partition in connected sets
Keywords
data mining; pattern clustering; very large databases; GLC+; clustering method; connected set partition; very large mixed incomplete data sets; Art; Clustering algorithms; Clustering methods; Cybernetics; Explosives; Extraterrestrial measurements; Finance; Mathematics; Physics; Skeleton;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
Conference_Location
San Jose, CA
Print_ISBN
0-7695-1119-8
Type
conf
DOI
10.1109/ICDM.2001.989590
Filename
989590
Link To Document