A clustering method for very large mixed data sets

Author

Sánchez-Díaz, Guillermo ; Ruiz-Shulcloper, José

Author_Institution

Autonomous Univ., Hidalgo State, Mexico

fYear

2001

fDate

2001

Firstpage

643

Lastpage

644

Abstract

In developed countries, especially over the last decade, there has been an explosive growth in the capability to generate, collect and use very large data sets. The objects of these data sets could be simultaneously described by quantitative and qualitative attributes. At present, algorithms able to process either very large data sets (in metric spaces) or mixed (qualitative and quantitative) incomplete data (missing value) sets have been developed, but not for very large mixed incomplete data sets. In this paper we introduce a new clustering method named GLC+ to process very large mixed incomplete data sets in order to obtain a partition in connected sets

Keywords

data mining; pattern clustering; very large databases; GLC+; clustering method; connected set partition; very large mixed incomplete data sets; Art; Clustering algorithms; Clustering methods; Cybernetics; Explosives; Extraterrestrial measurements; Finance; Mathematics; Physics; Skeleton;

fLanguage

English

Publisher

ieee

Conference_Titel

Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on

Conference_Location

San Jose, CA

Print_ISBN

0-7695-1119-8

Type

conf

DOI

10.1109/ICDM.2001.989590

Filename

989590

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2335932