• DocumentCode
    2335932
  • Title

    A clustering method for very large mixed data sets

  • Author

    Sánchez-Díaz, Guillermo ; Ruiz-Shulcloper, José

  • Author_Institution
    Autonomous Univ., Hidalgo State, Mexico
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    643
  • Lastpage
    644
  • Abstract
    In developed countries, especially over the last decade, there has been an explosive growth in the capability to generate, collect and use very large data sets. The objects of these data sets could be simultaneously described by quantitative and qualitative attributes. At present, algorithms able to process either very large data sets (in metric spaces) or mixed (qualitative and quantitative) incomplete data (missing value) sets have been developed, but not for very large mixed incomplete data sets. In this paper we introduce a new clustering method named GLC+ to process very large mixed incomplete data sets in order to obtain a partition in connected sets
  • Keywords
    data mining; pattern clustering; very large databases; GLC+; clustering method; connected set partition; very large mixed incomplete data sets; Art; Clustering algorithms; Clustering methods; Cybernetics; Explosives; Extraterrestrial measurements; Finance; Mathematics; Physics; Skeleton;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2001. ICDM 2001, Proceedings IEEE International Conference on
  • Conference_Location
    San Jose, CA
  • Print_ISBN
    0-7695-1119-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2001.989590
  • Filename
    989590