• DocumentCode
    420299
  • Title

    Knowledge-based clustering: a semi-autonomous algorithm using local and global data properties

  • Author

    Bean, C.L. ; Kambhampati, C.

  • Author_Institution
    Dept. of Comput. Sci., Hull Univ., UK
  • Volume
    1
  • fYear
    2004
  • fDate
    27-30 June 2004
  • Firstpage
    95
  • Abstract
    Cluster analysis is a heuristic technique used to reveal inherent groupings in data, but most modern clustering algorithms are highly data and person dependent. This paper presents a clustering technique that minimises the need for user-defined parameters and handles both single and mixed attribute type data sets. The algorithm is based on elements of rough set theory and uses a combination of local and global data properties to obtain meaningful clustering solutions. It is self-consistent in its approach to clustering; thus ensuring the same clustering solution when applied to the same data by different users. The results from a range of real-world and synthetic data sets are used to establish its performance.
  • Keywords
    knowledge based systems; minimisation; pattern clustering; rough set theory; statistical analysis; cluster analysis; clustering algorithms; global data properties; heuristic technique; inherent groupings; knowledge based clustering; local data properties; minimisation; mixed attribute type data sets; rough set theory; semiautonomous algorithm; user defined parameters; Algorithm design and analysis; Clustering algorithms; Computer science; Data analysis; Data mining; Data structures; Humans; Partitioning algorithms; Proposals; Set theory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Information, 2004. Processing NAFIPS '04. IEEE Annual Meeting of the
  • Print_ISBN
    0-7803-8376-1
  • Type

    conf

  • DOI
    10.1109/NAFIPS.2004.1336256
  • Filename
    1336256