DocumentCode
2813715
Title
A mathematical model of similarity and clustering
Author
Sun, Fu-Shing ; Tzeng, Chun-Hung
Author_Institution
Dept. of Comput. Sci., Ball State Univ., Muncie, IN, USA
Volume
1
fYear
2004
fDate
5-7 April 2004
Firstpage
460
Abstract
This paper introduces an abstract model of data similarity and clustering. A similarity on a space Ω is formulated explicitly by a reflexive and symmetric binary relation, called a tolerance relation, for which we introduce three types of coverings of Ω. Given a covering U, a clustering is defined to be minimal sub-covering. To search for an optimal clustering is to minimize the number of clusters, which is intractable in general. This paper proposes a heuristic method to search for sub-optimal clusterings for a given tolerance relation.
Keywords
data mining; data structures; pattern clustering; search problems; data clustering; data similarity; heuristic method; minimal subcovering; optimal clustering; reflexive binary relation; symmetric binary relation; tolerance relation; Clustering algorithms; Computer science; Data mining; Electronic mail; Euclidean distance; Fractals; Lattices; Mathematical model; Sun; Topology;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN
0-7695-2108-8
Type
conf
DOI
10.1109/ITCC.2004.1286499
Filename
1286499
Link To Document