DocumentCode :
2369445
Title :
OP-cluster: clustering by tendency in high dimensional space
Author :
Liu, Jinze ; Wang, Wei
Author_Institution :
Dept. of Comput. Sci., North Carolina Univ., Chapel Hill, NC, USA
fYear :
2003
fDate :
19-22 Nov. 2003
Firstpage :
187
Lastpage :
194
Abstract :
Clustering is the process of grouping a set of objects into classes of similar objects. Because of unknownness of the hidden patterns in the data sets, the definition of similarity is very subtle. Until recently, similarity measures are typically based on distances, e.g Euclidean distance and cosine distance. We propose a flexible yet powerful clustering model, namely OP-cluster (Order Preserving Cluster). Under this new model, two objects are similar on a subset of dimensions if the values of these two objects induce the same relative order of those dimensions. Such a cluster might arise when the expression levels of (coregulated) genes can rise or fall synchronously in response to a sequence of environment stimuli. Hence, discovery of OP-Cluster is essential in revealing significant gene regulatory networks. A deterministic algorithm is designed and implemented to discover all the significant OP-Clusters. A set of extensive experiments has been done on several real biological data sets to demonstrate its effectiveness and efficiency in detecting coregulated patterns.
Keywords :
biology computing; data mining; deterministic algorithms; pattern clustering; statistical analysis; Euclidean distance; OP cluster model; cosine distance; environment stimuli; gene regulatory network; high dimensional space; pattern clustering; Algorithm design and analysis; Clustering algorithms; Computer science; Data analysis; Databases; Euclidean distance; Machine learning; Pattern analysis; Pattern recognition; Statistical analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2003. ICDM 2003. Third IEEE International Conference on
Print_ISBN :
0-7695-1978-4
Type :
conf
DOI :
10.1109/ICDM.2003.1250919
Filename :
1250919
Link To Document :
بازگشت