DocumentCode
3268901
Title
Substructure clustering on sequential 3d object datasets
Author
Tan, Zhenqiang ; Tung, Anthony K H
Author_Institution
Sch. of Comput., Nat. Univ. of Singapore, Singapore
fYear
2004
fDate
30 March-2 April 2004
Firstpage
634
Lastpage
645
Abstract
We look at substructure clustering of sequential 3d objects. A sequential 3d object is a set of points located in a three dimensional space that are linked up to form a sequence. Given a set of sequential 3d objects, our aim is to find significantly large substructures which are present in many of the sequential 3d objects. Unlike traditional subspace clustering methods in which objects are compared based on values in the same dimension, the matching dimensions between two 3d sequential objects are affected by both the translation and rotation of the objects and are thus not well defined. Instead, similarity between the objects are judge by computing a structural distance measurement call rmsd (Root Mean Square Distance) which require proper alignment (including translation and rotation) of the objects. As the computation of rmsd is expensive, we proposed a new measure call ald (Angle Length Distance) which is shown experimentally to approximate rmsd. Based on ald, we define a new clustering model called sCluster and devise an algorithm for discovering all maximum sCluster in a 3d sequential dataset. Experiments are conducted to illustrate the efficiency and effectiveness of our algorithm.
Keywords
computational geometry; data mining; pattern clustering; statistical analysis; angle length distance; root mean square distance; sCluster clustering model; sequential 3d object dataset; structural distance measurement; subspace clustering methods; substructure clustering; Amino acids; Clustering algorithms; Clustering methods; Data mining; Distance measurement; Drugs; Molecular biophysics; Organisms; Protein engineering; Tree graphs;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2004. Proceedings. 20th International Conference on
ISSN
1063-6382
Print_ISBN
0-7695-2065-0
Type
conf
DOI
10.1109/ICDE.2004.1320033
Filename
1320033
Link To Document