Title :
On Mining Micro-array data by Order-Preserving Submatrix
Author :
Lin Cheung ; Yip, Kevin Y. ; Cheung, David W. ; Kao, B. ; Ng, Michael K.
Author_Institution :
The University of Hong Kong, Hong Kong
Abstract :
We study the problem of pattern-based subspace clustering. Unlike traditional clustering methods that focus on grouping objects with similar values on a set of dimensions, clustering by pattern similarity finds objects that exhibit a coherent pattern of rises and falls in subspaces. Applications of pattern-based subspace clustering include DNA micro-array data analysis, automatic recommendation systems and target marketing systems. Our goal is to devise pattern-based clustering methods that are capable of (1) discovering useful patterns of various shapes, and (2) discovering all significant patterns. We argue that previous solutions in pattern-based subspace clustering do not satisfy both requirements. Our approach is to extend the idea of Order-Preserving Submatrix (or OPSM). We devise a novel algorithm for mining OPSM, show that OPSM can be generalized to cover most existing pattern-based clustering models, and propose a number of extension to the original OPSM model.
Keywords :
Data mining; Gene Expression; Patternbased clustering; Artificial intelligence; Bioinformatics; Clustering algorithms; Clustering methods; Computer science; DNA; Data analysis; Data mining; Gene expression; Shape; Data mining; Gene Expression; Patternbased clustering;
Conference_Titel :
Data Engineering Workshops, 2005. 21st International Conference on
Print_ISBN :
0-7695-2657-8
DOI :
10.1109/ICDE.2005.253