DocumentCode
2709059
Title
Mining Order-Preserving Submatrices from Data with Repeated Measurements
Author
Chui, Chun Kit ; Kao, Ben ; Yip, Kevin Y. ; Lee, Sau Dan
Author_Institution
Dept. of Comput. Sci., Univ. of Hong Kong, Hong Kong
fYear
2008
fDate
15-19 Dec. 2008
Firstpage
133
Lastpage
142
Abstract
Order-preserving submatrices (OPSM´s) have been shown useful in capturing concurrent patterns in data when the relative magnitudes of data items are more important than their absolute values. To cope with data noise, repeated experiments are often conducted to collect multiple measurements. We propose and study a more robust version of OPSM, where each data item is represented by a set of values obtained from replicated experiments. We call the new problem OPSM-RM (OPSM with repeated measurements). We define OPSM-RM based on a number of practical requirements. We discuss the computational challenges of OPSM-RM and propose a generic mining algorithm. We further propose a series of techniques to speed up two time-dominating components of the algorithm. We clearly show the effectiveness of our methods through a series of experiments conducted on real microarray data.
Keywords
data mining; data item; data noise; generic mining algorithm; order-preserving submatrices; real microarray data; Bioinformatics; Computer science; Data analysis; Data mining; Gene expression; Intrusion detection; Noise level; Noise measurement; Noise robustness; OPSM; gene expression; sequence mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
Conference_Location
Pisa
ISSN
1550-4786
Print_ISBN
978-0-7695-3502-9
Type
conf
DOI
10.1109/ICDM.2008.12
Filename
4781108
Link To Document