• DocumentCode
    2709059
  • Title

    Mining Order-Preserving Submatrices from Data with Repeated Measurements

  • Author

    Chui, Chun Kit ; Kao, Ben ; Yip, Kevin Y. ; Lee, Sau Dan

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Hong Kong, Hong Kong
  • fYear
    2008
  • fDate
    15-19 Dec. 2008
  • Firstpage
    133
  • Lastpage
    142
  • Abstract
    Order-preserving submatrices (OPSM´s) have been shown useful in capturing concurrent patterns in data when the relative magnitudes of data items are more important than their absolute values. To cope with data noise, repeated experiments are often conducted to collect multiple measurements. We propose and study a more robust version of OPSM, where each data item is represented by a set of values obtained from replicated experiments. We call the new problem OPSM-RM (OPSM with repeated measurements). We define OPSM-RM based on a number of practical requirements. We discuss the computational challenges of OPSM-RM and propose a generic mining algorithm. We further propose a series of techniques to speed up two time-dominating components of the algorithm. We clearly show the effectiveness of our methods through a series of experiments conducted on real microarray data.
  • Keywords
    data mining; data item; data noise; generic mining algorithm; order-preserving submatrices; real microarray data; Bioinformatics; Computer science; Data analysis; Data mining; Gene expression; Intrusion detection; Noise level; Noise measurement; Noise robustness; OPSM; gene expression; sequence mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2008. ICDM '08. Eighth IEEE International Conference on
  • Conference_Location
    Pisa
  • ISSN
    1550-4786
  • Print_ISBN
    978-0-7695-3502-9
  • Type

    conf

  • DOI
    10.1109/ICDM.2008.12
  • Filename
    4781108