Title :
A new approach for mining deep order-preserving submatrices
Author :
Zhengling Liao ; Jie Luo ; Meihang Li ; Yun Xue ; Tiechen Li ; Xiaohui Hu
Author_Institution :
Sch. of Phys. & Telecommun. Eng., South China Normal Univ., Guangzhou, China
Abstract :
In this paper, we proposed an exact method to discover all order-preserving submatrices (OPSMs) based on frequent sequential pattern mining. Firstly, an existing algorithm calACS is adjusted to disclose all common subsequences between every two row sequences, therefore all the deep OPSMs corresponding to long patterns with few supporting sequences will not be missed. Then an improved data structure for prefix tree was used to store and traverse all common subsequences, and Apriori principle was employed to mine the frequent sequential pattern efficiently. Finally, experiments were implemented on real data set and GO analysis was applied to identify whether the patterns discovered were biologically significant. The results demonstrate the effectiveness and the efficiency of this method.
Keywords :
data mining; data structures; matrix algebra; trees (mathematics); GO analysis; OPSM; apriori principle; calACS algorithm; data structure; deep order-preserving submatrix mining; frequent sequential pattern mining; improved data structure; prefix tree; Atmospheric measurements; Bioinformatics; Biological system modeling; Data mining; Data structures; Gene expression; Apriori principle; OPSM; all common subsequences; biclustering; frequent sequence; the prefix tree;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4799-5147-5
DOI :
10.1109/FSKD.2014.6980857