DocumentCode
182990
Title
A new approach for mining deep order-preserving submatrices
Author
Zhengling Liao ; Jie Luo ; Meihang Li ; Yun Xue ; Tiechen Li ; Xiaohui Hu
Author_Institution
Sch. of Phys. & Telecommun. Eng., South China Normal Univ., Guangzhou, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
341
Lastpage
345
Abstract
In this paper, we proposed an exact method to discover all order-preserving submatrices (OPSMs) based on frequent sequential pattern mining. Firstly, an existing algorithm calACS is adjusted to disclose all common subsequences between every two row sequences, therefore all the deep OPSMs corresponding to long patterns with few supporting sequences will not be missed. Then an improved data structure for prefix tree was used to store and traverse all common subsequences, and Apriori principle was employed to mine the frequent sequential pattern efficiently. Finally, experiments were implemented on real data set and GO analysis was applied to identify whether the patterns discovered were biologically significant. The results demonstrate the effectiveness and the efficiency of this method.
Keywords
data mining; data structures; matrix algebra; trees (mathematics); GO analysis; OPSM; apriori principle; calACS algorithm; data structure; deep order-preserving submatrix mining; frequent sequential pattern mining; improved data structure; prefix tree; Atmospheric measurements; Bioinformatics; Biological system modeling; Data mining; Data structures; Gene expression; Apriori principle; OPSM; all common subsequences; biclustering; frequent sequence; the prefix tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2014 11th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5147-5
Type
conf
DOI
10.1109/FSKD.2014.6980857
Filename
6980857
Link To Document