Title :
Closed Structured Patterns and Motifs Mining without Candidate Maintenance
Author :
Yan, Leiming ; Sun, Zhihui
Author_Institution :
Sch. of Comput. Sci. & Eng., Southeast Univ., Nanjing, China
Abstract :
A structured motif is a special kind of frequent sequential pattern consisting of several components separated by gaps; it has important applications, especially in DNA sequence analysis. Mining closed structured patterns yields a more compact yet complete result set by eliminating redundant patterns that are subsumed by super-patterns. However, the traditional method, which maintains candidates and tests which of them are closed, is inherently costly in both runtime and space. In this paper, we present BMCM, an efficient algorithm for mining closed structured patterns and motifs. It adopts a BI-Composite scheme to generate patterns, prune them, and check their closure without maintaining candidates. Experimental evaluation on synthetic and biological data demonstrates that BMCM is effective in mining closed structured patterns and motifs.
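As a rough illustration of the closure notion mentioned in the abstract, the following minimal Python sketch shows the naive candidate-maintaining approach that the abstract contrasts BMCM with; it is not BMCM or the BI-Composite scheme. It assumes the usual definition of a closed pattern (no proper super-pattern with the same support) and, for brevity, treats only contiguous substrings as patterns rather than gapped structured motifs. The function names and the toy sequences are hypothetical.

```python
from collections import defaultdict


def frequent_substrings(sequences, min_support):
    """Return {pattern: support} for contiguous substrings.

    Support is the number of sequences containing the pattern.
    Simplification: real structured motifs allow gaps between components.
    """
    support = defaultdict(int)
    for seq in sequences:
        # Collect each distinct substring once per sequence.
        seen = {seq[i:j] for i in range(len(seq)) for j in range(i + 1, len(seq) + 1)}
        for pattern in seen:
            support[pattern] += 1
    return {p: s for p, s in support.items() if s >= min_support}


def closed_patterns(frequent):
    """Keep only patterns that have no proper super-pattern of equal support.

    This is the costly step: every candidate is kept in memory and
    compared against every other candidate.
    """
    closed = {}
    for p, s in frequent.items():
        subsumed = any(
            len(q) > len(p) and p in q and frequent[q] == s for q in frequent
        )
        if not subsumed:
            closed[p] = s
    return closed


# Hypothetical toy DNA sequences, for illustration only.
sequences = ["ACGTA", "ACGTT", "TACGG"]
frequent = frequent_substrings(sequences, min_support=2)
closed = closed_patterns(frequent)
```

On this toy input the closed set is much smaller than the frequent set, because patterns such as "AC" and "CG" are subsumed by "ACG" at the same support. The quadratic comparison in `closed_patterns` is the kind of candidate-maintenance overhead the abstract describes as costly.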
Keywords :
DNA; bioinformatics; data mining; pattern recognition; DNA sequence analysis; biological data; candidate maintenance; closed structured patterns; frequent sequential pattern; motifs mining; structured motif; synthetic data; Conference management; Databases; Engineering management; Information management; Information technology; Pattern analysis; Sequences; Technology management; closed structured motif; suffix tree
Conference_Title :
Future Information Technology and Management Engineering, 2009. FITME '09. Second International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-5339-9
DOI :
10.1109/FITME.2009.128