Title :
An algorithm for mining frequent patterns in biological sequence
Author :
Chen, Ling ; Liu, Wei
Abstract :
Most of the existing algorithms for mining frequent patterns could produce lots of projected databases and short patterns which could increase the time and memory cost of mining. In order to overcome such shortcoming, a fast and efficient algorithm named FBPM for mining frequent patterns in biological sequence is proposed. We first present the concept of primary pattern, and then use prefix tree for mining frequent primary patterns. A pattern growth approach is also presented to mine all the frequent patterns without producing large amount of irrelevant patterns. Our experimental results show that FBPM not only improves the performance but also achieves effective mining results.
Keywords :
biology computing; data mining; molecular biophysics; molecular configurations; trees (mathematics); FBPM; biological sequence; frequent pattern mining; pattern growth approach; prefix tree; primary pattern; Algorithm design and analysis; Complexity theory; Data mining; Databases; Proteins; Sorting; biological sequence; frequent pattern mining; prefix tree; primary pattern;
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2011 IEEE 1st International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-61284-851-8
DOI :
10.1109/ICCABS.2011.5729943