Title :
Constrained Frequent Subtree Mining Method
Author :
Kun Han ; Weifeng Lv ; Baocai Yin ; Yongli Hu
Abstract :
With the semi-structured data rapidly growing, it is crucial to obtain valuable information for different applications. So many data mining methods are proposed and the frequent sub trees mining is an important and typical method. The current mining methods demand substantial computational time and space, and return a huge number of patterns, but some important sub trees are often missed and some patterns are uninteresting to users. In this paper we proposed two novel algorithms, namely FSMDC and FSMIC, for mining frequent embedded sub trees from rooted labeled ordered trees database. In these proposed algorithms, the distance and interest constraint are introduced respectively to achieve expected mining results. The experiments show that the two newly developed algorithms are efficient, scalable and more consistent with purpose of users.
Keywords :
data mining; database management systems; trees (mathematics); FSMDC; FSMIC; constrained frequent subtree mining method; data mining methods; distance constraint; interest constraint; rooted labeled ordered trees database; Algorithm design and analysis; Classification algorithms; Data mining; Databases; Educational institutions; Mathematical model; Vegetation; distance constraint; frequent subtree mining; interest constraint;
Conference_Titel :
Digital Home (ICDH), 2014 5th International Conference on
Conference_Location :
Guangzhou
Print_ISBN :
978-1-4799-4285-5
DOI :
10.1109/ICDH.2014.62