Title :
Rapid prototyping of pattern mining problems isomorphic to boolean lattices
Author :
Flouvat, Frédéric ; Marchi, Fabien De ; Petit, Jean-Marc
Author_Institution :
LIRIS, Univ. de Lyon, Lyon
Abstract :
Interesting pattern mining is an important family of data mining problems with applications in many domains. In this paper, we focus on the special class of pattern mining problems known to be dasiarepresentable as setspsila. The main contribution of this paper is to take advantage of the common theoretical background of these problems from an implementation point of view by providing efficient data structures for boolean lattice representation and several implementations of well known algorithms. By the way, these problems can be implemented with only minimal effort, i.e. programmers do not have to be aware of low level code, customized data structures and algorithms being available for free. A toolkit, called iZi, has been devised and applied to several problems such as itemset mining, constraint mining in relational databases and query rewriting in data integration systems. According to our first results, the programs obtained using our toolkit offer a very good tradeoff between performances and development simplicity. Some methodological guidelines are also provided to guide the programmers both at the theoretical level and at the code level.
Keywords :
Boolean algebra; data mining; data structures; set theory; software prototyping; Boolean lattice representation; data mining problem; data structure; iZi toolkit; pattern mining problem; rapid prototyping; sets; Application software; Artificial intelligence; Data mining; Data structures; Databases; Itemsets; Lattices; Programming profession; Prototypes; Software engineering; data mining; pattern; toolkit;
Conference_Titel :
Research Challenges in Information Science, 2008. RCIS 2008. Second International Conference on
Conference_Location :
Marrakech
Print_ISBN :
978-1-4244-1677-6
Electronic_ISBN :
978-1-4244-2273-9
DOI :
10.1109/RCIS.2008.4632104