DocumentCode
3739364
Title
Automatically Generate a Flat Mining Table with Dataconda
Author
Michele Samorani
Author_Institution
Univ. of Alberta, Edmonton, AB, Canada
fYear
2015
Firstpage
1644
Lastpage
1647
Abstract
Classification and regression algorithms require a flat mining table as input, which in most cases is built manually by summarizing relational data into propositional features. This task is not only time consuming, but it also inhibits the discovery of new knowledge, because a small portion of the possible features will be built. Dataconda, a software program available on www.dataconda.net, makes this task automatic, thereby facilitating new knowledge to emerge. The user selects a target variable from any table of a relational database, and Dataconda builds and tests a large number of predictors by aggregating information from the other tables. For example, Dataconda may find that the best predictor for "customer loyalty" is the amount of money spent by the customer in cheap products, even if the user has not built any such feature. This demo will illustrate how to use Dataconda in a tutorial database and in a real-world database.
Keywords
"Aggregates","Data mining","Relational databases","Conferences","Software","Knowledge discovery"
Publisher
ieee
Conference_Titel
Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
Electronic_ISBN
2375-9259
Type
conf
DOI
10.1109/ICDMW.2015.100
Filename
7395878
Link To Document