DocumentCode :
2966749
Title :
A scalable bottom-up data mining algorithm for relational databases
Author :
Giuffrida, Giovanni ; Cooper, Lee G. ; Chu, Wesley W.
Author_Institution :
Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
fYear :
1998
fDate :
1-3 Jul 1998
Firstpage :
206
Lastpage :
209
Abstract :
Machine learning induction algorithms are difficult to scale to very large databases because of their memory-bound nature. Using virtual memory results in a significant performance degradation. To overcome such shortcomings, we developed a classification rule induction algorithm for relational databases. Our algorithm uses a bottom-up rule generation strategy that is more effective for mining databases having large cardinality of nominal variables. We have successfully used our algorithm to mine a retail grocery database containing more than 1.6 million records in about 5 hours on a dual Pentium processor PC
Keywords :
deductive databases; knowledge acquisition; learning by example; query processing; relational databases; retail data processing; software performance evaluation; very large databases; Pentium processor; bottom-up data mining algorithm; bottom-up rule generation; classification rule induction algorithm; induction algorithms; machine learning; memory-bound; performance; relational databases; retail grocery database; scalable algorithm; very large databases; virtual memory; Classification algorithms; Data mining; Indexing; Induction generators; Law; Machine learning; Operating systems; Relational databases; Spatial databases; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Scientific and Statistical Database Management, 1998. Proceedings. Tenth International Conference on
Conference_Location :
Capri
ISSN :
1099-3371
Print_ISBN :
0-8186-8575-1
Type :
conf
DOI :
10.1109/SSDM.1998.688125
Filename :
688125
Link To Document :
بازگشت