Title :
Efficient mining for association rules with relational database systems
Author :
Rajamani, Karthick ; Cox, Alan ; Iyer, Bala ; Chadha, Atul
Author_Institution :
Dept. of Electr. & Comput. Eng., Rice Univ., Houston, TX, USA
Abstract :
With the tremendous growth of large-scale data repositories, a need has arisen to integrate the exploratory techniques of data mining with the ability of relational systems to handle large volumes of data efficiently. We examine the performance of the most prevalent association rule mining algorithm, Apriori, on IBM's DB2 Universal Database system. We show that a multi-column (MC) data model is preferable to the commonly used single-column (SC) data model for association rule mining. The MC data model improves performance by factors of 4.8 to 6 over commercial implementations that use the SC data model. We also provide a new relational operator, called Combinations, for an efficient SQL implementation of Apriori in the database engine; this makes the mining application trivially parallelizable, reliable, and portable.
Keywords :
SQL; data mining; data models; data warehouses; relational databases; Apriori; Combinations; DB2 Universal Database system; MC data model; SC data model; SQL implementation; association rule mining; association rule mining algorithm; database engine; exploratory techniques; large data volumes; large scale data repositories; mining application; multi-column data model; parallelizability; portability; relational database systems; relational operator; relational systems; reliability; Association rules; Computer science; Data analysis; Data mining; Database systems; Electrical capacitance tomography; Engines; Prototypes; Relational databases
Conference_Title :
Database Engineering and Applications, 1999. IDEAS '99. International Symposium Proceedings
Conference_Location :
Montreal, Que.
Print_ISBN :
0-7695-0265-2
DOI :
10.1109/IDEAS.1999.787263