Title :
Web mining and its SQL based parallel execution
Author :
Kitsuregawa, Masaru ; Shintani, Takahiko ; Pramudiono, Iko
Author_Institution :
Inst. of Ind. Sci., Tokyo Univ., Japan
Abstract :
Web mining can be classified into two categories, Web access log mining and Web structure mining. We performed association rule mining and sequence pattern mining against the access log which was accumulated at NTT Software Mobile Info Search portal site. The detailed Web log mining process and the rules we derived are reported. The parallel association rule mining is explored on a large scale PC cluster system. Parallelism is key to improve the performance. We achieved substantial speed up through parallel SQL execution
Keywords :
Internet; SQL; data mining; information resources; information retrieval; relational databases; workstation clusters; Internet; Mobile Info Search portal site; PC cluster; SQL; Web access log mining; Web structure mining; association rule mining; parallel association rule mining; parallel execution; performance; sequence pattern mining; Association rules; Data mining; Extraterrestrial measurements; Information retrieval; Internet; Large-scale systems; Portals; Relational databases; Software performance; Web mining;
Conference_Titel :
Information Technology for Virtual Enterprises, 2001. ITVE 2001. Proceedings. Workshop on
Conference_Location :
Gold Coast, Qld.
Print_ISBN :
0-7695-0960-6
DOI :
10.1109/ITVE.2001.904496