DocumentCode
2112118
Title
Mining Web Browsing Log by Using Relaxed Biclique Enumeration Algorithm in MapReduce
Author
Chung-Tsai Su ; Wen-Kwang Tsao ; Wei-Rong Chu ; Ming-Ray Liao
Author_Institution
Trend Micro, Inc., Taipei, Taiwan
Volume
3
fYear
2012
fDate
4-7 Dec. 2012
Firstpage
54
Lastpage
58
Abstract
We propose a novel data mining framework using relaxed biclique for heterogeneous data. The framework is composed of three algorithms. First, an enumeration algorithm transforms heterogeneous databases into relaxed bicliques. Second, a tracking algorithm is used to find the bicliqueâs variations over time. Finally, a ranking algorithm classifies relaxed bicliques into groups according to their statistical properties and dynamic behaviors. The framework is highly flexible and can be easily extended to applications in different domains. The framework is implemented in MapReduce and is proven to be scalable for processing large-scale data in a reasonable amount of time. In addition, the experiments show that the algorithms are both scalable and efficient. The proposed framework can also be applied to web network analysis and deliver rapid-response solutions.
Keywords
Internet; data analysis; data mining; pattern classification; statistical analysis; MapReduce; Web browsing log mining; Web network analysis; data mining; dynamic behavior; heterogeneous database; ranking algorithm; rapid-response solution; relaxed biclique classification; relaxed biclique enumeration algorithm; statistical property; tracking algorithm; Bipartite Graph; Cloud Computing; Malicious Domain-IP Detection; MapReduce; Quasi-Clique;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location
Macau
Print_ISBN
978-1-4673-6057-9
Type
conf
DOI
10.1109/WI-IAT.2012.184
Filename
6511648
Link To Document