DocumentCode :
2112118
Title :
Mining Web Browsing Log by Using Relaxed Biclique Enumeration Algorithm in MapReduce
Author :
Chung-Tsai Su ; Wen-Kwang Tsao ; Wei-Rong Chu ; Ming-Ray Liao
Author_Institution :
Trend Micro, Inc., Taipei, Taiwan
Volume :
3
fYear :
2012
fDate :
4-7 Dec. 2012
Firstpage :
54
Lastpage :
58
Abstract :
We propose a novel data mining framework using relaxed biclique for heterogeneous data. The framework is composed of three algorithms. First, an enumeration algorithm transforms heterogeneous databases into relaxed bicliques. Second, a tracking algorithm is used to find the bicliqueâs variations over time. Finally, a ranking algorithm classifies relaxed bicliques into groups according to their statistical properties and dynamic behaviors. The framework is highly flexible and can be easily extended to applications in different domains. The framework is implemented in MapReduce and is proven to be scalable for processing large-scale data in a reasonable amount of time. In addition, the experiments show that the algorithms are both scalable and efficient. The proposed framework can also be applied to web network analysis and deliver rapid-response solutions.
Keywords :
Internet; data analysis; data mining; pattern classification; statistical analysis; MapReduce; Web browsing log mining; Web network analysis; data mining; dynamic behavior; heterogeneous database; ranking algorithm; rapid-response solution; relaxed biclique classification; relaxed biclique enumeration algorithm; statistical property; tracking algorithm; Bipartite Graph; Cloud Computing; Malicious Domain-IP Detection; MapReduce; Quasi-Clique;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
Type :
conf
DOI :
10.1109/WI-IAT.2012.184
Filename :
6511648
Link To Document :
بازگشت