• DocumentCode
    2112118
  • Title

    Mining Web Browsing Log by Using Relaxed Biclique Enumeration Algorithm in MapReduce

  • Author

    Chung-Tsai Su ; Wen-Kwang Tsao ; Wei-Rong Chu ; Ming-Ray Liao

  • Author_Institution
    Trend Micro, Inc., Taipei, Taiwan
  • Volume
    3
  • fYear
    2012
  • fDate
    4-7 Dec. 2012
  • Firstpage
    54
  • Lastpage
    58
  • Abstract
    We propose a novel data mining framework using relaxed biclique for heterogeneous data. The framework is composed of three algorithms. First, an enumeration algorithm transforms heterogeneous databases into relaxed bicliques. Second, a tracking algorithm is used to find the bicliqueâs variations over time. Finally, a ranking algorithm classifies relaxed bicliques into groups according to their statistical properties and dynamic behaviors. The framework is highly flexible and can be easily extended to applications in different domains. The framework is implemented in MapReduce and is proven to be scalable for processing large-scale data in a reasonable amount of time. In addition, the experiments show that the algorithms are both scalable and efficient. The proposed framework can also be applied to web network analysis and deliver rapid-response solutions.
  • Keywords
    Internet; data analysis; data mining; pattern classification; statistical analysis; MapReduce; Web browsing log mining; Web network analysis; data mining; dynamic behavior; heterogeneous database; ranking algorithm; rapid-response solution; relaxed biclique classification; relaxed biclique enumeration algorithm; statistical property; tracking algorithm; Bipartite Graph; Cloud Computing; Malicious Domain-IP Detection; MapReduce; Quasi-Clique;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
  • Conference_Location
    Macau
  • Print_ISBN
    978-1-4673-6057-9
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2012.184
  • Filename
    6511648