• DocumentCode
    2861620
  • Title

    A Mediator Exploiting Approach for Mining Indirect Associations from Web Data Streams

  • Author

    Lin, Wen-Yang ; Chen, Yi-Ching

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Univ. of Kaohsiung, Kaohsiung, Taiwan
  • fYear
    2011
  • fDate
    16-18 Dec. 2011
  • Firstpage
    183
  • Lastpage
    186
  • Abstract
    Recently, the concept of indirect associations, a new type of infrequent patterns that indirectly connect two rarely co-occurred items via a frequent item set called "mediator", has been shown its power in capturing interesting information over web usage data. Most contemporary indirect association mining algorithms are developed for static dataset. Our previous work has proposed an algorithm, MIA-LM, tailored to streaming data. In this paper, we propose a new efficient algorithm, namely EMIA-LM, for mining indirect associations over web data streams. EMIA-LM employs a mediator-exploiting search strategy, which reduce the search space as well as computation cost for generating indirect associations. Besides, EMIA-LM adopts a compact data structure, alleviating unnecessary data transforming processes and consuming far less memory storage. Preliminary experiments conducted on real Web streaming datasets show that EMIA-LM is superior to the leading HI-mine* algorithm for static data and MIA-LM both in computation speed and memory consumption.
  • Keywords
    Internet; data mining; data structures; pattern classification; EMIA-LM algorithm; Web data streams; Web streaming datasets; Web usage data; co-occurred items; data structure; data transforming processes; frequent itemset; indirect association mining algorithm; infrequent pattern mining; mediator-exploiting search strategy; memory storage; static dataset; Algorithm design and analysis; Data mining; Data models; Data structures; Heuristic algorithms; Itemsets; Memory management; Data stream; indirect association; infrequent pattern; landmark model; mediator;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovations in Bio-inspired Computing and Applications (IBICA), 2011 Second International Conference on
  • Conference_Location
    Shenzhan
  • Print_ISBN
    978-1-4577-1219-7
  • Type

    conf

  • DOI
    10.1109/IBICA.2011.50
  • Filename
    6118672