DocumentCode
2861620
Title
A Mediator Exploiting Approach for Mining Indirect Associations from Web Data Streams
Author
Lin, Wen-Yang ; Chen, Yi-Ching
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Univ. of Kaohsiung, Kaohsiung, Taiwan
fYear
2011
fDate
16-18 Dec. 2011
Firstpage
183
Lastpage
186
Abstract
Recently, the concept of indirect associations, a new type of infrequent patterns that indirectly connect two rarely co-occurred items via a frequent item set called "mediator", has been shown its power in capturing interesting information over web usage data. Most contemporary indirect association mining algorithms are developed for static dataset. Our previous work has proposed an algorithm, MIA-LM, tailored to streaming data. In this paper, we propose a new efficient algorithm, namely EMIA-LM, for mining indirect associations over web data streams. EMIA-LM employs a mediator-exploiting search strategy, which reduce the search space as well as computation cost for generating indirect associations. Besides, EMIA-LM adopts a compact data structure, alleviating unnecessary data transforming processes and consuming far less memory storage. Preliminary experiments conducted on real Web streaming datasets show that EMIA-LM is superior to the leading HI-mine* algorithm for static data and MIA-LM both in computation speed and memory consumption.
Keywords
Internet; data mining; data structures; pattern classification; EMIA-LM algorithm; Web data streams; Web streaming datasets; Web usage data; co-occurred items; data structure; data transforming processes; frequent itemset; indirect association mining algorithm; infrequent pattern mining; mediator-exploiting search strategy; memory storage; static dataset; Algorithm design and analysis; Data mining; Data models; Data structures; Heuristic algorithms; Itemsets; Memory management; Data stream; indirect association; infrequent pattern; landmark model; mediator;
fLanguage
English
Publisher
ieee
Conference_Titel
Innovations in Bio-inspired Computing and Applications (IBICA), 2011 Second International Conference on
Conference_Location
Shenzhan
Print_ISBN
978-1-4577-1219-7
Type
conf
DOI
10.1109/IBICA.2011.50
Filename
6118672
Link To Document