DocumentCode
2210742
Title
ABS: The Anti Bouncing Model for Usage Data Streams
Author
Zhang, Chongsheng ; Masseglia, Florent ; Lechevallier, Yves
Author_Institution
AxIS Project-Team, INRIA Sophia Antipolis-Mediterranee, Sophia Antipolis, France
fYear
2010
fDate
13-17 Dec. 2010
Firstpage
1169
Lastpage
1174
Abstract
Usage data mining is an important research area with applications in various fields. However, usage data is usually considered streaming, due to its high volumes and rates. Because of these characteristics, we only have access, at any point in time, to a small fraction of the stream. When the data is observed through such a limited window, it is challenging to give a reliable description of the recent usage data. We study the important consequences of these constraints, through the “bounce rate” problem and the clustering of usage data streams. Then, we propose the ABS (Anti-Bouncing Stream) model which combines the advantages of previous models but discards their drawbacks. First, under the same resource constraints as existing models in the literature, ABS can better model the recent data. Second, owing to its simple but effective management approach, the data in ABS is available at any time for analysis. We demonstrate its superiority through a theoretical study and experiments on two real-world data sets.
Keywords
data handling; pattern clustering; anti bouncing model; bounce rate problem; data stream; data streams clustering; bounce rate; clustering; data streams; usage;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location
Sydney, NSW
ISSN
1550-4786
Print_ISBN
978-1-4244-9131-5
Electronic_ISBN
1550-4786
Type
conf
DOI
10.1109/ICDM.2010.91
Filename
5694103
Link To Document