DocumentCode :
2408250
Title :
Real time Web usage mining with a distributed navigation analysis
Author :
Masseglia, Florent ; Teisseire, Maguelonne ; Poncelet, Pascal
Author_Institution :
LIRMM, Montpellier, France
fYear :
2002
fDate :
2002
Firstpage :
169
Lastpage :
174
Abstract :
The behaviour of a Web site\´s users may change so quickly that attempting to make predictions, according to the frequent patterns coming from the analysis of an access log file, becomes challenging. In order for the obsolescence of the behavioural patterns to become as null as possible, the ideal method would provide frequent patterns in real time, allowing the result to be available immediately. We propose a method allowing us to find frequent behavioural patterns in real time, whatever the number of connected users is. Considering how fast the frequent behaviour patterns can change since the last analysis of the access log file, this result thus provides completely adapted navigation schemas for user behaviour predictions. Based on a distributed heuristic, our method also answers several tackled problems within the data mining framework: discovering "interesting zones" (a great number of frequent patterns concentrated over a period of time, or the discovering of "super-frequent" patterns), discovering very long sequential patterns and interactive data mining ("on the fly" modification of the minimum support)
Keywords :
data mining; distributed programming; information resources; information retrieval; real-time systems; Web site users; access log file; adapted navigation schemas; behavioural patterns; connected users; data mining framework; distributed heuristic; distributed navigation analysis; frequent behavioural patterns; frequent patterns; interactive data mining; real time Web usage mining; super-frequent patterns; user behaviour predictions; very long sequential patterns; Data engineering; Data mining; Electronic commerce; Information analysis; Navigation; Pattern analysis; Performance analysis; Uniform resource locators; Web server; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Research Issues in Data Engineering: Engineering E-Commerce/E-Business Systems, 2002. RIDE-2EC 2002. Proceedings. Twelfth International Workshop on
Conference_Location :
San Jose, CA
ISSN :
1066-1395
Print_ISBN :
0-7695-1480-4
Type :
conf
DOI :
10.1109/RIDE.2002.995111
Filename :
995111
Link To Document :
بازگشت