DocumentCode :
2737656
Title :
Detecting Task-Based Query Sessions Using Collaborative Knowledge
Author :
Lucchese, Claudio ; Orlando, Salvatore ; Perego, Raffaele ; Silvestri, F. ; Tolomei, G.
Author_Institution :
HPC Lab., ISTI-CNR, Pisa, Italy
Volume :
3
fYear :
2010
fDate :
Aug. 31 2010-Sept. 3 2010
Firstpage :
128
Lastpage :
131
Abstract :
Our research challenge is to provide a mechanism for splitting a long-term log of queries submitted to a Web Search Engine (WSE) into user task-based sessions. The hypothesis is that some query sessions entail the concept of a user task. We present an approach that relies on a centroid-based and a density-based clustering algorithm, which consider query inter-arrival times and use a novel distance function that accounts for query lexical content and exploits the collaborative knowledge collected by Wiktionary and Wikipedia.
Keywords :
Internet; Web sites; pattern clustering; query formulation; search engines; Web search engine; Wikipedia; Wiktionary; centroid-based clustering algorithm; collaborative knowledge; density-based clustering algorithm; distance function; query inter-arrival times; query lexical content; task-based query session detection; Clustering algorithms; Electronic publishing; Encyclopedias; Semantics; Silicon; Query clustering; Query log session breaking; Task-based session; Web data mining
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
Conference_Location :
Toronto, ON
Print_ISBN :
978-1-4244-8482-9
Electronic_ISBN :
978-0-7695-4191-4
Type :
conf
DOI :
10.1109/WI-IAT.2010.281
Filename :
5614473