DocumentCode
2737656
Title
Detecting Task-Based Query Sessions Using Collaborative Knowledge
Author
Lucchese, Claudio ; Orlando, Salvatore ; Perego, Raffaele ; Silvestri, F. ; Tolomei, G.
Author_Institution
HPC Lab., ISTI-CNR, Pisa, Italy
Volume
3
fYear
2010
fDate
Aug. 31 2010-Sept. 3 2010
Firstpage
128
Lastpage
131
Abstract
Our research challenge is to provide a mechanism for splitting into user task-based sessions a long-term log of queries submitted to a Web Search Engine (WSE). The hypothesis is that some query sessions entail the concept of user task. We present an approach that relies on a centroid-based and a density-based clustering algorithm, which consider queries inter-arrival times and use a novel distance function that takes care of query lexical content and exploits the collaborative knowledge collected by Wiktionary and Wikipedia.
Keywords
Internet; Web sites; pattern clustering; query formulation; search engines; Web search engine; Wikipedia; Wiktionary; centroid-based clustering algorithm; collaborative knowledge; density-based clustering algorithm; distance function; query inter-arrival times; query lexical content; task-based query session detection; Clustering algorithms; Electronic publishing; Encyclopedias; Internet; Semantics; Silicon; Query clustering; Query log session breaking; Task-based session; Web data mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
Conference_Location
Toronto, ON
Print_ISBN
978-1-4244-8482-9
Electronic_ISBN
978-0-7695-4191-4
Type
conf
DOI
10.1109/WI-IAT.2010.281
Filename
5614473
Link To Document