• DocumentCode
    2737656
  • Title

    Detecting Task-Based Query Sessions Using Collaborative Knowledge

  • Author

    Lucchese, Claudio ; Orlando, Salvatore ; Perego, Raffaele ; Silvestri, F. ; Tolomei, G.

  • Author_Institution
    HPC Lab., ISTI-CNR, Pisa, Italy
  • Volume
    3
  • fYear
    2010
  • fDate
    Aug. 31 2010-Sept. 3 2010
  • Firstpage
    128
  • Lastpage
    131
  • Abstract
    Our research challenge is to provide a mechanism for splitting a long-term log of queries submitted to a Web Search Engine (WSE) into user task-based sessions. The hypothesis is that some query sessions entail the concept of a user task. We present an approach that relies on a centroid-based and a density-based clustering algorithm, which consider query inter-arrival times and use a novel distance function that accounts for query lexical content and exploits the collaborative knowledge collected by Wiktionary and Wikipedia.
  • Keywords
    Internet; Web sites; pattern clustering; query formulation; search engines; Web search engine; Wikipedia; Wiktionary; centroid-based clustering algorithm; collaborative knowledge; density-based clustering algorithm; distance function; query inter-arrival times; query lexical content; task-based query session detection; Clustering algorithms; Electronic publishing; Encyclopedias; Semantics; Silicon; Query clustering; Query log session breaking; Task-based session; Web data mining
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2010 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Toronto, ON
  • Print_ISBN
    978-1-4244-8482-9
  • Electronic_ISBN
    978-0-7695-4191-4
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2010.281
  • Filename
    5614473