• DocumentCode
    3248694
  • Title

    Towards automatic generation of query taxonomy: a hierarchical query clustering approach

  • Author

    Chuang, Shui-Lung ; Chien, Lee-Feng

  • Author_Institution
    Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    75
  • Lastpage
    82
  • Abstract
    Most previous work on automatic query clustering generated a flat, un-nested partition of query terms. In this work, we discuss the organization of query terms into a hierarchical structure and construct a query taxonomy in an automatic way. The proposed approach is designed based on a hierarchical agglomerative clustering algorithm to hierarchically group similar queries and generate cluster hierarchies using a novel cluster partition technique. The search processes of real-world search engines are combined to obtain highly ranked Web documents as the feature source for each query term. Preliminary experiments show that the proposed approach is effective for obtaining thesaurus information for query terms, and is also feasible for constructing a query taxonomy which provides a basis for in-depth analysis of users´ search interests and domain-specific vocabulary on a larger scale.
  • Keywords
    information needs; information retrieval; search engines; thesauri; automatic query taxonomy generation; cluster partition technique; domain-specific vocabulary; hierarchical agglomerative clustering algorithm; hierarchical query clustering approach; highly ranked Web documents; search engines; thesaurus information; user search interests; Classification tree analysis; Clustering algorithms; Information science; Marine vehicles; Performance analysis; Search engines; Taxonomy; Terminology; Thesauri; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2002. ICDM 2003. Proceedings. 2002 IEEE International Conference on
  • Print_ISBN
    0-7695-1754-4
  • Type

    conf

  • DOI
    10.1109/ICDM.2002.1183888
  • Filename
    1183888