DocumentCode
3248694
Title
Towards automatic generation of query taxonomy: a hierarchical query clustering approach
Author
Chuang, Shui-Lung ; Chien, Lee-Feng
Author_Institution
Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
fYear
2002
fDate
2002
Firstpage
75
Lastpage
82
Abstract
Most previous work on automatic query clustering generated a flat, un-nested partition of query terms. In this work, we discuss the organization of query terms into a hierarchical structure and construct a query taxonomy in an automatic way. The proposed approach is designed based on a hierarchical agglomerative clustering algorithm to hierarchically group similar queries and generate cluster hierarchies using a novel cluster partition technique. The search processes of real-world search engines are combined to obtain highly ranked Web documents as the feature source for each query term. Preliminary experiments show that the proposed approach is effective for obtaining thesaurus information for query terms, and is also feasible for constructing a query taxonomy which provides a basis for in-depth analysis of users' search interests and domain-specific vocabulary on a larger scale.
Keywords
information needs; information retrieval; search engines; thesauri; automatic query taxonomy generation; cluster partition technique; domain-specific vocabulary; hierarchical agglomerative clustering algorithm; hierarchical query clustering approach; highly ranked Web documents; search engines; thesaurus information; user search interests; Classification tree analysis; Clustering algorithms; Information science; Marine vehicles; Performance analysis; Search engines; Taxonomy; Terminology; Thesauri; Vocabulary;
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Mining, 2002. ICDM 2002. Proceedings. 2002 IEEE International Conference on
Print_ISBN
0-7695-1754-4
Type
conf
DOI
10.1109/ICDM.2002.1183888
Filename
1183888