DocumentCode :
3285676
Title :
A New Search Results Clustering Algorithm Based on Formal Concept Analysis
Author :
Zhang, Yun ; Feng, BoQin ; Xue, Yewei
Author_Institution :
Sch. of Electron. & Inf. Eng., Xian Jiaotong Univ., Xian
Volume :
2
fYear :
2008
fDate :
18-20 Oct. 2008
Firstpage :
356
Lastpage :
360
Abstract :
Organizing Web search results into a hierarchy of topics and subtopics facilitates browsing the collection and locating results of interest. In this paper, we propose a new method based on formal concept analysis (FCA) to build a two-level hierarchy for the search results retrieved for a query. After formal concepts are extracted using FCA, a new algorithm is proposed to extract the concepts most relevant to the query, and a two-level hierarchy is built and presented to the user. Evaluating the quality of the resulting clusters is a non-trivial task. Two improved objective metrics of clustering quality, ANMI@K and ANCE@K, are proposed in this paper. We compare our method with three other search results clustering (SRC) algorithms, suffix tree clustering (STC), Lingo, and Vivisimo, using a comprehensive set of documents obtained from the Open Directory Project hierarchy as a benchmark. In addition to the comparison based on objective measures, we also subjectively analyze the properties of the cluster labels produced by the different SRC algorithms. The experimental results show that our method outperforms the other three SRC algorithms and is helpful for browsing and locating results of interest.
Keywords :
Internet; query formulation; Lingo; Suffix tree clustering; Vivisimo; Web search; formal concept analysis; information retrieval; query; search results clustering algorithm; Algorithm design and analysis; Clustering algorithms; Fuzzy systems; Information analysis; Information retrieval; Knowledge engineering; Lattices; Organizing; Singular value decomposition; Web search; ANCE@K; ANMI@K; formal concept analysis; search results clustering;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2008. FSKD '08. Fifth International Conference on
Conference_Location :
Shandong
Print_ISBN :
978-0-7695-3305-6
Type :
conf
DOI :
10.1109/FSKD.2008.140
Filename :
4666138