DocumentCode
2185907
Title
A method of Web search result clustering based on rough sets
Author
Ngo, Chi Lang ; Nguyen, Hung Son
Author_Institution
Inst. of Math., Warsaw Univ., Poland
fYear
2005
fDate
19-22 Sept. 2005
Firstpage
673
Lastpage
679
Abstract
Due to the enormous size of the Web and low precision of user queries, finding the right information from the Web can be difficult if not impossible. One approach that tries to solve this problem is using clustering techniques for grouping similar document together in order to facilitate presentation of results in more compact form and enable thematic browsing of the results set. The main problem of many Web search result (snippet) clustering algorithm is based on the poor vector representation of snippets. In this paper, we present a method of snippet representation enrichment using tolerance rough set model. We applied the proposed method to construct a rough set based search result clustering algorithm and compared it with other recent methods.
Keywords
Internet; document handling; information retrieval; pattern clustering; rough set theory; Web search; document clustering; snippet representation enrichment; tolerance rough set model; vector representation; Algorithm design and analysis; Clustering algorithms; Mathematics; Navigation; Rough sets; Scalability; Search engines; Set theory; Web pages; Web search; clustering; rough sets; snippet;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
Print_ISBN
0-7695-2415-X
Type
conf
DOI
10.1109/WI.2005.7
Filename
1517931
Link To Document