• DocumentCode
    2185907
  • Title

    A method of Web search result clustering based on rough sets

  • Author

    Ngo, Chi Lang ; Nguyen, Hung Son

  • Author_Institution
    Inst. of Math., Warsaw Univ., Poland
  • fYear
    2005
  • fDate
    19-22 Sept. 2005
  • Firstpage
    673
  • Lastpage
    679
  • Abstract
    Due to the enormous size of the Web and low precision of user queries, finding the right information from the Web can be difficult if not impossible. One approach that tries to solve this problem is using clustering techniques for grouping similar document together in order to facilitate presentation of results in more compact form and enable thematic browsing of the results set. The main problem of many Web search result (snippet) clustering algorithm is based on the poor vector representation of snippets. In this paper, we present a method of snippet representation enrichment using tolerance rough set model. We applied the proposed method to construct a rough set based search result clustering algorithm and compared it with other recent methods.
  • Keywords
    Internet; document handling; information retrieval; pattern clustering; rough set theory; Web search; document clustering; snippet representation enrichment; tolerance rough set model; vector representation; Algorithm design and analysis; Clustering algorithms; Mathematics; Navigation; Rough sets; Scalability; Search engines; Set theory; Web pages; Web search; clustering; rough sets; snippet;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence, 2005. Proceedings. The 2005 IEEE/WIC/ACM International Conference on
  • Print_ISBN
    0-7695-2415-X
  • Type

    conf

  • DOI
    10.1109/WI.2005.7
  • Filename
    1517931