• DocumentCode
    506865
  • Title
    A K-means Approach Based on Concept Hierarchical Tree for Search Results Clustering
  • Author
    Jiang, Peng ; Zhang, Chunxia ; Guo, Guisuo ; Niu, Zhendong ; Gao, Dongping
  • Author_Institution
    Sch. of Comput. Sci. & Technol., Beijing Inst. of Technol., Beijing, China
  • Volume
    1
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    380
  • Lastpage
    386
  • Abstract
    Search results clustering aims to facilitate users' information retrieval and query refinement by grouping, online, similar documents returned by a search engine. It places stringent requirements on performance and on the meaningfulness of cluster labels. Thus, most existing clustering algorithms, such as K-means and agglomerative hierarchical clustering, cannot be applied directly to online search results clustering. In this paper, we propose a K-means approach based on a concept hierarchical tree to cluster search results. This algorithm not only overcomes two weaknesses of the classic K-means method, namely that the results depend on the initial seeds and that the parameter k is often unknown, but also satisfies the requirements of online search results clustering. Our method exploits the semantic relations among documents by mapping terms to concepts in the concept hierarchical tree, which can be constructed from WordNet. We have developed a meta-search and clustering system based on our approach and evaluated it with an objective and repeatable evaluation procedure. Experimental results indicate that the proposed algorithm is effective and suitable for clustering search results.
  • Keywords
    pattern clustering; query formulation; search engines; trees (mathematics); K-means approach; WordNet; concept hierarchical tree; information retrieval process; meta-search; online search results clustering; query refinement; search engine; Clustering algorithms; Computer science; Filters; Fuzzy systems; Information retrieval; Metasearch; Rail transportation; Search engines; Web search; Web sites;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3735-1
  • Type
    conf
  • DOI
    10.1109/FSKD.2009.658
  • Filename
    5358569
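
The abstract's core idea, mapping terms to concepts in a hierarchy and then clustering with K-means, can be illustrated with a minimal sketch. This is not the authors' implementation: the toy `PARENT` hierarchy stands in for a WordNet-derived tree, and seeding one centroid per root concept is only an assumed way of fixing the initial seeds and deriving k from the data. All function names here are hypothetical.

```python
from collections import Counter

# Hypothetical toy hierarchy (child -> parent); the paper builds its tree from WordNet.
PARENT = {
    "jaguar": "feline", "tiger": "feline", "feline": "animal",
    "sedan": "car", "coupe": "car", "car": "vehicle",
}

def concepts(term):
    """Return the term followed by all of its ancestors in the toy hierarchy."""
    chain = [term]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain

def to_concept_vector(snippet):
    """Count every concept reached by the known terms of a snippet."""
    counts = Counter()
    for term in snippet.lower().split():
        if term in PARENT:
            counts.update(concepts(term))
    return counts

def cosine(a, b):
    """Cosine similarity between two sparse concept-count vectors."""
    dot = sum(a[c] * b[c] for c in a if c in b)
    norm = (sum(v * v for v in a.values()) * sum(v * v for v in b.values())) ** 0.5
    return dot / norm if norm else 0.0

def cluster(snippets, iterations=10):
    """K-means-style clustering with cosine similarity over concept vectors."""
    vectors = [to_concept_vector(s) for s in snippets]
    # Assumption: one seed per root concept observed, so k is derived, not supplied.
    roots = sorted({concepts(t)[-1]
                    for s in snippets for t in s.lower().split() if t in PARENT})
    centroids = [Counter({r: 1}) for r in roots]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: each snippet joins its most similar centroid.
        labels = [max(range(len(centroids)), key=lambda i: cosine(v, centroids[i]))
                  for v in vectors]
        # Update step: summed member vectors (cosine ignores the overall scale).
        for i in range(len(centroids)):
            members = [v for v, lab in zip(vectors, labels) if lab == i]
            if members:
                total = Counter()
                for m in members:
                    total.update(m)
                centroids[i] = total
    return roots, labels

roots, labels = cluster([
    "jaguar tiger habitat", "sedan coupe prices",
    "tiger conservation", "coupe review",
])
```

In this sketch the root concepts double as candidate cluster labels, which loosely mirrors the abstract's requirement for meaningful labels; a real system would need word-sense disambiguation and a far larger hierarchy.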