• DocumentCode
    480742
  • Title

    An Unsupervised Approach to Cluster Web Search Results Based on Word Sense Communities

  • Author

    Chen, Jiyang ; Zaiane, Osmar R. ; Goebel, Randy

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Alberta, Edmonton, AB
  • Volume
    1
  • fYear
    2008
  • fDate
    9-12 Dec. 2008
  • Firstpage
    725
  • Lastpage
    729
  • Abstract
    Effectively organizing Web search results into clusters is important to facilitate quick user navigation to relevant documents. Previous methods may rely on a training process and do not provide a measure for whether page clustering is actually required. In this paper, we reformalize the clustering problem as a word sense discovery problem. Given a query and a list of result pages, our unsupervised method detects word sense communities in the extracted keyword network. The documents are assigned to several refined word sense communities to form clusters. We use the modularity score of the discovered keyword community structure to measure page clustering necessity. Experimental results verify our method´s feasibility and effectiveness.
  • Keywords
    Internet; natural languages; pattern clustering; query processing; relevance feedback; search engines; Web search result organization; keyword network; modularity score; page clustering; query sense identification; relevant documents; unsupervised Web search result clustering; user navigation; word sense community; word sense discovery problem; Clustering algorithms; Clustering methods; Communities; Intelligent agent; Navigation; Organizing; Search engines; Training data; Web pages; Web search; Community Mining; Document Clustering; Word Sense Community;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-0-7695-3496-1
  • Type

    conf

  • DOI
    10.1109/WIIAT.2008.24
  • Filename
    4740536