• DocumentCode
    1868337
  • Title

    Essential Pages

  • Author

    Swaminathan, Ashwin ; Mathew, Cherian V. ; Kirovski, Darko

  • Volume
    1
  • fYear
    2009
  • fDate
    15-18 Sept. 2009
  • Firstpage
    173
  • Lastpage
    182
  • Abstract
    Results to Web search queries are ranked using heuristics that typically analyze the global link topology, user behavior, and content relevance. We point to a particular inefficiency of such methods: information redundancy. In queries where learning about a subject is an objective, modern search engines return relatively unsatisfactory results as they consider the query coverage by each page individually, not a set of pages as a whole. We address this problem using essential pages. If we denote as $mathbb{S}_Q$ the total knowledge that exists on the Web about a given query $Q$, we want to build a search engine that returns a set of essential pages $E_Q$ that maximizes the information covered over $mathbb{S}_Q$. We present a preliminary prototype that optimizes the selection of essential pages; we draw some informal comparisons with respect to existing search engines; and finally, we evaluate our prototype using a blind-test user study.
  • Keywords
    Web page ranking; Web search; coverage; learning queries; redundancy elimination;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technologies, 2009. WI-IAT '09. IEEE/WIC/ACM International Joint Conferences on
  • Conference_Location
    Milan, Italy
  • Print_ISBN
    978-0-7695-3801-3
  • Electronic_ISBN
    978-1-4244-5331-3
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2009.33
  • Filename
    5286076