• DocumentCode
    884489
  • Title

    A unified probabilistic framework for Web page scoring systems

  • Author

    Diligenti, Michelangelo ; Gori, Marco ; Maggini, Marco

  • Author_Institution
    Dipt. di Ingegneria dell´´Informazione, Universita degli Studi di Siena, Italy
  • Volume
    16
  • Issue
    1
  • fYear
    2004
  • Firstpage
    4
  • Lastpage
    16
  • Abstract
    The definition of efficient page ranking algorithms is becoming an important issue in the design of the query interface of Web search engines. Information flooding is a common experience especially when broad topic queries are issued. Queries containing only one or two keywords usually match a huge number of documents, while users can only afford to visit the first positions of the returned list, which do not necessarily refer to the most appropriate answers. Some successful approaches to page ranking in a hyperlinked environment, like the Web, are based on link analysis. We propose a general probabilistic framework for Web page scoring systems (WPSS), which incorporates and extends many of the relevant models proposed in the literature. In particular, we introduce scoring systems for both generic (horizontal) and focused (vertical) search engines. Whereas horizontal scoring algorithms are only based on the topology of the Web graph, vertical ranking also takes the page contents into account and are the base for focused and user adapted search interfaces. Experimental results are reported to show the properties of some of the proposed scoring systems with special emphasis on vertical search.
  • Keywords
    Internet; Web design; human factors; search engines; user interfaces; WPSS; Web graph; Web page scoring systems; Web search engines; adapted search interfaces; broad topic queries; horizontal scoring algorithms; hyperlinked environment; information flooding; page ranking; page ranking algorithms; probabilistic framework; query interface; random walks; unified probabilistic framework; vertical ranking; vertical search; Algorithm design and analysis; Computer Society; Databases; Floods; Helium; Information retrieval; Search engines; Topology; Web pages; Web search;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2004.1264818
  • Filename
    1264818