• DocumentCode
    2427950
  • Title

    A Model for Ranking Entities and Its Application to Wikipedia

  • Author

    Demartini, Gianluca ; Firan, Claudiu S. ; Iofciu, Tereza ; Krestel, Ralf ; Nejdl, Wolfgang

  • Author_Institution
    L3S Res. Center, Leibniz Univ. Hannover, Hannover
  • fYear
    2008
  • fDate
    28-30 Oct. 2008
  • Firstpage
    29
  • Lastpage
    38
  • Abstract
    Entity Ranking (ER) is a recently emerging search task in Information Retrieval, where the goal is not finding documents matching the query words, but instead finding entities which match types and attributes mentioned in the query. In this paper we propose a formal model to define entities as well as a complete ER system, providing examples of its application to enterprise, Web, and Wikipedia scenarios. Since searching for entities on Web scale repositories is an open challenge as the effectiveness of ranking is usually not satisfactory, we present a set of algorithms based on our model and evaluate their retrieval effectiveness. The results show that combining simple Link Analysis, Natural Language Processing, and Named Entity Recognition methods improves retrieval performance of entity search by over 53% for P@10 and 35% for MAP.
  • Keywords
    Internet; pattern matching; query processing; Web scale repository; attribute matching; entity ranking; formal model; information retrieval; query word; search task; type matching; wikipedia; Algorithm design and analysis; Data mining; Erbium; Information retrieval; Natural language processing; Performance analysis; Search engines; Testing; Web pages; Wikipedia; Wikipedia; entity ranking; evaluation; model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Conference, 2008. LA-WEB '08., Latin American
  • Conference_Location
    Espfrito Santo
  • Print_ISBN
    978-0-7695-3397-1
  • Electronic_ISBN
    978-0-7695-3397-1
  • Type

    conf

  • DOI
    10.1109/LA-WEB.2008.8
  • Filename
    4756159