DocumentCode
2427950
Title
A Model for Ranking Entities and Its Application to Wikipedia
Author
Demartini, Gianluca ; Firan, Claudiu S. ; Iofciu, Tereza ; Krestel, Ralf ; Nejdl, Wolfgang
Author_Institution
L3S Res. Center, Leibniz Univ. Hannover, Hannover
fYear
2008
fDate
28-30 Oct. 2008
Firstpage
29
Lastpage
38
Abstract
Entity Ranking (ER) is a recently emerging search task in Information Retrieval, where the goal is not to find documents matching the query words, but to find entities that match the types and attributes mentioned in the query. In this paper we propose a formal model to define entities as well as a complete ER system, providing examples of its application to enterprise, Web, and Wikipedia scenarios. Searching for entities in Web-scale repositories remains an open challenge because ranking effectiveness is usually unsatisfactory; we therefore present a set of algorithms based on our model and evaluate their retrieval effectiveness. The results show that combining simple Link Analysis, Natural Language Processing, and Named Entity Recognition methods improves retrieval performance of entity search by over 53% for P@10 and 35% for MAP.
Keywords
Internet; pattern matching; query processing; Web scale repository; attribute matching; entity ranking; formal model; information retrieval; query word; search task; type matching; Wikipedia; Algorithm design and analysis; Data mining; Erbium; Natural language processing; Performance analysis; Search engines; Testing; Web pages; evaluation; model
fLanguage
English
Publisher
IEEE
Conference_Titel
Web Conference, 2008. LA-WEB '08., Latin American
Conference_Location
Espírito Santo
Print_ISBN
978-0-7695-3397-1
Electronic_ISBN
978-0-7695-3397-1
Type
conf
DOI
10.1109/LA-WEB.2008.8
Filename
4756159