DocumentCode :
610373
Title :
Learning to rank from distant supervision: Exploiting noisy redundancy for relational entity search
Author :
Mianwei Zhou ; Hongning Wang ; Change, Kevin Chen-Chuan
Author_Institution :
Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear :
2013
fDate :
8-12 April 2013
Firstpage :
829
Lastpage :
840
Abstract :
In this paper, we study the task of relational entity search which aims at automatically learning an entity ranking function for a desired relation. To rank entities, we exploit the redundancy abound in their snippets; however, such redundancy is noisy as not all the snippets represent information relevant to the desired relation. To explore useful information from such noisy redundancy, we abstract the task as a distantly supervised ranking problem - based on coarse entity-level annotations, deriving a relation-specific ranking function for the purpose of online searching. As the key challenge, without detailed snippet-level annotations, we have to learn an entity ranking function that can effectively filter noise; furthermore, the ranking function should also be online executable. We develop Pattern-based Filter Network (PFNet), a novel probabilistic graphical model, as our solution. To balance the accuracy and efficiency requirements, PFNet selects a limited size of indicative patterns to filter noisy snippets, and inverted indexes are utilized to retrieve required features. Experiments on the large scale CuleWeb09 data set for six different relations confirm the effectiveness of the proposed PFNet model, which outperforms five state-of-the-art relational entity ranking methods.
Keywords :
feature extraction; graph theory; information retrieval; learning (artificial intelligence); probability; CuleWeb09 data set; PFNet; coarse entity-level annotation; distant supervision; distantly supervised ranking problem; entity ranking function; feature retrieval; indicative pattern; inverted index; noise filtering; noisy redundancy; noisy snippet filtering; online searching; pattern-based filter network; probabilistic graphical model; rank learning; relation-specific ranking function; relational entity search; snippet-level annotation; Accuracy; Indexes; Logic gates; Noise; Noise measurement; Redundancy; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
ISSN :
1063-6382
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
Type :
conf
DOI :
10.1109/ICDE.2013.6544878
Filename :
6544878
Link To Document :
بازگشت