مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning to Rank for Query-Focused Multi-document Summarization

DocumentCode :

3125162

Title :

Learning to Rank for Query-Focused Multi-document Summarization

Author :

Shen, Chao ; Li, Tao

Author_Institution :

Sch. of Comput. & Inf. Sci., Florida Int. Univ., Miami, FL, USA

fYear :

2011

fDate :

11-14 Dec. 2011

Firstpage :

626

Lastpage :

634

Abstract :

In this paper, we explore how to use ranking SVM to train the feature weights for query-focused multi-document summarization. To apply a supervised learning method to sentence extraction in multi-document summarization, we need to derive the sentence labels for training corpus from the existing human labeling data in form of. However, this process is not trivial, because the human summaries are abstractive, and do not necessarily well match the sentences in the documents. In this paper, we try to address the above problem from the following two aspects. First, we make use of sentence-to-sentence relationships to better estimate the probability of a sentence in the document set to be a summary sentence. Second, to make the derived training data less sensitive, we adopt a cost sensitive loss in the ranking SVM´s objective function. The experimental results demonstrate the effectiveness of our proposed method.

Keywords :

document handling; learning (artificial intelligence); probability; query processing; support vector machines; SVM ranking; feature weights; query-focused multidocument summarization; rank learning; sentence probability estimation; sentence-to-sentence relationships; support vector machine; Feature extraction; Humans; Redundancy; Supervised learning; Support vector machines; Training; Training data; learning to rank; query-based multi-document summarization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Data Mining (ICDM), 2011 IEEE 11th International Conference on

Conference_Location :

Vancouver,BC

ISSN :

1550-4786

Print_ISBN :

978-1-4577-2075-8

Type :

conf

DOI :

10.1109/ICDM.2011.91

Filename :

6137267

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3125162