DocumentCode :
3125162
Title :
Learning to Rank for Query-Focused Multi-document Summarization
Author :
Shen, Chao ; Li, Tao
Author_Institution :
Sch. of Comput. & Inf. Sci., Florida Int. Univ., Miami, FL, USA
fYear :
2011
fDate :
11-14 Dec. 2011
Firstpage :
626
Lastpage :
634
Abstract :
In this paper, we explore how to use ranking SVM to train the feature weights for query-focused multi-document summarization. To apply a supervised learning method to sentence extraction in multi-document summarization, we need to derive the sentence labels for training corpus from the existing human labeling data in form of. However, this process is not trivial, because the human summaries are abstractive, and do not necessarily well match the sentences in the documents. In this paper, we try to address the above problem from the following two aspects. First, we make use of sentence-to-sentence relationships to better estimate the probability of a sentence in the document set to be a summary sentence. Second, to make the derived training data less sensitive, we adopt a cost sensitive loss in the ranking SVM´s objective function. The experimental results demonstrate the effectiveness of our proposed method.
Keywords :
document handling; learning (artificial intelligence); probability; query processing; support vector machines; SVM ranking; feature weights; query-focused multidocument summarization; rank learning; sentence probability estimation; sentence-to-sentence relationships; support vector machine; Feature extraction; Humans; Redundancy; Supervised learning; Support vector machines; Training; Training data; learning to rank; query-based multi-document summarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver,BC
ISSN :
1550-4786
Print_ISBN :
978-1-4577-2075-8
Type :
conf
DOI :
10.1109/ICDM.2011.91
Filename :
6137267
Link To Document :
بازگشت