DocumentCode :
730823
Title :
Automatic broadcast news summarization via rank classifiers and crowdsourced annotation
Author :
Parthasarathy, Srinivas ; Hasan, Taufiq
Author_Institution :
Res. & Technol. Center, Robert Bosch LLC, USA
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
5256
Lastpage :
5260
Abstract :
Extractive speech summarization methods generally operate as a binary classifier deciding if a sentence belongs to the summary or not. However, it is well known that even human annotators do not agree on selecting most summary sentences. In this paper, we take a probabilistic view of the summarization ground-truth and assume that more frequently selected sentences by annotators are of higher importance. Using a large summary data-set obtained through crowdsourcing, we empirically show that sentence selection frequency is inversely related to its summarization rank. Consequently, we model the relative importance between sentences using a rank-based classifier. Additionally, we utilize an extended paralinguistic feature set that has not been previously used for speech summarization. Lexical and structural features are also included. Support Vector Machine (SVM) is used as the baseline binary classifier and rank classifier. Experimental evaluations show that the proposed approach outperforms traditional binary classifiers with respect to various ROUGE summarization metrics for different summarization compression ratios (CR).
Keywords :
speech processing; support vector machines; SVM; automatic broadcast news summarization; baseline binary classifier; crowdsourced annotation; extended paralinguistic feature; extractive speech summarization methods; lexical features; probabilistic view; rank-based classifier; structural features; summarization ground-truth; support vector machine; Measurement; Spoken document summarization; crowdsourcing; paralinguistic features;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178974
Filename :
7178974
Link To Document :
بازگشت