DocumentCode
730823
Title
Automatic broadcast news summarization via rank classifiers and crowdsourced annotation
Author
Parthasarathy, Srinivas ; Hasan, Taufiq
Author_Institution
Res. & Technol. Center, Robert Bosch LLC, USA
fYear
2015
fDate
19-24 April 2015
Firstpage
5256
Lastpage
5260
Abstract
Extractive speech summarization methods generally operate as a binary classifier deciding if a sentence belongs to the summary or not. However, it is well known that even human annotators do not agree on selecting most summary sentences. In this paper, we take a probabilistic view of the summarization ground-truth and assume that more frequently selected sentences by annotators are of higher importance. Using a large summary data-set obtained through crowdsourcing, we empirically show that sentence selection frequency is inversely related to its summarization rank. Consequently, we model the relative importance between sentences using a rank-based classifier. Additionally, we utilize an extended paralinguistic feature set that has not been previously used for speech summarization. Lexical and structural features are also included. Support Vector Machine (SVM) is used as the baseline binary classifier and rank classifier. Experimental evaluations show that the proposed approach outperforms traditional binary classifiers with respect to various ROUGE summarization metrics for different summarization compression ratios (CR).
Keywords
speech processing; support vector machines; SVM; automatic broadcast news summarization; baseline binary classifier; crowdsourced annotation; extended paralinguistic feature; extractive speech summarization methods; lexical features; probabilistic view; rank-based classifier; structural features; summarization ground-truth; support vector machine; Measurement; Spoken document summarization; crowdsourcing; paralinguistic features;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178974
Filename
7178974
Link To Document