Title :
A comparative study of probabilistic ranking models for spoken document summarization
Author :
Lin, Shih-Hsiang ; Yi-Ting Chen ; Wang, Hsin-Min ; Chen, Yi-Ting
Author_Institution :
Nat. Taiwan Normal Univ., Taipei
fDate :
March 31 2008-April 4 2008
Abstract :
The purpose of extractive document summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In the paper, we present a comparative study of various supervised and unsupervised probabilistic ranking models for spoken document summarization on the Chinese broadcast news. Moreover, we also investigate the possibility of using unsupervised summarizers to boost the performance of supervised summarizers when manual labels are not available for the training of supervised summarizers. Encouraging results were initially demonstrated.
Keywords :
document handling; probability; speech processing; Chinese broadcast news; indicative sentences; paragraphs; passages; spoken document summarization; unsupervised probabilistic ranking models; Bayesian methods; Broadcasting; Data mining; Frequency; Hidden Markov models; Information science; Labeling; Personnel; Support vector machine classification; Support vector machines; extractive summarization; probabilistic ranking models; spoken document summarization; unsupervised summarizers;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518787