Title :
Towards Abstractive Speech Summarization: Exploring Unsupervised and Supervised Approaches for Spoken Utterance Compression
Author :
Fei Liu ; Yang Liu
Author_Institution :
Dept. of Comput. Sci., Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
Most previous studies on speech summarization focus on the extractive approaches. Yet directly concatenating the extracted speech utterances may not form a good summary due to the presence of disfluencies and redundancy in the unplanned spontaneous speech. In this paper, we proposed to generate compressed speech summaries by coupling the sentence level compression and summarization approaches, as a viable step towards generating abstractive summaries. We compared two utterance compression approaches: an unsupervised approach based on the Integer Linear Programming (ILP) framework, and a supervised method using conditional random fileds (CRF) that formulates the utterance compression problem as a sequence labeling task. We evaluated the compression performance using both human and ASR transcripts from the ICSI meeting corpus, and performed both automatic and human evaluation. Our results show that we can achieve reasonable utterance compression performance, and that the CRF-based method generally performs better. By coupling the compression and summarization approaches, we generated compressed speech summaries that cover more important information within the given length limit, yielding 5% absolute performance gain on both human and ASR transcripts as evaluated by the ROUGE-1 F-scores.
Keywords :
data compression; integer programming; linear programming; speech recognition; ASR transcripts; CRF; CRF-based method; ICSI meeting corpus; ILP framework; ROUGE-1 F-scores; abstractive speech summarization; abstractive summary generation; automatic speech recognizers; conditional random fields; extractive approach; integer linear programming framework; sentence level compression; sequence labeling task; speech utterances; spoken utterance compression; unsupervised approach; Conditional random fields; ICSI meeting corpus; integer linear programming; speech summarization; spoken utterance compression;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2013.2255279