Title :
Automatic speech summarization based on sentence extraction and compaction
Author :
Kikuchi, Tomonori ; Furui, Sadaoki ; Hori, Chiori
Author_Institution :
Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan
Abstract :
This paper proposes a new automatic speech summarization method having two stages: important sentence extraction and sentence compaction. Relatively important sentences are extracted based on the amount of information and the confidence measures of constituent words, and the set of extracted sentences is compressed by our sentence compaction method. The sentence compaction is performed by selecting a word set that maximizes a summarization score consisting of the amount of information and the confidence measure of each word, the linguistic likelihood of word strings, and the word concatenation probability. The selected words are concatenated to create a summary. Effectiveness of the proposed method was confirmed by summarizing a spontaneous presentation.
Keywords :
natural languages; probability; speech processing; speech recognition; automatic speech summarization; confidence measures; linguistic likelihood; natural language processing; sentence compaction; sentence extraction; speech documents transcription; spontaneous presentation; summarization score; word concatenation probability; word recognition accuracy; word strings; Application software; Broadcasting; Compaction; Computer science; Concatenated codes; Data mining; Laboratories; Pervasive computing; Speech recognition; Text recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198798