DocumentCode :
2788967
Title :
Using n-best recognition output for extractive summarization and keyword extraction in meeting speech
Author :
Liu, Yang ; Xie, Shasha ; Liu, Fei
Author_Institution :
Univ. of Texas at Dallas, Richardson, TX, USA
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5310
Lastpage :
5313
Abstract :
There has been increasing interest recently in meeting understanding, such as summarization, browsing, action item detection, and topic segmentation. However, there is very limited effort on using rich recognition output (e.g., recognition confidence measure or more recognition candidates) for these downstream tasks. This paper presents an initial study using n-best recognition hypotheses for two tasks, extractive summarization and keyword extraction. We extend the approach used on 1-best output to n-best hypotheses: MMR (maximum marginal relevance) for summarization and TFIDF (term frequency, inverse document frequency) weighting for keyword extraction. Our experiments on the ICSI meeting corpus demonstrate promising improvement using n-best hypotheses over 1-best output. These results suggest worthy future studies using n-best or lattices as the interface between speech recognition and downstream tasks.
Keywords :
feature extraction; speech processing; speech recognition; MMR; TFIDF weighting; action item detection; browsing; extractive summarization; keyword extraction; maximum marginal relevance; meeting speech; n-best recognition hypotheses; n-best recognition output; recognition candidates; recognition confidence measure; term frequency inverse document frequency; topic segmentation; Automatic speech recognition; Broadcasting; Data mining; Degradation; Frequency; Humans; Information management; Lattices; Natural languages; Speech recognition; keyword extraction; n-best hypotheses; summarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494972
Filename :
5494972
Link To Document :
بازگشت