DocumentCode :
2329535
Title :
Improving hmm-based extractive summarization for multi-domain contact center dialogues
Author :
Higashinaka, Ryuichiro ; Minami, Yasuhiro ; Nishikawa, Hitoshi ; Dohsaka, Kohji ; Meguro, Toyomi ; Kobashikawa, Satoshi ; Masataki, Hirokazu ; Yoshioka, Osamu ; Takahashi, Satoshi ; Kikui, Genichiro
Author_Institution :
NTT Cyber Space Labs., NTT Corp., Tokyo, Japan
fYear :
2010
fDate :
12-15 Dec. 2010
Firstpage :
61
Lastpage :
66
Abstract :
This paper reports the improvements we made to our previously proposed hidden Markov model (HMM) based summarization method for multi-domain contact center dialogues. Since the method relied on Viterbi decoding for selecting utterances to include in a summary, it had the inability to control compression rates. We enhance our method by using the forward-backward algorithm together with integer linear programming (ILP) to enable the control of compression rates, realizing summaries that contain as many domain-related utterances and as many important words as possible within a predefined character length. Using call transcripts as input, we verify the effectiveness of our enhancement.
Keywords :
Viterbi decoding; call centres; hidden Markov models; integer programming; interactive systems; linear programming; HMM-based extractive summarization; Viterbi decoding; call transcripts; forward-backward algorithm; hidden Markov model based summarization method; integer linear programming; multidomain contact center dialogues; Hidden Markov models; Natural language interfaces; Natural languages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2010 IEEE
Conference_Location :
Berkeley, CA
Print_ISBN :
978-1-4244-7904-7
Electronic_ISBN :
978-1-4244-7902-3
Type :
conf
DOI :
10.1109/SLT.2010.5700823
Filename :
5700823
Link To Document :
بازگشت