Improving hmm-based extractive summarization for multi-domain contact center dialogues

Author

Higashinaka, Ryuichiro ; Minami, Yasuhiro ; Nishikawa, Hitoshi ; Dohsaka, Kohji ; Meguro, Toyomi ; Kobashikawa, Satoshi ; Masataki, Hirokazu ; Yoshioka, Osamu ; Takahashi, Satoshi ; Kikui, Genichiro

Author_Institution

NTT Cyber Space Labs., NTT Corp., Tokyo, Japan

fYear

2010

fDate

12-15 Dec. 2010

Firstpage

61

Lastpage

66

Abstract

This paper reports the improvements we made to our previously proposed hidden Markov model (HMM) based summarization method for multi-domain contact center dialogues. Since the method relied on Viterbi decoding for selecting utterances to include in a summary, it had the inability to control compression rates. We enhance our method by using the forward-backward algorithm together with integer linear programming (ILP) to enable the control of compression rates, realizing summaries that contain as many domain-related utterances and as many important words as possible within a predefined character length. Using call transcripts as input, we verify the effectiveness of our enhancement.

Keywords

Viterbi decoding; call centres; hidden Markov models; integer programming; interactive systems; linear programming; HMM-based extractive summarization; Viterbi decoding; call transcripts; forward-backward algorithm; hidden Markov model based summarization method; integer linear programming; multidomain contact center dialogues; Hidden Markov models; Natural language interfaces; Natural languages;

fLanguage

English

Publisher

ieee

Conference_Titel

Spoken Language Technology Workshop (SLT), 2010 IEEE

Conference_Location

Berkeley, CA

Print_ISBN

978-1-4244-7904-7

Electronic_ISBN

978-1-4244-7902-3

Type

conf

DOI

10.1109/SLT.2010.5700823

Filename

5700823