مرکز منطقه ای اطلاع رساني علوم و فناوري - Improving hmm-based extractive summarization for multi-domain contact center dialogues

DocumentCode :

2329535

Title :

Improving hmm-based extractive summarization for multi-domain contact center dialogues

Author :

Higashinaka, Ryuichiro ; Minami, Yasuhiro ; Nishikawa, Hitoshi ; Dohsaka, Kohji ; Meguro, Toyomi ; Kobashikawa, Satoshi ; Masataki, Hirokazu ; Yoshioka, Osamu ; Takahashi, Satoshi ; Kikui, Genichiro

Author_Institution :

NTT Cyber Space Labs., NTT Corp., Tokyo, Japan

fYear :

2010

fDate :

12-15 Dec. 2010

Firstpage :

Lastpage :

Abstract :

This paper reports the improvements we made to our previously proposed hidden Markov model (HMM) based summarization method for multi-domain contact center dialogues. Since the method relied on Viterbi decoding for selecting utterances to include in a summary, it had the inability to control compression rates. We enhance our method by using the forward-backward algorithm together with integer linear programming (ILP) to enable the control of compression rates, realizing summaries that contain as many domain-related utterances and as many important words as possible within a predefined character length. Using call transcripts as input, we verify the effectiveness of our enhancement.

Keywords :

Viterbi decoding; call centres; hidden Markov models; integer programming; interactive systems; linear programming; HMM-based extractive summarization; Viterbi decoding; call transcripts; forward-backward algorithm; hidden Markov model based summarization method; integer linear programming; multidomain contact center dialogues; Hidden Markov models; Natural language interfaces; Natural languages;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Spoken Language Technology Workshop (SLT), 2010 IEEE

Conference_Location :

Berkeley, CA

Print_ISBN :

978-1-4244-7904-7

Electronic_ISBN :

978-1-4244-7902-3

Type :

conf

DOI :

10.1109/SLT.2010.5700823

Filename :

5700823

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2329535