• DocumentCode
    2329535
  • Title

    Improving hmm-based extractive summarization for multi-domain contact center dialogues

  • Author

    Higashinaka, Ryuichiro ; Minami, Yasuhiro ; Nishikawa, Hitoshi ; Dohsaka, Kohji ; Meguro, Toyomi ; Kobashikawa, Satoshi ; Masataki, Hirokazu ; Yoshioka, Osamu ; Takahashi, Satoshi ; Kikui, Genichiro

  • Author_Institution
    NTT Cyber Space Labs., NTT Corp., Tokyo, Japan
  • fYear
    2010
  • fDate
    12-15 Dec. 2010
  • Firstpage
    61
  • Lastpage
    66
  • Abstract
    This paper reports the improvements we made to our previously proposed hidden Markov model (HMM) based summarization method for multi-domain contact center dialogues. Since the method relied on Viterbi decoding for selecting utterances to include in a summary, it had the inability to control compression rates. We enhance our method by using the forward-backward algorithm together with integer linear programming (ILP) to enable the control of compression rates, realizing summaries that contain as many domain-related utterances and as many important words as possible within a predefined character length. Using call transcripts as input, we verify the effectiveness of our enhancement.
  • Keywords
    Viterbi decoding; call centres; hidden Markov models; integer programming; interactive systems; linear programming; HMM-based extractive summarization; Viterbi decoding; call transcripts; forward-backward algorithm; hidden Markov model based summarization method; integer linear programming; multidomain contact center dialogues; Hidden Markov models; Natural language interfaces; Natural languages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2010 IEEE
  • Conference_Location
    Berkeley, CA
  • Print_ISBN
    978-1-4244-7904-7
  • Electronic_ISBN
    978-1-4244-7902-3
  • Type

    conf

  • DOI
    10.1109/SLT.2010.5700823
  • Filename
    5700823