• DocumentCode
    2323642
  • Title

    Automatic speech summarization based on word significance and linguistic likelihood

  • Author

    Hori, Chiori ; Furui, Sadaoki

  • Author_Institution
    Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1579
  • Abstract
    This paper proposes a new method of automatically summarizing speech by extracting a limited number of relatively important words from its automatic transcription according to a target compression ratio for the number of characters. To determine a word set to be extracted, we define a summarization score consisting of a topic score (significance measure) of words and a linguistic score (likelihood) of the word concatenation. A set of words maximizing the score is efficiently selected using a dynamic programming (DP) technique. Japanese broadcast news speech transcribed using a large vocabulary continuous speech recognition system was summarized. As a result 86% of important words in the original speech were correctly included in the summarizing sentences and 72% of the summarizing sentences could maintain the meanings of the original speech under the 60-70% summarization condition
  • Keywords
    computational linguistics; dynamic programming; natural languages; speech recognition; Japanese broadcast news speech; automatic speech summarization; automatic transcription; dynamic programming; large vocabulary continuous speech recognition system; likelihood; linguistic likelihood; linguistic score; relatively important words; significance measure; summarization score; target compression ratio; topic score; word concatenation; word significance; Casting; Computer science; Data mining; Dynamic programming; Indexing; Information retrieval; Information science; Speech recognition; TV broadcasting; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.861983
  • Filename
    861983