Title :
Automatic detection of topic boundaries and keywords in arbitrary speech using incremental reference interval-free continuous DP
Author :
Kiyama, Jiro ; Itoh, Yoshiaki ; Oka, Ryuichi
Author_Institution :
Tsukuba Res. Center, Real World Comput. Partnership, Tsukuba, Japan
Abstract :
We propose an approach for detecting topic boundaries and keywords in arbitrary speech, with neither recognition nor prosodic processing, aiming at guide access to the content of recorded raw speech. This approach is based on the general tendency that frequently repeated phrases/words in speech are characteristic of topics in discourse. So it uses pairs of phonetically similar segments (PPSSs) of speech to represent topics in speech. This approach has the advantage of being domain and language independent and robust against variations in the speaker and background noise, as it needs neither a language nor acoustic model in advance. Experiments using simulated dialogues confirmed the good performance of this approach. We also propose Incremental Reference Interval Free Continuous Dynamic Programming (IRIFCDP) as an algorithm for detecting PPSSs in speech for the above method. IRIFCDP can detect PPSSs efficiently in synchronization with the speech, so it is suitable for handling long speech samples
Keywords :
dynamic programming; information retrieval; interactive systems; speech processing; IRIFCDP; Incremental Reference Interval Free Continuous Dynamic Programming; arbitrary speech; automatic detection; frequently repeated phrases/words; keywords; language independent; long speech samples; phonetically similar segments; recorded raw speech; simulated dialogues; topic boundaries; Acoustic signal detection; Background noise; Dynamic programming; Loudspeakers; Natural languages; Noise robustness; Pattern matching; Speech processing; Speech recognition; Steel;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.608016