DocumentCode
542158
Title
Automatic indexing of lecture speech by extracting topic-independent discourse markers
Author
Kawahara, Tatsuya ; Hasegawa, Masahiro
Author_Institution
School of Informatics, Kyoto University, Sakyo-ku, 606-8501, Japan
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
Automatic detection of section (sub-topic) boundaries in lecture speech is addressed. The method makes use of the characteristic expressions used in initial utterances of sections defined as discourse makers, as well as pause and language model information. The discourse markers are derived in a totally unsupervised manner based on word statistics used in the information retrieval technique. The statistics is used to select candidates picked up by other information. Experimental results show that the proposed method realizes better indexing performance (better precision at high recall rates) than the simple baseline method using pause information only. Moreover, it is shown to be robust against speech recognition errors.
Keywords
Computational modeling; Machine assisted indexing; Manuals; Radio access networks; Soil; Speech; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743639
Filename
5743639
Link To Document