• DocumentCode
    542158
  • Title

    Automatic indexing of lecture speech by extracting topic-independent discourse markers

  • Author

    Kawahara, Tatsuya ; Hasegawa, Masahiro

  • Author_Institution
    School of Informatics, Kyoto University, Sakyo-ku, 606-8501, Japan
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    Automatic detection of section (sub-topic) boundaries in lecture speech is addressed. The method makes use of the characteristic expressions used in initial utterances of sections defined as discourse makers, as well as pause and language model information. The discourse markers are derived in a totally unsupervised manner based on word statistics used in the information retrieval technique. The statistics is used to select candidates picked up by other information. Experimental results show that the proposed method realizes better indexing performance (better precision at high recall rates) than the simple baseline method using pause information only. Moreover, it is shown to be robust against speech recognition errors.
  • Keywords
    Computational modeling; Machine assisted indexing; Manuals; Radio access networks; Soil; Speech; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743639
  • Filename
    5743639