• DocumentCode
    427030
  • Title

    Active learning for story segmentation of spoken documents

  • Author

    Chen, Shun-Chuan

  • Author_Institution
    Speech Lab., Nat. Taiwan Univ., Taipei
  • Volume
    1
  • fYear
    2004
  • fDate
    30-30 June 2004
  • Firstpage
    607
  • Abstract
    In story segmentation, it is often difficult to gather the segmented data to train a new model for the purpose of supervised learning. Therefore, how to gather the useful data and to reduce the human effort on segmenting stories is an important issue. We apply active learning to selecting the most informative examples to train a supervised model more efficiently. Active learning aims to minimize the number of segmented examples by automatically selecting the examples that are most informative for the story segmenter. By this method, we can decrease the labor effort in boring story segmentation and get the same or better performance in automatic story segmentation. We also consider another problem, namely whether training data in story segmentation can be reusable or not and how the different special structures of the language influence the performance of active learning for story segmentation. The experimental results show that active learning can reduce the number of segmented examples to reach a given level of performance and that reusable training data in story segmentation still contain information that improves the performance. It is a satisfactory result
  • Keywords
    learning (artificial intelligence); linguistics; minimisation; natural languages; speech processing; text analysis; active learning; automatic story segmentation; language structures; minimization; reusable training data; spoken documents; supervised learning; Educational institutions; Hidden Markov models; Humans; Laboratories; Learning systems; NIST; Sampling methods; Speech; Supervised learning; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
  • Conference_Location
    Taipei
  • Print_ISBN
    0-7803-8603-5
  • Type

    conf

  • DOI
    10.1109/ICME.2004.1394265
  • Filename
    1394265