• DocumentCode
    1660990
  • Title

    Auto-labeling Terms Based on Multi-scanning Strategy

  • Author

    Zezhi, Zheng ; Ting, Ao ; Na, Xu ; Bo, Zheng

  • Author_Institution
    Dept. of Chinese Language & Literature, Xiamen Univ., Xiamen, China
  • fYear
    2010
  • Firstpage
    550
  • Lastpage
    554
  • Abstract
    In order to construct the term corpus of physics teaching materials for elementary education, the characters of physics terms were studied, the prediction templates for the unknown terms was built, all kinds of rules for identifying terms was extracted, and the labeling errors of maximum matching algorithm was analyzed, at last, an auto-labeling system was developed. Firstly, this algorithm scans and labels terms which match the rule templates. Secondly, it takes terms in the base glossary as anchor points, and finds out every anchor point with the maximum matching algorithm. Finally scans the context of the anchor point so as to judge whether the candidate strings is a term or not. Together with the prediction and limited function of rules, this method makes full use of the information of terms in base glossary and achieves a higher precision and recall rate. The F-index reaches about 84% in open test.
  • Keywords
    educational technology; physics education; string matching; autolabeling term; elementary education; matching algorithm; multiscanning strategy; physics teaching material; Context; Labeling; Physics; Presses; Terminology; Training; auto-labeling; rule; term component;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Processing (ISIP), 2010 Third International Symposium on
  • Conference_Location
    Qingdao
  • Print_ISBN
    978-1-4244-8627-4
  • Type

    conf

  • DOI
    10.1109/ISIP.2010.106
  • Filename
    5669086