DocumentCode
1660990
Title
Auto-labeling Terms Based on Multi-scanning Strategy
Author
Zezhi, Zheng ; Ting, Ao ; Na, Xu ; Bo, Zheng
Author_Institution
Dept. of Chinese Language & Literature, Xiamen Univ., Xiamen, China
fYear
2010
Firstpage
550
Lastpage
554
Abstract
In order to construct the term corpus of physics teaching materials for elementary education, the characters of physics terms were studied, the prediction templates for the unknown terms was built, all kinds of rules for identifying terms was extracted, and the labeling errors of maximum matching algorithm was analyzed, at last, an auto-labeling system was developed. Firstly, this algorithm scans and labels terms which match the rule templates. Secondly, it takes terms in the base glossary as anchor points, and finds out every anchor point with the maximum matching algorithm. Finally scans the context of the anchor point so as to judge whether the candidate strings is a term or not. Together with the prediction and limited function of rules, this method makes full use of the information of terms in base glossary and achieves a higher precision and recall rate. The F-index reaches about 84% in open test.
Keywords
educational technology; physics education; string matching; autolabeling term; elementary education; matching algorithm; multiscanning strategy; physics teaching material; Context; Labeling; Physics; Presses; Terminology; Training; auto-labeling; rule; term component;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Processing (ISIP), 2010 Third International Symposium on
Conference_Location
Qingdao
Print_ISBN
978-1-4244-8627-4
Type
conf
DOI
10.1109/ISIP.2010.106
Filename
5669086
Link To Document