Title :
A Study on Automatic Extraction of New Terms
Author :
Zhang, Xing ; Fang, Alex Chengyu
Author_Institution :
Dept. of Chinese, Translation & Linguistics, City Univ. of Hong Kong, Kowloon, China
Abstract :
This research explores to automatically predict new terms based on linguistic features and statistical behaviors of noun phrases during a special period. It integrates both syntactic function value and TF-IDF value into an automatic term extraction system to weight new term candidates. Research questions include: what are the linguistic and statistic properties of new terms during a special period? Will linguistic features contribute to prediction of new terms? And will statistic features like, TFIDF Value contribute to prediction of new terms? Correspondingly, a series of experiments are conducted on medical corpus to examine a group of new terms´ distribution properties and syntactic features across two years in comparison. The results show there does exist significant difference between two groups of values. Regardless of this limitation, this research is meaningful as it attempts to realize automation of selection process of new medical terms, which will greatly avoid subjective decisions and reduce experts´ workloads.
Keywords :
information retrieval; statistical analysis; TF-IDF value; automatic new term extraction system; linguistic feature; noun phrase behavior; statistical behavior; syntactic function value; Abstracts; Analysis of variance; Equations; Pragmatics; Syntactics; Terminology; Testing; New term; SF-Value; TFIDF; old term; term extraction;
Conference_Titel :
Knowledge Acquisition and Modeling (KAM), 2011 Fourth International Symposium on
Conference_Location :
Sanya
Print_ISBN :
978-1-4577-1788-8
DOI :
10.1109/KAM.2011.162