DocumentCode
2379433
Title
Automatic term extraction from Chinese scientific texts
Author
Zheng, Qinghua ; Luo, Junying ; Liu, Jun
Author_Institution
MOE KLINNS Lab., Xi´´an Jiaotong Univ., Xi´´an, China
fYear
2011
fDate
8-10 June 2011
Firstpage
727
Lastpage
734
Abstract
Automatic term extraction is an essential task in information processing and has a very important role in many fields, such as information retrieval, knowledge acquisition. However, existing methods are mostly proposed for English domain terms, so they can not fully adapt to the term extraction from Chinese scientific texts. This paper presents a/ new approach on the analysis of the characteristics of Chinese domain terms. Firstly, we introduce a new feature which we call it “max article time” to distinguish terms from non-terms. Then, we use the classification of terms and the links between different terms to obtain the maximum discrimination of this feature. Meanwhile, our method also combines with linguistic methods. Experiments conducted on two different domains for Chinese term extraction indicate our approach has significant improvement over existing techniques and also verify the relative domain independence of the approach.
Keywords
information retrieval; text analysis; Chinese scientific texts; English domain terms; automatic term extraction; information processing; linguistic methods; max article time; Data mining; Feature extraction; Mutual information; Noise; Operating systems; Pragmatics; Speech; Chinese domain terminology; automatic term extraction; machine learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Supported Cooperative Work in Design (CSCWD), 2011 15th International Conference on
Conference_Location
Lausanne
Print_ISBN
978-1-4577-0386-7
Type
conf
DOI
10.1109/CSCWD.2011.5960199
Filename
5960199
Link To Document