DocumentCode
2665308
Title
Automatic recognition of Chinese scientific and technological terms using integrated linguistic knowledge
Author
Sui, Zhifang ; Yirong Chen ; Wei, Zhouchao
Author_Institution
Inst. of Comput. Linguistics, Peking Univ., Beijing, China
fYear
2003
fDate
26-29 Oct. 2003
Firstpage
444
Lastpage
451
Abstract
We introduce our research on using integrated linguistic knowledge to automatically recognize Chinese scientific and technological terms based on the careful analysis of the characteristics of this kind of terms. The system of automatic term recognition includes two phases: learning stage and application stage. In the stage of learning, we use a series of machine learning methods to get various kinds of integrated knowledge for automatic term recognition from a large-scale corpus and a term bank. These knowledge includes the inner structural knowledge of terms, the statistical domain features of term component, the statistical mutual information between the components of terms, the outer environment features of terms and the distinct text-level features of term recognition etc.. In the stage of application, through an efficient model, we use all these various types of knowledge into automatic term recognition. The experiments show that the system can give great help to the expert of term standardization to discover new terms.
Keywords
character recognition; knowledge based systems; learning (artificial intelligence); linguistics; natural languages; automatic Chinese scientific term recognition; integrated linguistic knowledge; machine learning; term standardization; Biological materials; Bluetooth; Bridges; Business; Character recognition; Computational linguistics; Materials science and technology; Natural languages; Paper technology; Standardization;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location
Beijing, China
Print_ISBN
0-7803-7902-0
Type
conf
DOI
10.1109/NLPKE.2003.1275948
Filename
1275948
Link To Document