A New Method of Extracting Chinese Term Based on Open Corpus

Author

Liu Jianzhou ; Shao Xiongkai

Author_Institution

Sch. of Comput. Sci., Hubei Univ. of Technol., Wuhan, China

fYear

2010

fDate

22-23 May 2010

Firstpage

Lastpage

Abstract

Automatic Chinese Term Extraction is an important issue in Natural Language Processing. This paper has proposed a new method to extract terms from open corpus. We have used two improved traditional parameters: mutual information and log-likelihood ratio, and have increased the precision of the method to 75.4%. The results of the research indicate that this method is more efficient and robust than previous term-extraction methods.

Keywords

feature extraction; natural language processing; automatic Chinese term extraction; log-likelihood ratio; mutual information; natural language processing; open corpus; Computer science; Data mining; Filters; Frequency; Maximum likelihood estimation; Mutual information; Natural language processing; Natural languages; Probability; Robustness;

fLanguage

English

Publisher

ieee

Conference_Titel

Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on

Conference_Location

Wuhan

Print_ISBN

978-1-4244-5872-1

Electronic_ISBN

978-1-4244-5874-5

Type

conf

DOI

10.1109/IWISA.2010.5473325

Filename

5473325

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=2479184