Title :
A New Method of Extracting Chinese Term Based on Open Corpus
Author :
Liu Jianzhou ; Shao Xiongkai
Author_Institution :
Sch. of Comput. Sci., Hubei Univ. of Technol., Wuhan, China
Abstract :
Automatic Chinese term extraction is an important problem in natural language processing. This paper proposes a new method for extracting terms from an open corpus. The method uses improved versions of two traditional measures, mutual information and the log-likelihood ratio, and raises precision to 75.4%. The results indicate that the method is more efficient and robust than previous term-extraction methods.
Keywords :
feature extraction; natural language processing; automatic Chinese term extraction; log-likelihood ratio; mutual information; open corpus; Computer science; Data mining; Filters; Frequency; Maximum likelihood estimation; Natural languages; Probability; Robustness
Conference_Title :
Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5872-1
Electronic_ISBN :
978-1-4244-5874-5
DOI :
10.1109/IWISA.2010.5473325