Title :
Discovering Chinese Concept-In-Corpus
Author :
Chen, Jian-chao ; Zheng, Qi-Lun ; Li, Zhao
Author_Institution :
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou
Abstract :
Concepts are the basis of knowledge. A concept consists of a connotation and an extension. This paper introduces the notion of a concept-in-corpus, a special kind of formal concept, and presents a discovery algorithm called FCWFT (filtering concept-word based on feature-tree), which automatically mines the connotation and the extension of a Chinese concept-in-corpus from a Chinese corpus. Our work is the first attempt to mine formal concepts from free text in the area of natural language processing. We test the algorithm on a large-scale corpus, and the results are encouraging.
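To make the connotation/extension pairing concrete, here is a minimal sketch of a formal concept in the usual formal concept analysis sense, where the connotation is the shared attribute set (often called the intent) and the extension is the object set (the extent). It is not the FCWFT algorithm, which the abstract does not specify; the toy context, the words, the features, and all function names are assumptions made up for illustration.

```python
# Toy formal context: each object (here, a word) maps to the set of
# attributes (features) it has. All data is invented for illustration.
CONTEXT = {
    "sparrow": {"animal", "flies"},
    "eagle": {"animal", "flies", "predator"},
    "wolf": {"animal", "predator"},
}


def extension(attributes):
    """Return the objects that have every attribute in `attributes`."""
    attrs = set(attributes)
    return {obj for obj, feats in CONTEXT.items() if attrs <= feats}


def connotation(objects):
    """Return the attributes shared by every object in `objects`."""
    feature_sets = [CONTEXT[obj] for obj in objects]
    if not feature_sets:
        # Convention: an empty object set shares all attributes.
        return set().union(*CONTEXT.values())
    return set.intersection(*feature_sets)


def is_formal_concept(objects, attributes):
    """A pair is a formal concept when each side determines the other."""
    return (extension(attributes) == set(objects)
            and connotation(objects) == set(attributes))
```

In this toy context, the pair ({"sparrow", "eagle"}, {"animal", "flies"}) is a formal concept, while ({"sparrow", "eagle"}, {"flies"}) is not, because the two words also share the attribute "animal".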
Keywords :
natural language processing; Chinese concept-in-corpus; FCWFT; concept-word filtering; discovering algorithm; feature-tree; formal concept; large scale corpus; Computer science; Cybernetics; Data mining; Filtering algorithms; Intelligent systems; Knowledge engineering; Large-scale systems; Machine learning; Testing; Concept; Concept-IC; Concept-word; Knowledge
Conference_Title :
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2095-7
Electronic_ISBN :
978-1-4244-2096-4
DOI :
10.1109/ICMLC.2008.4620835