DocumentCode
3105361
Title
Automatic Construction of a Core Lexicon for Specific Domain
Author
Ji, Luning ; Lu, Qin ; Li, Wenjie ; Chen, Yirong
fYear
2007
fDate
22-24 Aug. 2007
Firstpage
183
Lastpage
188
Abstract
The rapid development of science and technology in different domains has created many new concepts, and the domain lexicon must be updated in a timely manner to include the new terms as domain knowledge. However, automatic updating of domain knowledge requires a core lexicon for bootstrapping. The core lexicon should contain the fundamental terms used in a domain, from which other concepts and terms can be built. In this paper we present an algorithm for extracting the core lexicon from domain-specific lexicons. An experiment on a large domain-specific lexicon with 139,429 entries shows that only 3,413 terms form the core lexicon, with a high precision of 97% and good coverage.
Keywords
Algorithm design and analysis; Buildings; Data mining; Dictionaries; Information technology; Kernel; Natural language processing; Ontologies; Terminology; Vocabulary; Core Lexicon; Core Terminology; Specific Domain; Terminology Extraction
fLanguage
English
Publisher
IEEE
Conference_Titel
Advanced Language Processing and Web Information Technology, 2007. ALPIT 2007. Sixth International Conference on
Conference_Location
Luoyang, Henan, China
Print_ISBN
978-0-7695-2930-1
Type
conf
DOI
10.1109/ALPIT.2007.21
Filename
4460637