DocumentCode :
1910561
Title :
An Automated Term Definition Extraction using the Web Corpus in Chinese Language
Author :
Leu, Fang-Yie ; Ko, Chih-Chieh
Author_Institution :
Tunghai Univ, Taichung
fYear :
2007
fDate :
Aug. 30 2007-Sept. 1 2007
Firstpage :
435
Lastpage :
440
Abstract :
This paper proposes a system, named DefExplorer, which extracts term definitions from the Web, determines the type of question terms, and selects answers from noisy Web pages automatically. DefExplorer filters out invalid data with a semantic approach. We deployed two types of candidate sets, common and domain specific, to group similar candidates and determine candidates´ importance for selecting final answers. Experimental results show that DefExplorer can effectively extract term definitions from the Web, especially for the definitions of out-of-vocabulary terms.
Keywords :
information retrieval; natural language processing; semantic Web; vocabulary; Chinese language; DefExplorer; Web corpus; automated term definition extraction; noisy Web pages; semantic approach; Assembly; Computer science; Data mining; Dictionaries; Encyclopedias; Filters; Information retrieval; Natural languages; Vocabulary; Web pages; Chinese Language; Definitions; Information Extraction; Web corpus;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2007. NLP-KE 2007. International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-1611-0
Electronic_ISBN :
978-1-4244-1611-0
Type :
conf
DOI :
10.1109/NLPKE.2007.4368067
Filename :
4368067
Link To Document :
بازگشت