DocumentCode :
3104034
Title :
A Chinese Synonyms Reduced Algorithm Based on Sememe Tree
Author :
Liguo, Duan ; Junjie, Chen ; Haifang, Li ; Aiping, Li
Author_Institution :
Coll. of Comput. Sci. & Technol., Taiyuan Univ. of Technol., Taiyuan, China
fYear :
2010
fDate :
26-28 Sept. 2010
Firstpage :
337
Lastpage :
340
Abstract :
Question understanding in a Chinese question-answering system generally includes steps such as word segmentation, POS tagging, keyword expansion, and information retrieval. The expanded keyword set usually contains redundant information, and some of its words and phrases may not be relevant to the question. Consequently, information retrieval with the expanded keyword set may introduce a large amount of noise and make answer extraction more difficult. This paper explores using the distance between words in the sememe tree to reduce the keyword set. It analyzes the detailed steps of question understanding and the improved algorithm. Empirical results support the theoretical findings. The proposed algorithm achieves a substantial improvement of 23% on average and removes words that are off the mark. Furthermore, it is expected to improve the accuracy of question understanding in the subsequent steps.
Keywords :
information retrieval; natural language processing; vocabulary; Chinese question answering system; Chinese synonyms reduced algorithm; POS Tagging; keywords expansion; sememe tree; word segmentation; Accuracy; Dictionaries; Semantics; Syntactics; Tagging; Question Understanding; reducing keywords set
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computational Aspects of Social Networks (CASoN), 2010 International Conference on
Conference_Location :
Taiyuan
Print_ISBN :
978-1-4244-8785-1
Type :
conf
DOI :
10.1109/CASoN.2010.82
Filename :
5636724