DocumentCode :
2294949
Title :
Restricted-domain Chinese automatic question-answering system based on question sentence similarity
Author :
Zheng-Tao Yu ; Xiao-Zhong Fan ; Peng-Cheng Ji
Author_Institution :
Dept. of Comput. Sci. & Eng., Beijing Inst. of Technol., China
Volume :
5
fYear :
2004
fDate :
26-29 Aug. 2004
Firstpage :
3023
Abstract :
It is an available pattern to implement auto answer in RDAQAS (restricted-domain automatic question-answering system) through calculating the similarity of target question sentences and question sentences in question sentences corpus, and then finding the most similar question sentences, and retrieving the answer finally and all these are based on HowNet and domain ontology. This paper introduces the building of financial domain ontology and question sentences corpus, then proposes the method to calculate similarity of question sentences based on keyword vector space method and semantic concept vector space method. The procedure of realization is described in details. The learning algorithm and learning course of getting question sentence semantic vectors based on the maximum entropy model are also introduced in detail. Finally, the experimental comparing data illustrates that the similarity calculation method based on the semantic concept is more superior to that based on the keyword.
Keywords :
financial data processing; learning (artificial intelligence); maximum entropy methods; ontologies (artificial intelligence); HowNet knowledge database; financial domain ontology; keyword vector space method; learning algorithm; maximum entropy model; question sentence similarity; question sentences corpus; restricted domain Chinese automatic question answering system; semantic concept vector space method; Automation; Computer science; Content based retrieval; Data mining; Databases; Dictionaries; Entropy; Finance; Information retrieval; Ontologies;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2004. Proceedings of 2004 International Conference on
Print_ISBN :
0-7803-8403-2
Type :
conf
DOI :
10.1109/ICMLC.2004.1378551
Filename :
1378551
Link To Document :
بازگشت