Title :
Towards improving the performance of Vector Space Model for Chinese Frequently Asked Question Answering
Author :
Ridong Jiang;Seokhwam Kim;Rafael E. Banchs; Haizhou Li
Author_Institution :
Human Language Technology Department, Institute for Infocomm Research, Singapore 138632
Abstract :
This paper presents a method which improves the performance of Vector Space Model (VSM) when applying it to Chinese Frequently Asked Questions (FAQ). This method combines unigram and bigram models in determining the similarity of document vectors. The performance is further improved by applying shallow lexical semantics and the document length information. Experiments showed that the proposed methods outperform baselines (segmentation and bigram) across different datasets which include FAQs from restricted domains and open domains.
Keywords :
"Information retrieval","Silicon","TV","Computational modeling"
Conference_Titel :
Asian Language Processing (IALP), 2015 International Conference on
Print_ISBN :
978-1-4673-9595-3
DOI :
10.1109/IALP.2015.7451550