DocumentCode :
188637
Title :
Real Text-CS - Corpus Based Domain Independent Content Selection Model
Author :
Perera, Rivindu ; Nand, Pradyumn
Author_Institution :
Sch. of Comput. & Math. Sci., Auckland Univ. of Technol., Auckland, New Zealand
fYear :
2014
fDate :
10-12 Nov. 2014
Firstpage :
599
Lastpage :
606
Abstract :
Content selection is a highly domain dependent task responsible for retrieving relevant information from a knowledge source using a given communicative goal. This paper presents a domain independent content selection model using keywords as communicative goal. We employ DBpedia triple store as our knowledge source and triples are selected based on weights assigned to each triple. The calculation of the weights is carried out through log likelihood distance between a domain corpus and a general reference corpus. The method was evaluated using keywords extracted from QALD dataset and the performance was compared with cross entropy based statistical content selection. The evaluation results showed that the proposed method can perform 32% better than cross entropy based statistical content selection.
Keywords :
content-based retrieval; entropy; natural language processing; semantic Web; statistical analysis; DBpedia triple store; QALD dataset; RealTextCS; communicative goal; corpus based domain independent content selection model; cross entropy based statistical content selection; domain dependent task; information retrieval; keywords extraction; knowledge source; log likelihood distance; natural language generation; natural language processing; semantic Web; Adaptation models; Bridges; Buildings; Computers; Encyclopedias; Frequency-domain analysis; Semantics; Content Selection; Natural Language Generation; Natural Language Processing; Semantic Web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2014 IEEE 26th International Conference on
Conference_Location :
Limassol
ISSN :
1082-3409
Type :
conf
DOI :
10.1109/ICTAI.2014.95
Filename :
6984531
Link To Document :
بازگشت