Title :
Truncated singular value decomposition for semantic-based data retrieval
Author :
Djellali, Choukri
Author_Institution :
Lab. for Res. on Technol. for Ecommerce, UQAM, Montreal, QC, Canada
Abstract :
This paper addresses the increasingly encountered challenge of knowledge indexation. In the past decade, research on numerical schemes on knowledge indexation has been quite intensive. Vector space model is only based on the information contained in term weighting and does therefore not process the semantic contained in the sequence in which the words appear in a bag-of-words. This representation provides an abstraction of semantic relations between different linguistic units. A novel semantic-based method for knowledge indexation, which can provide improvement in both indexing and retrieval, is described. Despite a huge dimension in vector space model size, retrieval accuracies are seen to improve significantly when the proposed system is applied for indexing Reuters-21578 corpus.
Keywords :
information retrieval; semantic networks; singular value decomposition; Reuters-21578 corpus; knowledge indexation; linguistic units; semantic-based data retrieval; truncated singular value decomposition; vector space model; Indexing; Noise; Security; Semantics; Singular value decomposition; Vectors; clustering; indexation; learning; retrieval; semantic analysis; truncated singular value decomposition; variable selection;
Conference_Titel :
Communications and Information Technology (ICCIT), 2013 Third International Conference on
Conference_Location :
Beirut
Print_ISBN :
978-1-4673-5306-9
DOI :
10.1109/ICCITechnology.2013.6579523