DocumentCode
3127786
Title
Application of Topic Based Vector Space Model with WordNet
Author
Wibowo, Adi ; Handojo, Andreas ; Halim, Albert
Author_Institution
Inf. Dept., Petra Christian Univ., Surabaya, Indonesia
Volume
1
fYear
2011
fDate
4-7 Aug. 2011
Firstpage
133
Lastpage
136
Abstract
The Topic Based Vector Space Model (TVSM) proposes a vector space whose dimensions are topics. Every term and document is represented by a vector in this space. By using topics as dimensions, TVSM tries to overcome word mismatch between terms with similar topics when finding documents relevant to a query. This study proposes building relations between terms using WordNet and a thesaurus to help TVSM calculate the similarity between documents. Relations between terms are represented by a relation score. This study proposes a way to find the optimal relation score for a set of documents. To help index documents containing terms in multiple languages, this study also proposes using a dictionary to expand query terms.
Keywords
dictionaries; indexing; query processing; text analysis; thesauri; vectors; WordNet; dictionary; document indexing; document querying; multi language terms; thesaurus; topic based vector space model; word-mismatch; Business; Dictionaries; Mathematical model; Search engines; Testing; Thesauri; Weight measurement; Topic based vector space model; dictionary; wordnet;
fLanguage
English
Publisher
ieee
Conference_Titel
Uncertainty Reasoning and Knowledge Engineering (URKE), 2011 International Conference on
Conference_Location
Bali
Print_ISBN
978-1-4244-9985-4
Electronic_ISBN
978-1-4244-9984-7
Type
conf
DOI
10.1109/URKE.2011.6007864
Filename
6007864