• DocumentCode
    3127786
  • Title

    Application of Topic Based Vector Space Model with WordNet

  • Author

    Wibowo, Adi ; Handojo, Andreas ; Halim, Albert

  • Author_Institution
    Inf. Dept., Petra Christian Univ., Surabaya, Indonesia
  • Volume
    1
  • fYear
    2011
  • fDate
    4-7 Aug. 2011
  • Firstpage
    133
  • Lastpage
    136
  • Abstract
    Topic Based Vector Space Model (TVSM) proposed a new vector space that its dimensions is composed of topics. Every term and document is represented by vectors inside this vector space. By using topics as dimensions TVSM tries to overcome word-mismatch between terms with similar topics in finding relevant documents to query. This study proposes to develop relations between terms using WordNet and thesaurus to help TVSM calculating similarity between documents. Relations between terms are represented by relation score. This study proposes a way to find optimal relation score for a set of documents. To help indexing documents with multi language terms this study also proposes to use dictionary to expand query terms.
  • Keywords
    dictionaries; indexing; query processing; text analysis; thesauri; vectors; WordNet; dictonary; document indexing; document querying; multi language terms; thesaurus; topic based vector space model; word-mismatch; Business; Dictionaries; Mathematical model; Search engines; Testing; Thesauri; Weight measurement; Topic based vector space model; dictionary; wordnet;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Uncertainty Reasoning and Knowledge Engineering (URKE), 2011 International Conference on
  • Conference_Location
    Bali
  • Print_ISBN
    978-1-4244-9985-4
  • Electronic_ISBN
    978-1-4244-9984-7
  • Type

    conf

  • DOI
    10.1109/URKE.2011.6007864
  • Filename
    6007864