• DocumentCode
    247128
  • Title

    Sentence Similarity Based on Semantic Vector Model

  • Author

    Zhao Jingling ; Zhang Huiyun ; Cui Baojiang

  • Author_Institution
    Sch. of Comput., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2014
  • fDate
    8-10 Nov. 2014
  • Firstpage
    499
  • Lastpage
    503
  • Abstract
    Sentence similarity measures play an increasingly important role in text-related research and applications in areas such as text mining, Web page retrieval, and dialogue systems. Existing methods for computing sentence similarity have been adopted from approaches used for long text documents. These methods process sentences in a very high-dimensional space and are consequently inefficient, require human input, and are not adaptable to some application domains. This paper focuses directly on computing the similarity between very short texts of sentence length. It presents an algorithm that takes account of semantic information, structure information and word order information implied in the sentences. The semantic similarity of two sentences is calculated using information from a structured lexical database, How-net. The use of a lexical database enables our method to model human common sense knowledge. The proposed method can be used in a variety of applications that involve text knowledge representation and discovery. Experiments on two sets of selected sentence pairs demonstrate that the proposed method provides a similarity measure that shows higher accuracy than other methods.
  • Keywords
    natural language processing; semantic networks; text analysis; vectors; lexical database; natural language processing; semantic vector model; sentence similarity measure; text document processing; text knowledge representation; Accuracy; Dictionaries; Educational institutions; Joints; Semantics; Speech; Vectors; semantic vector; sentence similarity; word order similarity; word similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC), 2014 Ninth International Conference on
  • Conference_Location
    Guangdong
  • Type

    conf

  • DOI
    10.1109/3PGCIC.2014.101
  • Filename
    7024635