DocumentCode :
124141
Title :
A New Sentence Similarity Method Based on a Three-Layer Sentence Representation
Author :
Ferreira, Ricardo ; Dueire Lins, Rafael ; Freitas, Fred ; Avila, Beatriz ; Simske, Steven J. ; Riss, Marcelo
Author_Institution :
Inf. Center, Fed. Univ. of Pernambuco, Recife, Brazil
Volume :
1
fYear :
2014
fDate :
11-14 Aug. 2014
Firstpage :
110
Lastpage :
117
Abstract :
Sentence similarity methods are used to assess the degree of likelihood between phrases. Many natural language applications such as text summarization, information retrieval, text categorization, and machine translation employ measures of sentence similarity. The existing approaches for this problem represent sentences as vectors of bag of words or the syntactic information of the words in the phrase. The likelihood between phrases is calculated by composing the similarity between the words in the sentences. Such schemes do not address two important concerns in the area, however: the semantic problem and the word order. This paper proposes a new sentence similarity measure that attempts to address such problems by taking into account the lexical, syntactic, and semantic analysis of sentences. The new similarity measure proposed outperforms the state of the art systems in around 6%, when tested using a standard and publically available dataset.
Keywords :
natural language processing; text analysis; information retrieval; machine translation; natural language applications; semantic analysis; semantic problem; sentence similarity measure; sentence similarity method; syntactic information; text categorization; text summarization; three-layer sentence representation; word order; Companies; Insurance; Measurement; Resource description framework; Semantics; Syntactics; Vectors; Natural Language Processing; Sentence Representation; Sentence Similarity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014 IEEE/WIC/ACM International Joint Conferences on
Conference_Location :
Warsaw
Type :
conf
DOI :
10.1109/WI-IAT.2014.23
Filename :
6927532
Link To Document :
بازگشت