DocumentCode :
3731338
Title :
Comparison of sentence similarity measures for Russian paraphrase identification
Author :
Ekaterina Pronoza;Elena Yagunova
Author_Institution :
Saint-Petersburg State University, Russian Federattion
fYear :
2015
Firstpage :
74
Lastpage :
82
Abstract :
In this paper we analyze and compare different types of sentence similarity measures applied to the problem of sentential paraphrase identification. We work with Russian, and all the experiments are conducted on the Russian paraphrase corpus we have collected from the news headlines (and are collecting at the moment). Apart from the similarity measures, we also analyze the corpus itself. As a result of the research we disprove the supposition that it is more difficult to distinguish between precise and loose paraphrases than between loose paraphrases and non-paraphrases. We also come up with the recommendations for the application of different similarity measures to identifying paraphrases derived from the news texts.
Keywords :
"Semantics","Measurement","TV"
Publisher :
ieee
Conference_Titel :
Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference (AINL-ISMW FRUCT), 2015
Type :
conf
DOI :
10.1109/AINL-ISMW-FRUCT.2015.7382973
Filename :
7382973
Link To Document :
بازگشت