Title :
Comparison of sentence similarity measures for Russian paraphrase identification
Author :
Ekaterina Pronoza;Elena Yagunova
Author_Institution :
Saint-Petersburg State University, Russian Federattion
Abstract :
In this paper we analyze and compare different types of sentence similarity measures applied to the problem of sentential paraphrase identification. We work with Russian, and all the experiments are conducted on the Russian paraphrase corpus we have collected from the news headlines (and are collecting at the moment). Apart from the similarity measures, we also analyze the corpus itself. As a result of the research we disprove the supposition that it is more difficult to distinguish between precise and loose paraphrases than between loose paraphrases and non-paraphrases. We also come up with the recommendations for the application of different similarity measures to identifying paraphrases derived from the news texts.
Keywords :
"Semantics","Measurement","TV"
Conference_Titel :
Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference (AINL-ISMW FRUCT), 2015
DOI :
10.1109/AINL-ISMW-FRUCT.2015.7382973