DocumentCode :
2948971
Title :
Evaluation and classification of syntax information usage in determining short text semantic similarity
Author :
Batanovic, Vuk ; Bojic, Dejan
Author_Institution :
Elektroteh. Fak., Univ. u Beogradu, Belgrade, Serbia
fYear :
2013
fDate :
26-28 Nov. 2013
Firstpage :
821
Lastpage :
824
Abstract :
This paper outlines and categorizes ways of using syntax information in a number of algorithms for determining short text semantic similarity. Algorithm performance was evaluated using the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. Among the described algorithms and approaches to using syntax information we identify those best suited for application in languages with limited electronic linguistic tools and, with that goal in mind, we propose a new algorithm classification.
Keywords :
computational linguistics; natural language processing; pattern classification; text analysis; Microsoft Research Paraphrase Corpus; algorithm performance evaluation; electronic linguistic tools; paraphrase detection test; short-text semantic similarity determination; syntax information usage classification; syntax information usage evaluation; Coal; Computational linguistics; Electronic mail; Knowledge discovery; Labeling; Semantics; Syntactics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Telecommunications Forum (TELFOR), 2013 21st
Conference_Location :
Belgrade
Print_ISBN :
978-1-4799-1419-7
Type :
conf
DOI :
10.1109/TELFOR.2013.6716356
Filename :
6716356
Link To Document :
بازگشت