Title :
Evaluation and classification of syntax information usage in determining short text semantic similarity
Author :
Batanovic, Vuk ; Bojic, Dejan
Author_Institution :
Elektroteh. Fak., Univ. u Beogradu, Belgrade, Serbia
Abstract :
This paper outlines and categorizes ways of using syntax information in a number of algorithms for determining short text semantic similarity. Algorithm performance was evaluated using the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. Among the described algorithms and approaches to using syntax information we identify those best suited for application in languages with limited electronic linguistic tools and, with that goal in mind, we propose a new algorithm classification.
Keywords :
computational linguistics; natural language processing; pattern classification; text analysis; Microsoft Research Paraphrase Corpus; algorithm performance evaluation; electronic linguistic tools; paraphrase detection test; short-text semantic similarity determination; syntax information usage classification; syntax information usage evaluation; Coal; Computational linguistics; Electronic mail; Knowledge discovery; Labeling; Semantics; Syntactics;
Conference_Titel :
Telecommunications Forum (TELFOR), 2013 21st
Conference_Location :
Belgrade
Print_ISBN :
978-1-4799-1419-7
DOI :
10.1109/TELFOR.2013.6716356