Title :
Extracting the features of similarity in short texts
Author :
Kisla, Tarik ; Metin, Senem Kumova ; Karaoglan, Bahar
Author_Institution :
Bilgisayar ve Ogretim Teknolojileri Egitimi Bolumu, Ege Univ., İzmir, Turkey
Abstract :
Automatic identification of text similarity has found applications in information retrieval, text summarization, assessment of machine translation, assessment of question answering, word sense disambiguation and many more. In this work, the results of discrimant analysis applied to find out the cumulative effect of the attributes used in the literature so far (ratio of common words, text lentgths, common word sequences, synonyms, hypernyms, hyponyms) in detecting word similarity are reported.
Keywords :
feature extraction; text analysis; attributes cumulative effect; automatic identification; common word sequences; discrimant analysis; features extraction; hypernyms; hyponyms; information retrieval; machine translation assessment; question answering assessment; short texts similarity; synonyms; text lengths; text summarization; word sense disambiguation; Computational linguistics; Databases; Feature extraction; Information retrieval; Knowledge discovery; Presses; Semantics; discrimant analysis; paraphrase corpus; text similarity;
Conference_Titel :
Signal Processing and Communications Applications Conference (SIU), 2015 23th
Conference_Location :
Malatya
DOI :
10.1109/SIU.2015.7130443