• DocumentCode
    2004649
  • Title

    A fast and efficient semantic short text similarity metric

  • Author

    Croft, David ; Coupland, Simon ; Shell, Jethro ; Brown, Shannon

  • Author_Institution
    Knowledge Media Design, De Montfort Univ., Leicester, UK
  • fYear
    2013
  • fDate
    9-11 Sept. 2013
  • Firstpage
    221
  • Lastpage
    227
  • Abstract
    The semantic comparison of short sections of text is an emerging aspect of Natural Language Processing (NLP). In this paper we present a novel Short Text Semantic Similarity (STSS) method, Lightweight Semantic Similarity (LSS), to address the issues that arise with sparse text representation. The proposed approach captures the semantic information contained when comparing text to process the similarity. The methodology combines semantic term similarities with a vector similarity method used within statistical analysis. A modification of the term vectors using synset similarity values addresses issues that are encountered with sparse text. LSS is shown to be comparable to current semantic similarity approaches, LSA and STASIS, whilst having a lower computational footprint.
  • Keywords
    natural language processing; statistical analysis; text analysis; LSA similarity approach; LSS; NLP; STASIS similarity approach; STSS method; lightweight semantic similarity; natural language processing; semantic information; semantic short text similarity; semantic term similarities; sparse text representation; statistical analysis; synset similarity values; term vectors; vector similarity method; Educational institutions; Electronic mail; Measurement; Media; Natural language processing; Semantics; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence (UKCI), 2013 13th UK Workshop on
  • Conference_Location
    Guildford
  • Print_ISBN
    978-1-4799-1566-8
  • Type

    conf

  • DOI
    10.1109/UKCI.2013.6651309
  • Filename
    6651309