• DocumentCode
    2101161
  • Title
    A Metric for Paraphrase Detection
  • Author
    Cordeiro, João ; Dias, Gaël ; Brazdil, Pavel
  • Author_Institution
    Centre of Human Language, Univ. of Beira Interior, Covilha
  • fYear
    2007
  • fDate
    4-9 March 2007
  • Firstpage
    7
  • Lastpage
    7
  • Abstract
    Monolingual text-to-text generation is an emerging research area in natural language processing. One reason for the interest in such generation systems is the possibility of automatically learning text-to-text generation strategies from aligned monolingual corpora. In this context, paraphrase detection can be seen as the task of aligning sentences that convey the same information but are written in different forms, thereby building a training set of rewriting examples. In this paper, we propose a new metric for unsupervised detection of paraphrases and test it over a set of standard paraphrase corpora. The results are promising: the metric outperforms state-of-the-art measures developed for similar tasks.
  • Keywords
    natural language processing; monolingual corpora; monolingual text-to-text generation; paraphrase detection; unsupervised detection; Artificial intelligence; Bioinformatics; Buildings; Computer science; Data mining; Humans; Laboratories; Particle measurements; Testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computing in the Global Information Technology, 2007. ICCGI 2007. International Multi-Conference on
  • Conference_Location
    Guadeloupe City
  • Print_ISBN
    0-7695-2798-1
  • Type
    conf
  • DOI
    10.1109/ICCGI.2007.4
  • Filename
    4137062