• DocumentCode
    2540106
  • Title

    A hybrid textual entailment system using lexical and syntactic features

  • Author

    Pakray, Partha ; Bandyopadhyay, Sivaji ; Gelbukh, Alexander

  • Author_Institution
    Comput. Sci. & Eng. Dept., Jadavpur Univ., Kolkata, India
  • fYear
    2010
  • fDate
    7-9 July 2010
  • Firstpage
    291
  • Lastpage
    296
  • Abstract
    A two-way textual entailment (TE) recognition system that uses lexical and syntactic features has been described in this paper. The hybrid TE system is based on the Support Vector Machine that uses twenty three features for lexical similarity and the output tag from a rule based syntactic two-way TE system as another feature. The important lexical features that are used in the present system are: WordNet based unigram match, bigram match, longest common subsequence, skip-gram, stemming, named entity matching and lexical distance. In the syntactic TE system, the important features used are: subject-subject comparison, subject-verb comparison, object-verb comparison and cross subject-verb comparison. The hybrid system has been developed using the collection of RTE-2 test annotated set, RTE-3 development set and RTE-3 test gold set that includes 2400 text-hypothesis pairs. Evaluation scores obtained on the RTE-4 test set (includes 1000 text-hypothesis pairs) show 55.30% precision and 58.40% recall for YES decisions and 55.93% precision and 52.80% recall for NO decisions.
  • Keywords
    natural language processing; support vector machines; text analysis; WordNet based unigram match; bigram match; cross subject-verb comparison; lexical distance; lexical features; lexical similarity; longest common subsequence; named entity matching; object-verb comparison; skip-gram; subject-subject comparison; subject-verb comparison; support vector machine; syntactic features; textual entailment recognition system; Accuracy; Databases; Measurement; NIST; Semantics; Support vector machines; Syntactics; Dependency Parsing; Dependency Relations; Lexical Distance; Textual Entailment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Informatics (ICCI), 2010 9th IEEE International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-8041-8
  • Type

    conf

  • DOI
    10.1109/COGINF.2010.5599726
  • Filename
    5599726