DocumentCode
555756
Title
Recognizing Textual Entailment by Generality Using Informative Asymmetric Measures and Multiword Unit Identification to Summarize Ephemeral Clusters
Author
Dias, Gaël ; Pais, Sebastiao ; Wegrzyn-Wolska, Katarzyna ; Mahl, Robert
Author_Institution
HULTIG, Univ. of Beira Interior, Covilha, Portugal
Volume
1
fYear
2011
fDate
22-27 Aug. 2011
Firstpage
284
Lastpage
287
Abstract
In the context of Ephemeral Clustering of web Pages, it can be interesting to label each cluster with a small summary instead of just a label. Within this scope, we introduce the paradigm of Textual Entailment by Generality, which can be defined as the entailment from a specific web snippet towards a more general web snippet. The subjacent idea is to find the best web snippet, which summarizes and subsumes all the other web snippets within an ephemeral cluster. To reach this objective, we first propose a new informative asymmetric similarity measure called the Simplified Asymmetric InfoSimba (AISs), which can be combined with different asymmetric association measures. In particular, the AISs proposes an unsupervised language-independent solution to infer Textual Entailment by Generality and as such can help to encounter the web snippet with maximum semantic coverage. This new methodology is tested against the first Recognizing Textual Entailment data set (RTE-1)1 for an exhaustive number of asymmetric association measures with and without the identification of Multiword Units. The comparative experiments with existing state-of-the-art methodologies show promising results.
Keywords
Internet; pattern clustering; text analysis; Web pages; Web snippet; asymmetric association measures; ephemeral cluster summarization; ephemeral clustering; informative asymmetric measures; informative asymmetric similarity measure; multiword unit identification; simplified asymmetric InfoSimba; textual entailment by generality recognition; textual entailment data set recognition; unsupervised language-independent solution; Accuracy; Atmospheric measurements; Conferences; Context; Equations; Indexes; Particle measurements; Asymmetric Association Measures; Informative Asymmetric Measure; Multiword Units Identification; Textual Entailment by Generality;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location
Lyon
Print_ISBN
978-1-4577-1373-6
Electronic_ISBN
978-0-7695-4513-4
Type
conf
DOI
10.1109/WI-IAT.2011.122
Filename
6036770
Link To Document