DocumentCode :
1797977
Title :
Recognizing cross-lingual textual entailment with co-training using similarity and difference views
Author :
Jiang Zhao ; Man Lan ; Zheng-Yu Niu ; Donghong Ji
Author_Institution :
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
3705
Lastpage :
3712
Abstract :
Cross-lingual textual entailment is a relatively new problem that detects the entailment relationship between two text fragments written in different languages. Previous work adopted machine learning algorithms and similarity measures as features to address this task. In order to overcome the high cost of human annotation and further improve the recognition performance, we present a novel co-training approach to solve this problem. We first use an off-the-shelf machine translation tool to eliminate the language gap between two texts. Then we measure the similarities and differences between two texts and regard them as sufficient and redundant views. We use those two views to conduct the co-training procedure to perform classification. Besides, a new effective Kullback-Leibler (KL) based criterion is proposed to select the results from all possible iterations. Experiments on cross-lingual datasets provided by SemEval 2013 show that our method significantly outperforms the baseline systems and previous work.
Keywords :
language translation; learning (artificial intelligence); text analysis; Kullback-Leibler based criterion; cotraining approach; cross-lingual textual entailment recognition; human annotation; machine learning algorithm; off-the-shelf machine translation tool; Accuracy; Feature extraction; Learning systems; Prediction algorithms; Semantics; Support vector machines; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889713
Filename :
6889713
Link To Document :
بازگشت