DocumentCode :
1662613
Title :
Semi-supervised Learning Framework for Cross-Lingual Projection
Author :
Hu, PengLong ; Yu, Mo ; Li, Jing ; Zhu, CongHui ; Zhao, Tiejun
Author_Institution :
Machine Intell. & Translation Lab., Harbin Inst. of Technol., Harbin, China
Volume :
3
fYear :
2011
Firstpage :
213
Lastpage :
216
Abstract :
Cross-lingual projection encounters two major challenges, the noise from word-alignment error and the syntactic divergences between two languages. To solve these two problems, a semi-supervised learning framework of cross-lingual projection is proposed to get better annotations using parallel data. Moreover, a projection model is introduced to model the projection process of labeling from the resource-rich language to the resource-scarce language. The projection model, together with the traditional target model of cross-lingual projection, can be seen as two views of parallel data. Utilizing these two views, an extension of co-training algorithm to structured predictions is designed to boost the result of the two models. Experiments show that the proposed cross-lingual projection method improves the accuracy in the task of POS-tagging projection. And using only one-to-one alignments proves to lead to more accurate results than using all kinds of alignment information.
Keywords :
computational linguistics; identification technology; learning (artificial intelligence); resource allocation; POS tagging projection; cotraining algorithm; cross lingual projection; one-to-one alignment; parallel data; resource rich language; resource scarce language; semisupervised learning framework; syntactic divergence; word-alignment error; Accuracy; Data models; Labeling; Natural language processing; Prediction algorithms; Training; Training data; co-training; cross-lingual projection; pos tagging; semi-supervised learning; structured predictions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
Conference_Location :
Lyon
Print_ISBN :
978-1-4577-1373-6
Electronic_ISBN :
978-0-7695-4513-4
Type :
conf
DOI :
10.1109/WI-IAT.2011.58
Filename :
6040843
Link To Document :
بازگشت