Title :
Transductive Representation Learning for Cross-Lingual Text Classification
Author :
Yuhong Guo ; Min Xiao
Author_Institution :
Dept. of Comput. & Inf. Sci., Temple Univ., Philadelphia, PA, USA
Abstract :
In cross-lingual text classification problems, it is costly and time-consuming to annotate documents for each individual language. To avoid the expensive re-labeling process, domain adaptation techniques can be applied to adapt a learning system trained in one language domain to another language domain. In this paper we develop a transductive subspace representation learning method to address domain adaptation for cross-lingual text classifications. The proposed approach is formulated as a nonnegative matrix factorization problem and solved using an iterative optimization procedure. Our empirical study on cross-lingual text classification tasks shows the proposed approach consistently outperforms a number of comparison methods.
Keywords :
classification; iterative methods; learning (artificial intelligence); matrix decomposition; optimisation; text analysis; cross-lingual text classification; document annotation; domain adaptation technique; iterative optimization; language domain; learning system; nonnegative matrix factorization problem; relabeling process; transductive subspace representation learning method; Accuracy; Learning systems; Linear programming; Optimization; Standards; Training; Vectors; cross-lingual text classification; domain adaptation; representation learning;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.29