DocumentCode :
2209716
Title :
Transfer Learning via Cluster Correspondence Inference
Author :
Long, Mingsheng ; Cheng, Wei ; Jin, Xiaoming ; Wang, Jianmin ; Shen, Dou
Author_Institution :
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear :
2010
fDate :
13-17 Dec. 2010
Firstpage :
917
Lastpage :
922
Abstract :
Transfer learning targets to leverage knowledge from one domain for tasks in a new domain. It finds abundant applications, such as text/sentiment classification. Many previous works are based on cluster analysis, which assume some common clusters shared by both domains. They mainly focus on the one-to-one cluster correspondence to bridge different domains. However, such a correspondence scheme might be too strong for real applications where each cluster in one domain corresponds to many clusters in the other domain. In this paper, we propose a Cluster Correspondence Inference (CCI) method to iteratively infer many-to-many correspondence among clusters from different domains. Specifically, word clusters and document clusters are exploited for each domain using nonnegative matrix factorization, then the word clusters from different domains are corresponded in a many-to-many scheme, with the help of shared word space as a bridge. These two steps are run iteratively and label information is transferred from source domain to target domain through the inferred cluster correspondence. Experiments on various real data sets demonstrate that our method outperforms several state-of-the-art approaches for cross-domain text classification.
Keywords :
inference mechanisms; learning (artificial intelligence); matrix decomposition; pattern classification; pattern clustering; text analysis; cluster analysis; cluster correspondence inference; many-to-many scheme; nonnegative matrix factorization; one-to-one scheme; sentiment classification; text classification; transfer learning; Cluster Correspondence Inference; Text Classification; Transfer Learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2010 IEEE 10th International Conference on
Conference_Location :
Sydney, NSW
ISSN :
1550-4786
Print_ISBN :
978-1-4244-9131-5
Electronic_ISBN :
1550-4786
Type :
conf
DOI :
10.1109/ICDM.2010.146
Filename :
5694061
Link To Document :
بازگشت