DocumentCode
2788496
Title
A new topic-bridged model for transfer learning
Author
Wu, Meng-Sung ; Chien, Jen-Tzung
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
fYear
2010
fDate
14-19 March 2010
Firstpage
5346
Lastpage
5349
Abstract
In real-world information systems, there are abundant unlabeled data but sparse labeled data. It is challenging to construct an adaptive model to classify a large amount of documents containing different domains. The classifiers trained from a source domain shall perform poorly for the test data in a target domain due to the domain mismatch. In this study, we build a topic-bridged latent Dirichlet allocation (TLDA) model from a variety of labeled and unlabeled documents and perform the transfer learning for document classification. The severe change of word distributions is compensated by bridging the latent topics of source and target data which are drawn by the Dirichlet priors. A variational inference procedure is performed for semi-supervised learning. In the experiments on text categorization using 20 Newsgroups dataset, the proposed TLDA model achieved higher classification performance compared to the other methods.
Keywords
inference mechanisms; learning (artificial intelligence); pattern classification; text analysis; variational techniques; word processing; document classification; document classifier; newsgroup dataset; semisupervised learning; sparse labeled data; text categorization; topic bridged latent Dirichlet allocation; transfer learning; variational inference; Bayesian methods; Computer science; Data engineering; Knowledge transfer; Linear discriminant analysis; Predictive models; Semisupervised learning; Signal processing algorithms; Testing; Text categorization; Bayes procedures; pattern classification; text processing; text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5494947
Filename
5494947
Link To Document