DocumentCode
2727322
Title
Knowledge Source Selection by Estimating Distance between Datasets
Author
Yi-Ting Chiang ; Wen-Chieh Fang ; Hsu, Jane Yung-jen
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2012
fDate
16-18 Nov. 2012
Firstpage
44
Lastpage
49
Abstract
Most traditional machine learning methods make an assumption that the distribution of the training dataset is the same as the applied domain. Transfer learning omits this assumption and is able to transfer knowledge between different domains. It is a promising method to make machine learning technology become more practical. However, negative transfer can hurt the performance of the model, therefore, it should be avoided. In this paper, we focus on how to select a good knowledge source when there are multiple labelled datasets available. A method to estimate the divergence between two labelled datasets is given. In addition, we also provide a method to decide the mappings between features in different datasets. The experimental results show that the divergence estimated by our method is highly related to the performance of the model.
Keywords
learning (artificial intelligence); knowledge source selection; machine learning methods; multiple labelled datasets; negative transfer; training dataset; transfer knowledge; transfer learning; Accuracy; Data models; Machine learning; Optimized production technology; Testing; Training; Training data; machine learning; similarity measure; transfer learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Technologies and Applications of Artificial Intelligence (TAAI), 2012 Conference on
Conference_Location
Tainan
Print_ISBN
978-1-4673-4976-5
Type
conf
DOI
10.1109/TAAI.2012.37
Filename
6395004
Link To Document