DocumentCode :
2727322
Title :
Knowledge Source Selection by Estimating Distance between Datasets
Author :
Yi-Ting Chiang ; Wen-Chieh Fang ; Hsu, Jane Yung-jen
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2012
fDate :
16-18 Nov. 2012
Firstpage :
44
Lastpage :
49
Abstract :
Most traditional machine learning methods make an assumption that the distribution of the training dataset is the same as the applied domain. Transfer learning omits this assumption and is able to transfer knowledge between different domains. It is a promising method to make machine learning technology become more practical. However, negative transfer can hurt the performance of the model, therefore, it should be avoided. In this paper, we focus on how to select a good knowledge source when there are multiple labelled datasets available. A method to estimate the divergence between two labelled datasets is given. In addition, we also provide a method to decide the mappings between features in different datasets. The experimental results show that the divergence estimated by our method is highly related to the performance of the model.
Keywords :
learning (artificial intelligence); knowledge source selection; machine learning methods; multiple labelled datasets; negative transfer; training dataset; transfer knowledge; transfer learning; Accuracy; Data models; Machine learning; Optimized production technology; Testing; Training; Training data; machine learning; similarity measure; transfer learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Technologies and Applications of Artificial Intelligence (TAAI), 2012 Conference on
Conference_Location :
Tainan
Print_ISBN :
978-1-4673-4976-5
Type :
conf
DOI :
10.1109/TAAI.2012.37
Filename :
6395004
Link To Document :
بازگشت