• DocumentCode
    2727322
  • Title

    Knowledge Source Selection by Estimating Distance between Datasets

  • Author

    Yi-Ting Chiang ; Wen-Chieh Fang ; Hsu, Jane Yung-jen

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
  • fYear
    2012
  • fDate
    16-18 Nov. 2012
  • Firstpage
    44
  • Lastpage
    49
  • Abstract
    Most traditional machine learning methods make an assumption that the distribution of the training dataset is the same as the applied domain. Transfer learning omits this assumption and is able to transfer knowledge between different domains. It is a promising method to make machine learning technology become more practical. However, negative transfer can hurt the performance of the model, therefore, it should be avoided. In this paper, we focus on how to select a good knowledge source when there are multiple labelled datasets available. A method to estimate the divergence between two labelled datasets is given. In addition, we also provide a method to decide the mappings between features in different datasets. The experimental results show that the divergence estimated by our method is highly related to the performance of the model.
  • Keywords
    learning (artificial intelligence); knowledge source selection; machine learning methods; multiple labelled datasets; negative transfer; training dataset; transfer knowledge; transfer learning; Accuracy; Data models; Machine learning; Optimized production technology; Testing; Training; Training data; machine learning; similarity measure; transfer learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Technologies and Applications of Artificial Intelligence (TAAI), 2012 Conference on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4673-4976-5
  • Type

    conf

  • DOI
    10.1109/TAAI.2012.37
  • Filename
    6395004