Title :
Adapting Component Analysis
Author :
Dorri, F. ; Ghodsi, Ali
Author_Institution :
David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
Abstract :
A main problem in machine learning is to predict the response variables of a test set given the training data and its corresponding response variables. A predictive model can perform satisfactorily only if the training data is an appropriate representative of the test data. This is usually reflected in the assumption that the training data and the test data are drawn from the same underlying probability distribution. However, the assumption may not be correct in many applications for various reasons. We propose a method based on kernel distribution embedding and Hilbert-Schmidt Independence Criterion (HSIC) to address this problem. The proposed method explores a new representation of the data in a new feature space with two properties: (i) the distributions of the training and the test data sets are as close as possible in the new feature space, and (ii) the important structural information of the data is preserved. The algorithm can reduce the dimensionality of the data while it preserves the aforementioned properties and therefore it can be seen as a dimensionality reduction method as well. Our method has a closed-form solution and the experimental results show that it works well in practice.
Keywords :
data handling; data structures; learning (artificial intelligence); statistical distributions; HSIC; Hilbert-Schmidt independence criterion; component analysis; data dimensionality; data representation; dimensionality reduction; kernel distribution embedding; machine learning; predictive model; probability distribution; response variable; test data; training data; Algorithm design and analysis; Error analysis; Kernel; Predictive models; Probability distribution; Training; Training data; Domain Adaptation; Hilbert- Schmidt independence criterion; Kernel Embedding;
Conference_Titel :
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-4649-8
DOI :
10.1109/ICDM.2012.85