Title :
A simple trick for constructing Bayesian formulations of sparse kernel learning methods
Author :
Cawley, Gavin C. ; Talbot, Nicola L C
Author_Institution :
Sch. of Comput. Sci., East Anglia Univ., Norwich, UK
fDate :
31 July-4 Aug. 2005
Abstract :
In this paper, we present a simple mathematical trick that simplifies the derivation of Bayesian treatments of a variety of sparse kernel learning methods. The incomplete Cholesky factorisation due to (Fine and Scheinberg, 2001) is used to transform the dual parameter space, such that the covariance matrix of the Gaussian prior over model parameters becomes the identity matrix. The regularisation term is then the familiar weight-decay regulariser, allowing the Bayesian analysis to proceed straight-forwardly via the methods developed by MacKay (1992). As a bye-product, the incomplete Cholesky factorisation algorithm also identifies a subset of the training data forming an approximate basis for the remaining data in feature space, resulting in a sparse model. Bayesian treatments of the kernel ridge regression algorithm (Saunders et al., 1998), with both constant and input dependent variance structures, arc given as illustrative examples of the proposed technique, which we hope will be more widely applicable.
Keywords :
Bayes methods; covariance matrices; learning (artificial intelligence); regression analysis; Bayesian analysis; Bayesian formulation reconstruction; Cholesky factorisation; covariance matrix; dual parameter space; identity matrix; kernel ridge regression; sparse kernel learning; weight-decay regulariser; Bayesian methods; Computational complexity; Covariance matrix; Kernel; Learning systems; Principal component analysis; Sparse matrices; Statistical analysis; Support vector machines; Training data;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1556084