Title :
Design and Generalization Analysis of Orthogonal Matching Pursuit Algorithms
Author :
Hussain, Zakria ; Shawe-Taylor, John ; Hardoon, David R. ; Dhanjal, Charanpal
Author_Institution :
Dept. of Comput. Sci ence, Univ. Coll. London, London, UK
Abstract :
We derive generalization error (loss) bounds for orthogonal matching pursuit algorithms, starting with kernel matching pursuit and sparse kernel principal components analysis. We propose (to the best of our knowledge) the first loss bound for kernel matching pursuit using a novel application of sample compression and Vapnik-Chervonenkis bounds. For sparse kernel principal components analysis, we find that it can be bounded using a standard sample compression analysis, as the subspace it constructs is a compression scheme. We demonstrate empirically that this bound is tighter than previous state-of-the-art bounds for principal components analysis, which use global and local Rademacher complexities. From this analysis we propose a novel sparse variant of kernel canonical correlation analysis and bound its generalization performance using the results developed in this paper. We conclude with a general technique for designing matching pursuit algorithms for other learning domains.
Keywords :
data compression; generalisation (artificial intelligence); learning (artificial intelligence); pattern matching; principal component analysis; Vapnik-Chervonenkis bounds; generalization analysis; global Rademacher complexity; kernel canonical correlation analysis; kernel matching pursuit; learning domain; local Rademacher complexity; orthogonal matching pursuit algorithm; sample compression analysis; sparse kernel principal components analysis; Algorithm design and analysis; Complexity theory; Kernel; Matching pursuit algorithms; Principal component analysis; Training; Vectors; Kernel methods; Nyström approximation; matching pursuit; principle components analysis; sample compression bounds; sparse kernel canonical correlation analysis; sparsity;
Journal_Title :
Information Theory, IEEE Transactions on
DOI :
10.1109/TIT.2011.2158880