DocumentCode :
1505507
Title :
Rademacher penalties and structural risk minimization
Author :
Koltchinskii, Vladimir
Author_Institution :
Dept. of Math. & Stat., New Mexico Univ., Albuquerque, NM, USA
Volume :
47
Issue :
5
fYear :
2001
fDate :
7/1/2001 12:00:00 AM
Firstpage :
1902
Lastpage :
1914
Abstract :
We suggest a penalty function to be used in various problems of structural risk minimization. This penalty is data dependent and is based on the sup-norm of the so-called Rademacher process indexed by the underlying class of functions (sets). The standard complexity penalties, used in learning problems and based on the VC-dimensions of the classes, are conservative upper bounds (in a probabilistic sense, uniformly over the set of all underlying distributions) for the penalty we suggest. Thus, for a particular distribution of training examples, one can expect better performance of learning algorithms with the data-driven Rademacher penalties. We obtain oracle inequalities for the theoretical risk of estimators, obtained by structural minimization of the empirical risk with Rademacher penalties. The inequalities imply some form of optimality of the empirical risk minimizers. We also suggest an iterative approach to structural risk minimization with Rademacher penalties, in which the hierarchy of classes is not given in advance, but is determined in the data-driven iterative process of risk minimization. We prove probabilistic oracle inequalities for the theoretical risk of the estimators based on this approach as well
Keywords :
iterative methods; learning systems; minimisation; parameter estimation; probability; risk management; signal classification; Rademacher penalties; Rademacher process; classifiers; complexity penalties; data dependent penalty; data-driven Rademacher penalties; distribution; empirical risk minimizers; iterative approach; learning algorithms; learning problems; oracle inequalities; penalty function; structural minimization; structural risk minimization; upper bounds; Estimation theory; Extraterrestrial measurements; Iterative algorithms; Iterative methods; Mathematics; Random variables; Risk management; Statistical distributions; Training data; Upper bound;
fLanguage :
English
Journal_Title :
Information Theory, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9448
Type :
jour
DOI :
10.1109/18.930926
Filename :
930926
Link To Document :
بازگشت