DocumentCode :
3159248
Title :
On the convergence of joint schemes for online computation and supervised learning
Author :
Hao Jiang ; Shanbhag, Uday V.
Author_Institution :
Dept. of Ind. & Enterprise Syst. Eng., Univ. of Illinois, Urbana, IL, USA
fYear :
2012
fDate :
10-13 Dec. 2012
Firstpage :
4462
Lastpage :
4467
Abstract :
Traditionally, the field of deterministic optimization has been devoted to minimization of functions f(x; θ*) whose parameters, denoted by θ*, are known with certainty. Supervised learning theory on the other hand considers the question of employing training data to seek a function from a set of possible functions. Instances of learning algorithms include regression schemes and support vector machines, amongst others. We consider a hybrid problem of computation and learning that arises in online settings, where one may be interested in optimizing f(x; θ*) while learning θ* through a set of observations. More generally, we consider the solution of parameterized monotone variational inequality problems, which can capture a range of convex optimization problems and convex Nash games. The unknown parameter θ* is learned through the noisy observations of a linear function of θ*, denoted by l(x; θ*). This paper provides convergence statements for joint schemes when observations are corrupted by noise in regimes where the associated variational inequality problem may be either strongly monotone or merely monotone. The proposed schemes are shown to produce iterates that converge in mean to their true counterparts. Numerical results derived from the application of these techniques to convex optimization problems and nonlinear Nash-Cournot games is shown to be promising.
Keywords :
convex programming; game theory; learning (artificial intelligence); numerical analysis; regression analysis; support vector machines; convex Nash games; convex optimization problems; deterministic optimization; inequality problems; joint scheme convergence; learning algorithms; linear function; noisy observations; nonlinear Nash-Cournot games; numerical results; online computation; regression schemes; supervised learning theory; support vector machines; Convergence; Convex functions; Games; Joints; Noise; Optimization; Random variables;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control (CDC), 2012 IEEE 51st Annual Conference on
Conference_Location :
Maui, HI
ISSN :
0743-1546
Print_ISBN :
978-1-4673-2065-8
Electronic_ISBN :
0743-1546
Type :
conf
DOI :
10.1109/CDC.2012.6425811
Filename :
6425811
Link To Document :
بازگشت