Regional Information Center for Science and Technology (RICeST) - Stochastic online learning under unknown time-varying models

DocumentCode :

1800786

Title :

Stochastic online learning under unknown time-varying models

Author :

Tehrani, P. ; Qing Zhao

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of California, Davis, Davis, CA, USA

Year :

2012

Date :

4-7 Nov. 2012

Firstpage :

1046

Lastpage :

1050

Abstract :

An online learning problem under stochastic time-varying models is considered. The problem is treated as a generalization of the classic multi-armed bandit problem when the arm distributions are time-varying. The objective is to study the impact of time variation in arm distributions on the performance of the player's strategy. Sufficient conditions on the rate of model variations under which learning can or cannot improve the regret order are established.

Keywords :

game theory; learning (artificial intelligence); stochastic processes; time-varying systems; classic multiarmed bandit problem generalization; player strategy performance; regret order; stochastic online learning problem; stochastic time-varying models; sufficient condition; time-varying arm distributions; unknown time-varying model; Multi-armed bandit; online learning; time-varying models;

Language :

English

Publisher :

IEEE

Conference_Title :

Signals, Systems and Computers (ASILOMAR), 2012 Conference Record of the Forty Sixth Asilomar Conference on

Conference_Location :

Pacific Grove, CA

ISSN :

1058-6393

Print_ISBN :

978-1-4673-5050-1

Type :

conf

DOI :

10.1109/ACSSC.2012.6489178

Filename :

6489178

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1800786