مرکز منطقه ای اطلاع رساني علوم و فناوري - Regret bounds of a distributed saddle point algorithm

DocumentCode :

730455

Title :

Regret bounds of a distributed saddle point algorithm

Author :

Koppel, Alec ; Jakubiec, Felicia Y. ; Ribeiro, Alejandro

Author_Institution :

Dept. of Electr. & Syst. Eng., Univ. of Pennsylvania, Philadelphia, PA, USA

fYear :

2015

fDate :

19-24 April 2015

Firstpage :

2969

Lastpage :

2973

Abstract :

An algorithm to learn optimal actions in distributed convex repeated games is developed. Learning is repeated because cost functions are revealed sequentially and distributed because they are revealed to agents of a network that can exchange information with neighboring nodes only. Learning is measured in terms of the global networked regret, which is the accumulated loss of causal prediction with respect to a centralized clairvoyant agent to which the information of all times and agents is revealed at the initial time. We use a variant of the Arrow-Hurwicz saddle point algorithm which penalizes local agent disagreement via Lagrange multipliers and leads to a distributed online algorithm. We show that decisions made with this saddle point algorithm lead to regret whose order is not larger than O(√T), where T is the total number of rounds of the game. Numerical behavior is illustrated for the particular case of dynamic sensor network estimation across different network sizes, connectivities, and topologies.

Keywords :

learning (artificial intelligence); statistical analysis; Arrow-Hurwicz saddle point algorithm; Lagrange multipliers; causal prediction; centralized clairvoyant agent; distributed convex repeated games; distributed saddle point algorithm; distributed statistical learning; dynamic sensor network estimation; global networked regret; local agent disagreement; neighboring nodes; Cost function; Estimation; Games; Loss measurement; Network topology; Prediction algorithms; Topology; Distributed Statistical Learning; Online Convex Optimization; Sensor Network Estimation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on

Conference_Location :

South Brisbane, QLD

Type :

conf

DOI :

10.1109/ICASSP.2015.7178515

Filename :

7178515

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=730455