Title :
Regret bounds of a distributed saddle point algorithm
Author :
Koppel, Alec ; Jakubiec, Felicia Y. ; Ribeiro, Alejandro
Author_Institution :
Dept. of Electr. & Syst. Eng., Univ. of Pennsylvania, Philadelphia, PA, USA
Abstract :
An algorithm to learn optimal actions in distributed convex repeated games is developed. Learning is repeated because cost functions are revealed sequentially and distributed because they are revealed to agents of a network that can exchange information with neighboring nodes only. Learning is measured in terms of the global networked regret, which is the accumulated loss of causal prediction with respect to a centralized clairvoyant agent to which the information of all times and agents is revealed at the initial time. We use a variant of the Arrow-Hurwicz saddle point algorithm which penalizes local agent disagreement via Lagrange multipliers and leads to a distributed online algorithm. We show that decisions made with this saddle point algorithm lead to regret whose order is not larger than O(√T), where T is the total number of rounds of the game. Numerical behavior is illustrated for the particular case of dynamic sensor network estimation across different network sizes, connectivities, and topologies.
Keywords :
learning (artificial intelligence); statistical analysis; Arrow-Hurwicz saddle point algorithm; Lagrange multipliers; causal prediction; centralized clairvoyant agent; distributed convex repeated games; distributed saddle point algorithm; distributed statistical learning; dynamic sensor network estimation; global networked regret; local agent disagreement; neighboring nodes; Cost function; Estimation; Games; Loss measurement; Network topology; Prediction algorithms; Topology; Distributed Statistical Learning; Online Convex Optimization; Sensor Network Estimation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178515