مرکز منطقه ای اطلاع رساني علوم و فناوري - Optimal adversarial strategies in learning with expert advice

DocumentCode :

3538708

Title :

Optimal adversarial strategies in learning with expert advice

Author :

Anh Truong ; Kiyavash, Negar

Author_Institution :

Dept. of Ind. & Enterprise Syst. Eng., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA

fYear :

2013

fDate :

10-13 Dec. 2013

Firstpage :

7315

Lastpage :

7320

Abstract :

We propose an adversarial setting for the framework of learning with expert advice in which one of the experts has the intention to compromise the recommendation system by providing wrong recommendations. The problem is formulated as a Markov Decision Process (MDP) and solved by dynamic programming. Somewhat surprisingly, we prove that, in the case of logarithmic loss, the optimal strategy for the malicious expert is the greedy policy of lying at every step. Furthermore, a sufficient condition on the loss function is provided that guarantees the optimality of the greedy policy. Our experimental results, however, show that the condition is not necessary since the greedy policy is also optimal when the square loss is used, even though the square loss does not satisfy the condition. Moreover, the experimental results suggest that, for absolute loss, the optimal policy is a threshold one.

Keywords :

Markov processes; dynamic programming; learning (artificial intelligence); recommender systems; MDP; Markov decision process; adversarial setting; dynamic programming; expert advice; greedy policy; logarithmic loss; optimal adversarial strategies; optimal strategy; recommendation system; Approximation algorithms; Dynamic programming; Educational institutions; Heuristic algorithms; Prediction algorithms; Systems engineering and theory; Vectors;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Decision and Control (CDC), 2013 IEEE 52nd Annual Conference on

Conference_Location :

Firenze

ISSN :

0743-1546

Print_ISBN :

978-1-4673-5714-2

Type :

conf

DOI :

10.1109/CDC.2013.6761050

Filename :

6761050

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3538708