Title :
Comparing reward-based optimal behaviors in user-adapted Recommender Systems
Author :
Mahmood, Tariq ; Ahmed, Syed Hammad ; Mahmood, Saqib
Author_Institution :
Nat. Univ. of CES - FAST, Karachi, Pakistan
Abstract :
Conversational Recommender Systems (CRSs) are intelligent E-commerce applications that interactively assist online users by following a default recommendation strategy. Typically, the strategy remains hard-coded during the interaction, thus making it impossible for CRSs to adapt to the dynamic user needs. In a previous paper, we have proposed and validated a novel technology that allows CRSs to autonomously improve a default strategy and learn the optimal (best) one. Our approach is based on the Reinforcement Learning (RL) paradigm, and uses a numerical reward model that is pre-selected arbitrarily by the system designers. In this context, it becomes important to determine the effect of using different reward models on the optimal system behavior. In this paper, we investigate the issue of strategy-learning under different reward model selections. We show that, even in a limited (simulated) setting, different and intelligent optimal behaviors are learnt under different rewards, and that our results largely correlate with our hypotheses about the expected optimal system behavior. We also deduce the importance of selecting the reward model carefully, based on the users´ contextual needs.
Keywords :
Internet; electronic commerce; learning (artificial intelligence); recommender systems; CRS; conversational recommender systems; default recommendation strategy; intelligent e-commerce applications; online users; optimal system behavior; reinforcement learning; reward-based optimal behaviors; strategy-learning; user-adapted recommender systems; Cognition; Conversational Recommender Systems; Optimal Recommendation Strategy; Reinforcement Learning; Variable Rewards;
Conference_Titel :
Computer Science and Information Technology (ICCSIT), 2010 3rd IEEE International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-5537-9
DOI :
10.1109/ICCSIT.2010.5564699