DocumentCode :
2849448
Title :
Optimal strategy for concurrent variable interval reinforcement schedule
Author :
Cheng, Zhenbo ; Liang, Ming ; Deng, Zhidong
Author_Institution :
Inf. & Eng. Coll., Zhejiang Univ. of Technol., Hangzhou, China
fYear :
2010
fDate :
26-28 May 2010
Firstpage :
642
Lastpage :
647
Abstract :
Herrnstein experimentally studied the choice behavior of pigeons on a special reinforcement schedule, the concurrent variable interval (CVI) schedule, and found a famous matching law. The empirical behavior law is remarkably conserved across many kinds of species, but it has been viewed as an irrational behavior, which means that the matching behavior does not maximize reward. In this paper, we succinctly demonstrate that any strategies leading to matching law can obtain maximal rewards for the CVI reinforcement schedule in discrete time steps. In addition, we put forward a novel strategy algorithm that can earn the maximal reward in the CVI reinforcement schedule. Our results reveal that the matching behavior can be seen as a rational behavior in the reinforcement schedule.
Keywords :
behavioural sciences; learning (artificial intelligence); pattern matching; CVI reinforcement schedule; CVI schedule; concurrent variable interval reinforcement schedule; concurrent variable interval schedule; discrete time steps; irrational behavior; matching behavior; matching law; maximal reward; optimal strategy; pigeons; strategy algorithm; Animals; Biological system modeling; Computer science; Educational institutions; Information science; Intelligent systems; Laboratories; Processor scheduling; Scheduling algorithm; Uncertainty; Matching Law; Matching Strategy; Optimal Strategy; Reinforcement Schedule;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control and Decision Conference (CCDC), 2010 Chinese
Conference_Location :
Xuzhou
Print_ISBN :
978-1-4244-5181-4
Electronic_ISBN :
978-1-4244-5182-1
Type :
conf
DOI :
10.1109/CCDC.2010.5498938
Filename :
5498938
Link To Document :
بازگشت