DocumentCode :
2139536
Title :
Performance Evaluation of TD-Learning Methods for Bandwidth Provisioning
Author :
Jahanshahi, M. ; Meybodi, M.R.
Author_Institution :
Islamic Azad Univ., Tehran
fYear :
2007
fDate :
16-19 Oct. 2007
Firstpage :
171
Lastpage :
176
Abstract :
Q-learning and SARSA are two methods of TD- learning. Researchers interested in this field proposed the Eligibility concept in order to speed up Q-learning and SARSA. They proved their claim by running the algorithms in a static environment. Authors of this paper have used Q-learning, SARSA and also their eligibility versions for bandwidth provisioning in DiffServ networks that is an absolutely dynamic environment. Performance of these methods in this absolutely dynamic environment is evaluated.
Keywords :
DiffServ networks; performance evaluation; DiffServ network; Q-learning; bandwidth provisioning; dynamic environment evaluation; eligibility concept; performance evaluation; temporal-difference learning; Artificial intelligence; Bandwidth; Computer architecture; Delay; Diffserv networks; Information technology; Learning systems; Stochastic processes; Testing; Videoconference;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on
Conference_Location :
Aizu-Wakamatsu, Fukushima
Print_ISBN :
978-0-7695-2983-7
Type :
conf
DOI :
10.1109/CIT.2007.131
Filename :
4385076
Link To Document :
بازگشت