DocumentCode :
294459
Title :
The nonparametric bandit approach to machine learning
Author :
Yakowitz, Sid ; Lai, Tze-Leung
Author_Institution :
Dept. of Syst. & Ind. Eng., Arizona Univ., Tucson, AZ, USA
Volume :
1
fYear :
1995
fDate :
13-15 Dec 1995
Firstpage :
568
Abstract :
This study is intended to survey results from a different path, best known amongst statisticians as bandit methods. These techniques also have an extensive history, and in recent years, have yielded new understanding. They are applicable to the same classes of problems as neural networks and genetic algorithms. The goal of this paper is to summarize some of the more recent developments with respect to nonparametric bandit theory. In particular, for many problem classes, upper bounds for convergence rates are known, regardless of learning method, and can be achieved by bandit techniques. Moreover, as documented here, there is a comprehensive theory applicable to the case in which noises are dependent. This is important to applications in networking and queues. The theory and case studies are cited in this work should be adequate to introduce the reader to principal developments, and serve as foundation for judgement on the relative merits of this machine-learning approach
Keywords :
decision theory; learning systems; optimisation; probability; time series; convergence rates; genetic algorithms; machine learning; networking; neural networks; nonparametric bandit method; queueing theory; time series; upper bounds; Artificial intelligence; Control systems; Convergence; Genetic algorithms; History; Learning systems; Machine learning; Neural networks; Statistics; Upper bound;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Decision and Control, 1995., Proceedings of the 34th IEEE Conference on
Conference_Location :
New Orleans, LA
ISSN :
0191-2216
Print_ISBN :
0-7803-2685-7
Type :
conf
DOI :
10.1109/CDC.1995.478955
Filename :
478955
Link To Document :
بازگشت