DocumentCode :
1022099
Title :
Adaptive learning mechanisms for ordering actions using random races
Author :
Ng, D.T.H. ; Oommen, B.J. ; Hansen, E.R.
Author_Institution :
Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada
Volume :
23
Issue :
5
fYear :
1993
Firstpage :
1450
Lastpage :
1465
Abstract :
Consider a learning machine (LM) interacting with an environment ε. The environment offers the machine M actions. Traditionally, learning systems endeavor to compute the best action that the environment offers, and this is done without any estimation procedure. In this paper, we consider the problem of the LM computing not only the optimal action offered but also the ordering of the actions in terms of their optimality. The problem is posed in its generality and various norms of learning in this setting are formalized. Also various learning strategies are presented that use a new mathematical model called the random race. In this model the learning is modeled using M racers that are running toward a goal. At each instant, racer Ri moves toward the goal with a probability of si and stays where he is with a probability of (1-si). In the simplest learning model, the learning multiple race track (LMRT) model, the racers run on multiple tracks, and in this scenario, each racer has his own track, thus disallowing interference between the racers. However, in a more general setting, the learning single race track (LSRT) model, the racers run on a single track, and in this case, interferences between racers are specified in terms of overtaking rules. In this paper, we first examine the learning multiple race track (LMRT) model, and we have shown that in the absence of a priori information the LMRT is permutationally ε-optimal in all suggestive random environments. Other results are proven or conjectured
Keywords :
learning systems; optimisation; action ordering; adaptive learning mechanisms; learning multiple race track model; learning single race track model; overtaking rules; permutationally ε-optimal model; random races; Classification algorithms; Clustering algorithms; Inspection; Learning systems; Measurement standards; Pattern classification; Pattern recognition; Ultrasonic transducers; Ultrasonic variables measurement; Welding;
fLanguage :
English
Journal_Title :
Systems, Man and Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
0018-9472
Type :
jour
DOI :
10.1109/21.260677
Filename :
260677
Link To Document :
بازگشت