DocumentCode
1022099
Title
Adaptive learning mechanisms for ordering actions using random races
Author
Ng, D.T.H. ; Oommen, B.J. ; Hansen, E.R.
Author_Institution
Sch. of Comput. Sci., Carleton Univ., Ottawa, Ont., Canada
Volume
23
Issue
5
fYear
1993
Firstpage
1450
Lastpage
1465
Abstract
Consider a learning machine (LM) interacting with an environment ε. The environment offers the machine M actions. Traditionally, learning systems endeavor to compute the best action that the environment offers, and this is done without any estimation procedure. In this paper, we consider the problem of the LM computing not only the optimal action offered but also the ordering of the actions in terms of their optimality. The problem is posed in its generality and various norms of learning in this setting are formalized. Also various learning strategies are presented that use a new mathematical model called the random race. In this model the learning is modeled using M racers that are running toward a goal. At each instant, racer Ri moves toward the goal with a probability of si and stays where he is with a probability of (1-si). In the simplest learning model, the learning multiple race track (LMRT) model, the racers run on multiple tracks, and in this scenario, each racer has his own track, thus disallowing interference between the racers. However, in a more general setting, the learning single race track (LSRT) model, the racers run on a single track, and in this case, interferences between racers are specified in terms of overtaking rules. In this paper, we first examine the learning multiple race track (LMRT) model, and we have shown that in the absence of a priori information the LMRT is permutationally ε-optimal in all suggestive random environments. Other results are proven or conjectured
Keywords
learning systems; optimisation; action ordering; adaptive learning mechanisms; learning multiple race track model; learning single race track model; overtaking rules; permutationally ε-optimal model; random races; Classification algorithms; Clustering algorithms; Inspection; Learning systems; Measurement standards; Pattern classification; Pattern recognition; Ultrasonic transducers; Ultrasonic variables measurement; Welding;
fLanguage
English
Journal_Title
Systems, Man and Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
0018-9472
Type
jour
DOI
10.1109/21.260677
Filename
260677
Link To Document