Title :
Entropy and outcome classification in reinforcement learning control
Author :
Musgrave, Jeffrey L. ; Loparo, Kenneth A.
Author_Institution :
Dept. of Syst. & Control Eng., Case Western Reserve Univ., Cleveland, OH, USA
Abstract :
Several mathematical structures are proposed for learning control systems using a wide array of techniques from a variety of disciplines. Reinforcement learning offers the greatest degree of flexibility in utilizing process information in conjunction with concepts from optical control while maintaining the basic constructs used in mathematical learning theory at the direct control layer. The adaptive layer in the control hierarchy is developed based on two fundamental properties concerning the grouping of outcomes which must be satisfied if a control policy exists for the process in terms of the defined neighborhoods. The original control objective can be interpreted in light of these two objectives and a control policy will be synthesized once these conditions are satisfied by all neighborhoods constructed during the process of learning
Keywords :
adaptive systems; hierarchical systems; learning systems; adaptive layer; control hierarchy; control policy; direct control layer; entropy; learning control systems; mathematical learning theory; mathematical structures; neighborhoods; optical control; original control objective; outcome classification; process information; reinforcement learning control; Adaptive control; Control engineering; Control system synthesis; Control systems; Entropy; Learning; Lighting control; Optimal control; Programmable control; Random processes;
Conference_Titel :
Intelligent Control, 1989. Proceedings., IEEE International Symposium on
Conference_Location :
Albany, NY
Print_ISBN :
0-8186-1987-2
DOI :
10.1109/ISIC.1989.238709