DocumentCode :
2568545
Title :
Dimensionality effects on the Markov property in Shape Memory Alloy hysteretic environment
Author :
Kirkpatrick, Kenton ; Valasek, John
Author_Institution :
Aerosp. Eng. Dept., Texas A&M Univ., College Station, TX, USA
fYear :
2009
fDate :
11-14 Oct. 2009
Firstpage :
2671
Lastpage :
2676
Abstract :
Shape memory alloy actuators can be used for morphing, or shape change, by controlling their temperature, which is effectively done by applying a voltage difference across their length. Control of these actuators requires determination of the relationship between voltage and strain so that an input-output map can be developed. To determine this policy and map the hysteretic region, a reinforcement learning algorithm called Sarsa was used. Proper use of reinforcement learning requires that the learning environment have the Markov property. However, hysteresis spaces are commonly referenced as non-Markovian due to the fact that state history is needed to properly predict future states and rewards. This paper reveals that this formerly non-Markovian learning environment of shape memory alloy hysteresis can become Markovian by means of increasing the dimensionality of the measured states. The paper compares learning attempts in both versions of the environment and will show that reinforcement learning is successful in the modified learning environment by learning a near-optimal policy for controlling the length of a shape memory alloy wire. This is then validated by using the modified reinforcement learning agent to learn a near-optimal control policy in an experimental setting.
Keywords :
Markov processes; actuators; control engineering computing; hysteresis; learning (artificial intelligence); optimal control; shape memory effects; size control; temperature control; Markov property; Sarsa; actuators; dimensionality effects; length control; near-optimal control policy; reinforcement learning algorithm; shape memory alloy hysteresis; temperature control; Actuators; Capacitive sensors; History; Hysteresis; Learning; Shape control; Shape memory alloys; Strain control; Temperature control; Voltage control; Markov Property; Shape Memory Alloy; hysteresis; morphing; reinforcement learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2009. SMC 2009. IEEE International Conference on
Conference_Location :
San Antonio, TX
ISSN :
1062-922X
Print_ISBN :
978-1-4244-2793-2
Electronic_ISBN :
1062-922X
Type :
conf
DOI :
10.1109/ICSMC.2009.5346132
Filename :
5346132
Link To Document :
بازگشت