DocumentCode :
1511616
Title :
Learning to trade via direct reinforcement
Author :
Moody, John ; Saffell, Matthew
Author_Institution :
Computational Finance Program, Oregon Graduate Inst. of Sci. & Technol., Beaverton, OR, USA
Volume :
12
Issue :
4
fYear :
2001
fDate :
7/1/2001 12:00:00 AM
Firstpage :
875
Lastpage :
889
Abstract :
We present methods for optimizing portfolios, asset allocations, and trading systems based on direct reinforcement (DR). In this approach, investment decision-making is viewed as a stochastic control problem, and strategies are discovered directly. We present an adaptive algorithm called recurrent reinforcement learning (RRL) for discovering investment policies. The need to build forecasting models is eliminated, and better trading performance is obtained. The direct reinforcement approach differs from dynamic programming and reinforcement algorithms such as TD-learning and Q-learning, which attempt to estimate a value function for the control problem. We find that the RRL direct reinforcement framework enables a simpler problem representation, avoids Bellman´s curse of dimensionality and offers compelling advantages in efficiency. We demonstrate how direct reinforcement can be used to optimize risk-adjusted investment returns (including the differential Sharpe ratio), while accounting for the effects of transaction costs. In extensive simulation work using real financial data, we find that our approach based on RRL produces better trading strategies than systems utilizing Q-learning (a value function method). Real-world applications include an intra-daily currency trader and a monthly asset allocation system for the S&P 500 Stock Index and T-Bills
Keywords :
decision theory; investment; learning (artificial intelligence); optimisation; stochastic systems; stock markets; DR; RRL; S&P 500 Stock Index; T-Bills; asset allocations; differential Sharpe ratio; direct reinforcement learning; financial data; forecasting models; intra-daily currency trader; investment decision-making; investment policies; monthly asset allocation system; portfolio optimization; recurrent reinforcement learning; risk-adjusted investment return optimization; stochastic control problem; trading systems; transaction costs; Adaptive algorithm; Asset management; Decision making; Dynamic programming; Investments; Learning; Optimization methods; Portfolios; Predictive models; Stochastic processes;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/72.935097
Filename :
935097
Link To Document :
بازگشت