مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning to trade via direct reinforcement

DocumentCode :

1511616

Title :

Learning to trade via direct reinforcement

Author :

Moody, John ; Saffell, Matthew

Author_Institution :

Computational Finance Program, Oregon Graduate Inst. of Sci. & Technol., Beaverton, OR, USA

Volume :

Issue :

fYear :

2001

fDate :

7/1/2001 12:00:00 AM

Firstpage :

875

Lastpage :

889

Abstract :

We present methods for optimizing portfolios, asset allocations, and trading systems based on direct reinforcement (DR). In this approach, investment decision-making is viewed as a stochastic control problem, and strategies are discovered directly. We present an adaptive algorithm called recurrent reinforcement learning (RRL) for discovering investment policies. The need to build forecasting models is eliminated, and better trading performance is obtained. The direct reinforcement approach differs from dynamic programming and reinforcement algorithms such as TD-learning and Q-learning, which attempt to estimate a value function for the control problem. We find that the RRL direct reinforcement framework enables a simpler problem representation, avoids Bellman´s curse of dimensionality and offers compelling advantages in efficiency. We demonstrate how direct reinforcement can be used to optimize risk-adjusted investment returns (including the differential Sharpe ratio), while accounting for the effects of transaction costs. In extensive simulation work using real financial data, we find that our approach based on RRL produces better trading strategies than systems utilizing Q-learning (a value function method). Real-world applications include an intra-daily currency trader and a monthly asset allocation system for the S&P 500 Stock Index and T-Bills

Keywords :

decision theory; investment; learning (artificial intelligence); optimisation; stochastic systems; stock markets; DR; RRL; S&P 500 Stock Index; T-Bills; asset allocations; differential Sharpe ratio; direct reinforcement learning; financial data; forecasting models; intra-daily currency trader; investment decision-making; investment policies; monthly asset allocation system; portfolio optimization; recurrent reinforcement learning; risk-adjusted investment return optimization; stochastic control problem; trading systems; transaction costs; Adaptive algorithm; Asset management; Decision making; Dynamic programming; Investments; Learning; Optimization methods; Portfolios; Predictive models; Stochastic processes;

fLanguage :

English

Journal_Title :

Neural Networks, IEEE Transactions on

Publisher :

ieee

ISSN :

1045-9227

Type :

jour

DOI :

10.1109/72.935097

Filename :

935097

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1511616