
Dynamic portfolio management with transaction costs

Alberto Suárez, Computer Science Department, Universidad Autónoma de Madrid, 28049 Madrid (Spain)
[email protected]

John Moody, Matthew Saffell, International Computer Science Institute, 1947 Center Street, Suite 600, Berkeley, CA 94704, USA
[email protected],[email protected]

Abstract
We develop a recurrent reinforcement learning (RRL) system that directly induces portfolio management policies from time series of asset prices and indicators, while accounting for transaction costs. The RRL approach learns a direct mapping from indicator series to portfolio weights, bypassing the need to explicitly model the time series of price returns. The resulting policies dynamically optimize the portfolio Sharpe ratio, while incorporating changing conditions and transaction costs. A key problem with many portfolio optimization methods, including Markowitz, is discovering corner solutions with weight concentrated on just a few assets. In a dynamic context, naive portfolio algorithms can exhibit switching behavior, particularly when transaction costs are ignored. In this work, we extend the RRL approach to produce better diversified portfolios and smoother asset allocations over time. The solutions we propose are to include realistic transaction costs and to shrink portfolio weights toward the prior portfolio. The methods are assessed on a global asset allocation problem consisting of the Pacific, North America and Europe MSCI International Equity Indices.

1 Introduction
The selection of optimal portfolios is a central problem of great interest in quantitative finance, one that still defies complete solution [1, 2, 3, 4, 5, 6, 7, 8, 9]. A drawback of the standard framework formulated by Markowitz [1] is that only one period is used in the evaluation of the portfolio performance. In fact, no dynamics are explicitly considered. As in many other financial planning problems, the potential improvements from modifying the portfolio composition should be weighed against the costs of the reallocation of capital, taxes, market impact, and other state-dependent factors. The performance of an investment depends on a sequence of portfolio rebalancing decisions over several periods. This problem has been addressed using different techniques, such as dynamic programming [2, 5], stochastic network programming [3], tabu search [4], reinforcement learning [7] and Monte Carlo methods [8, 9]. A key problem with many portfolio optimization methods, including Markowitz, is finding corner solutions with weight concentrated on just a few assets. In a dynamic context, naive portfolio algorithms can exhibit switching behavior, particularly when transaction costs are ignored. In this work, we address the asset management problem following the proposal of Moody et al. [6, 7], and use reinforcement learning to optimize objective functions, such as the Sharpe ratio, that directly measure the performance of the trading system. A recurrent softmax architecture learns a direct mapping from indicator series to portfolio weights, and the recurrence enables the incorporation of transaction costs. The softmax network parameters are optimized via the recurrent reinforcement learning (RRL) algorithm. We extend the RRL approach to produce more evenly diversified portfolios and smoother asset allocations over time. The solutions we propose are to incorporate realistic transaction costs and to shrink portfolio weights toward the prior portfolio. The methods are assessed on a global asset allocation problem consisting of the Pacific, North America and Europe MSCI International Equity Indices.

Figure 1: Architecture of the reinforcement learning system. The system improves on previous proposals by directly considering, when α ≠ 0, the current portfolio composition in the determination of the new portfolio weights.


2 Reinforcement learning architecture


We consider the problem of creating a dynamically managed portfolio by investing in M assets. The composition of the portfolio can be modified at specified times $t_n = t_0 + n\,\Delta T$, $n = 0, 1, 2, \ldots, N-1$. The portfolio is evaluated in terms of the profit accumulated at the final time $T = N\,\Delta T$, or of its risk-adjusted performance as measured by the Sharpe ratio. The architecture of the learning system is depicted in Figure 1. The portfolio weights predicted by the policy are a convex combination of $F_n$, the composition of the portfolio at $t_n$ prior to rebalancing, and $F_n^{(S)}(w)$, the output of a softmax network whose inputs are a constant bias term, the information set $I_n$ (either lagged information from the time series of asset returns or external economic and financial indices) and the current portfolio weights $F_n$:

$$F_n(\alpha, w) = \alpha\, F_n + (1 - \alpha)\, F_n^{(S)}(w) \qquad (1)$$
The relative importance of these two terms in the final output is controlled by a hyperparameter $\alpha \in [0, 1]$. For $\alpha = 0$, the final prediction is directly the output of the softmax network. In the absence of transaction costs, a new portfolio can be created at no expense. In this case, the currently held portfolio need not be used as a reference, and $\alpha = 0$ should be used. If transaction costs are nonzero, it is necessary to ensure that the expected return from dynamically managing the investments outweighs the cost of modifying the composition of the portfolio. The costs are deterministic and can be calculated once the new makeup of the portfolio is established. By contrast, the returns expected from the investment are uncertain. If they are overestimated (e.g. when there is overfitting), the costs will dominate and the dynamic management strategy that seeks to maximize returns by rebalancing will perform poorly. A value $\alpha > 0$ causes the composition of the portfolio to vary smoothly, which should lead to improved performance in the presence of transaction costs. The parameters of the RRL system are fixed by either directly maximizing the wealth accumulated over the training period or by optimizing an exponentially smoothed Sharpe ratio [6]. The training algorithm is a variant of gradient ascent with an adjustable learning rate, extended to take into account the recurrent terms in Eq. (1). The hyperparameters of the learning system (the learning rate, the time-scale of the exponentially smoothed Sharpe ratio, and the mixing parameter $\alpha$) can be determined by either holdout validation or cross-validation.
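To make the recurrence in Eq. (1) and the role of transaction costs concrete, the following sketch implements one policy evaluation, the cost-adjusted period return, and an exponentially smoothed Sharpe-ratio update in the spirit of [6]. It is a minimal illustration, not the paper's implementation: a simple linear softmax network and proportional costs are assumed, and all names (policy_step, net_return, delta for the cost rate, eta for the smoothing constant) are illustrative choices.

```python
import numpy as np

def softmax(z):
    """Map a vector of scores to weights that are positive and sum to one."""
    z = z - z.max()                      # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

def policy_step(F_prev, x_n, W, alpha):
    """Eq. (1): convex combination of the pre-rebalancing weights F_prev and
    the output of a (here, linear) softmax network applied to the inputs x_n."""
    F_s = softmax(W @ x_n)               # F_n^(S)(w)
    return alpha * F_prev + (1.0 - alpha) * F_s

def net_return(F_new, F_prev, r_n, delta):
    """Portfolio return over one period, charging a proportional cost delta on
    the fraction of capital traded during rebalancing (a simplifying assumption)."""
    turnover = np.abs(F_new - F_prev).sum()
    return float(F_new @ r_n) - delta * turnover

def smoothed_sharpe(A, B, R, eta):
    """Exponentially smoothed estimates of the first two moments of the returns,
    from which a moving Sharpe ratio can be formed (cf. the smoothed ratio of [6])."""
    A = A + eta * (R - A)
    B = B + eta * (R * R - B)
    return A, B, A / np.sqrt(max(B - A * A, 1e-12))

# Hypothetical usage with M = 3 assets and 4 indicator inputs per step.
rng = np.random.default_rng(0)
M, K = 3, 4
D = 1 + K + M                            # bias + indicators + previous weights
W = 0.01 * rng.standard_normal((M, D))
F = np.full(M, 1.0 / M)                  # start from the equally weighted portfolio
A = B = 0.0
for _ in range(24):
    indicators = rng.standard_normal(K)  # stand-in for the moving-average inputs
    r = 0.01 * rng.standard_normal(M)    # stand-in for monthly asset returns
    x = np.concatenate(([1.0], indicators, F))
    F_new = policy_step(F, x, W, alpha=0.5)
    R = net_return(F_new, F, r, delta=0.01)
    A, B, S = smoothed_sharpe(A, B, R, eta=0.1)
    F = F_new
```

Iterating policy_step over the indicator series produces weight trajectories of the kind shown in Figure 2; training then amounts to gradient ascent on the accumulated wealth or on the smoothed Sharpe ratio with respect to W, propagated through the recurrence as in the RRL algorithm.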

Table 1: Performance of the portfolios selected by the different strategies. The values displayed in parentheses are performance measures relative to the market portfolio: the ratio of the profit of the corresponding portfolio to the profit of the market portfolio, and the difference between the Sharpe ratio of the portfolio and the Sharpe ratio of the market portfolio.

                 Cost    Market     Markowitz            RRL
  Profit          0%     2.9084     3.1889 ( 1.0965)     3.4507 ( 1.1865)
                  1%                2.9094 ( 1.0003)     3.1825 ( 1.0942)
                  2%                2.6539 ( 0.9125)     3.1749 ( 1.0916)
                  3%                2.4205 ( 0.8322)     2.9176 ( 1.0031)
                  5%                2.0125 ( 0.6920)     2.8342 ( 0.9745)
  Sharpe ratio    0%     0.4767     0.5147 ( 0.0381)     0.5456 ( 0.0689)
                  1%                0.4804 ( 0.0037)     0.5110 ( 0.0350)
                  2%                0.4457 (-0.0309)     0.5080 ( 0.0314)
                  3%                0.4108 (-0.0658)     0.4793 ( 0.0027)
                  5%                0.3405 (-0.1362)     0.4682 (-0.0084)

3 Preliminary results and ongoing work


The performance of the reinforcement learning system is assessed on real market data and compared to the market portfolio (optimal if the market were ideally efficient) and to the tangency portfolio computed within the Markowitz framework, which is optimal in terms of the Sharpe ratio assuming zero transaction costs. The experiments are carried out using the MSCI International Equity Indices (gross level), which measure the performance of different economic regions (indices for the Pacific, North America and Europe) and of the global market (the World index) [10]. A total of 470 monthly observations, from December 31, 1969 until January 30, 2009, are used. The objective is to learn optimal policies for the dynamic management of a geographically diversified portfolio. In particular, we consider the problem of improving the performance of the World index by investing in some of its constituents: the North America, the Europe and the Pacific indices.

As inputs of the softmax network, we employ internal indices that measure the recent performance of each of the assets. The information set at time $t_n$, $I_n$, consists of moving averages of the asset returns over the previous 3, 6, 12 and 24 months. Single-month returns are not employed directly because they exhibit large fluctuations that make it difficult for the reinforcement learning system to distill any useful information from the data. Averages over periods longer than two years are probably not useful because of the changes in the economic conditions of the markets considered.

At each point in time the parameters of the model are determined employing data from the recent past; in our investigation, 10 years of data (120 points) are used. The weights of the softmax network are learned by optimizing the objective function (either profit or Sharpe ratio) on 2/3 of the training data (80 points). Training is stopped early at a maximum of the performance measured on an independent validation set containing the remaining 1/3 of the data (40 points). To make the policy more robust, the process is repeated using 10 different random partitions of the data into training and validation sets, and the final output is the average of the outputs of these 10 learning systems. The hyperparameters of the model (the learning rate, the time-scale of the moving-average Sharpe ratio, and the mixing parameter $\alpha$ that controls the contribution of the current portfolio to the rebalanced portfolio) are determined based on the performance of the trained models on a holdout set composed of 100 points. Finally, the performance of the model is measured on the last 220 points of the series.

Two performance measures are considered: the accumulated profit and the annualized Sharpe ratio, which is calculated as the quotient of the expected return and the standard deviation of the returns in the period considered. Policies are learned with different values of the transaction costs: 0%, 1%, 2%, 3% and 5%. The performance measures for the different strategies are presented in Table 1. The ratio to the market profit (shown in parentheses after the corresponding value of the profit) is greater than one when the wealth accumulated by the strategy considered is larger than that of the market portfolio. The difference with the Sharpe ratio of the market portfolio (shown in parentheses after the corresponding value of the Sharpe ratio) is negative when the strategy underperforms the market.
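As a minimal sketch of the quantities just described, the code below constructs the moving-average indicators (3, 6, 12 and 24 months) from a matrix of monthly asset returns and computes the two performance measures for a realized return series. The function names, the compound-reinvestment convention for the accumulated profit, and the sqrt(12) annualization of the monthly Sharpe ratio are assumptions made for illustration, not details taken from the paper.

```python
import numpy as np

def moving_average_features(returns, windows=(3, 6, 12, 24)):
    """Trailing moving averages of monthly asset returns.
    returns: array of shape (T, M).  Output has shape (T, M * len(windows));
    rows are NaN until enough history is available for the longest window used."""
    T, M = returns.shape
    feats = np.full((T, M * len(windows)), np.nan)
    for j, w in enumerate(windows):
        for t in range(w - 1, T):
            feats[t, j * M:(j + 1) * M] = returns[t - w + 1:t + 1].mean(axis=0)
    return feats

def accumulated_profit(period_returns):
    """Profit of reinvesting one unit of capital over the evaluation period
    (compound reinvestment is one possible convention)."""
    return float(np.prod(1.0 + period_returns) - 1.0)

def annualized_sharpe(period_returns, periods_per_year=12):
    """Mean over standard deviation of the period returns, scaled to an annual
    horizon (the sqrt(12) factor for monthly data is an assumption)."""
    mu = period_returns.mean()
    sigma = period_returns.std(ddof=1)
    return float(np.sqrt(periods_per_year) * mu / sigma)
```

The parenthesized entries of Table 1 then follow directly as the ratio of a strategy's accumulated profit to that of the market portfolio and as the difference between the two Sharpe ratios.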

For zero transaction costs, both the Markowitz portfolio and the reinforcement learning strategy perform better than the market portfolio. Since the market portfolio is never rebalanced, there is no cost associated with holding it even when transaction costs are different from zero (other than the initial investment, no transactions are needed to implement this passive management strategy). In the presence of non-zero transaction costs, the performance of the Markowitz portfolio quickly deteriorates: only for small transaction costs (1%), and only marginally in terms of the Sharpe ratio, is it better than the market portfolio. By contrast, the reinforcement learning strategy improves on the results of the market portfolio even when higher transaction costs are considered (up to 3%). However, for sufficiently high transaction costs (5%), the market portfolio outperforms the dynamic investment strategies considered.

[Figure 2 (panels): Market weights; F (cost = 0%); F (cost = 1%); F (cost = 3%). Legend: EU, NA, PA. Horizontal axis: t (months).]

Figure 2: Evolution of portfolio weights for the market portfolio (top) and for the reinforcement learning systems for different transaction costs (0%, 1% and 3%, from the top down). The figures show how the RL learners adapt their strategies to the level of transaction costs.

From the results obtained, several important observations can be made. As anticipated, the policy learned in the absence of transaction costs involves a large amount of portfolio rebalancing. At a given time, the investment is concentrated in the index that has had the best performance in the recent past. The switching observed in the portfolio weights is clearly undesirable in real markets, where transaction costs make this type of behavior suboptimal. By contrast, the policies learned by the RRL system when transaction costs are taken into account are smoother and require much less rebalancing. Furthermore, the portfolios selected are well-diversified, which is in agreement with good financial practice. The use of the current portfolio composition as a reference in the reinforcement learning architecture (Fig. 1) is crucial for the identification of robust investment policies in the presence of transaction costs.

Current work includes extending the empirical investigation of the learning capabilities and limitations of the RRL system under different conditions. In particular, it is important to analyze its performance in the presence of correlations, autoregressive structure or heteroskedasticity in the series of asset prices. Furthermore, the reinforcement learning system is being extensively tested on different financial data, and its performance compared with alternative investment strategies [11, 12]. Finally, it is also necessary to consider the possibility of investing in a risk-free asset, so that strong decreases in profit can be avoided during periods in which all the portfolio constituents lose value.

References
[1] Harry Markowitz. Portfolio selection. Journal of Finance, 7(1):77–91, 1952.
[2] Paul A. Samuelson. Lifetime portfolio selection by dynamic stochastic programming. The Review of Economics and Statistics, 51(3):239–246, August 1969.
[3] J. M. Mulvey and H. Vladimirou. Stochastic network programming for financial planning problems. Management Science, 38:1642–1664, 1992.
[4] F. Glover, J. M. Mulvey, and K. Hoyland. Solving dynamic stochastic control problems in finance using tabu search with variable scaling. In I. H. Osman and J. P. Kelly, editors, Meta-Heuristics: Theory and Applications, pages 429–448. Kluwer Academic Publishers, 1996.
[5] Ralph Neuneier. Optimal asset allocation using adaptive dynamic programming. In David S. Touretzky, Michael C. Mozer, and Michael E. Hasselmo, editors, Advances in Neural Information Processing Systems, volume 8, pages 952–958. The MIT Press, 1996.
[6] John Moody, Lizhong Wu, Yuansong Liao, and Matthew Saffell. Performance functions and reinforcement learning for trading systems and portfolios. Journal of Forecasting, 17(1):441–470, 1998.
[7] John Moody and Matthew Saffell. Learning to trade via direct reinforcement. IEEE Transactions on Neural Networks, 12(4):875–889, 2001.
[8] J. B. Detemple, R. Garcia, and M. Rindisbacher. A Monte Carlo method for optimal portfolios. The Journal of Finance, 58(1):401–446, 2003.
[9] Michael W. Brandt, Amit Goyal, Pedro Santa-Clara, and Jonathan R. Stroud. A simulation approach to dynamic portfolio choice with an application to learning about return predictability. Review of Financial Studies, 18(3):831–873, 2005.
[10] MSCI Inc. http://www.mscibarra.com/products/indices/equity/.
[11] Allan Borodin, Ran El-Yaniv, and Vincent Gogan. Can we learn to beat the best stock. Journal of Artificial Intelligence Research, 21:579–594, 2004.
[12] Amit Agarwal, Elad Hazan, Satyen Kale, and Robert E. Schapire. Algorithms for portfolio management based on the Newton method. In Proceedings of the 23rd International Conference on Machine Learning, ICML 2006, pages 9–16, New York, NY, USA, 2006. ACM.
