Portfolio Probe User Manual
Portfolio Probe User Manual
Portfolio Probe User Manual
2 Copyright 2003-2010 Burns Statistics Limited. All rights reserved. http://www.burns-stat.com/ Edition 2: 2010 May 20
Contents
1 Orientation 1.1 Overview of Functionality 1.2 Necessary Tools . . . . . . 1.3 Loading the Software . . . 1.4 Road Map . . . . . . . . . 1.5 Typography Conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 13 13 14 14 17 19 19 21 24 27 29 29 33 33 35 35 36 37 37 38 38 38 39 40 40 40 41 41 42 43
2 Random Portfolio Examples 2.1 Performance Measurement: Static . . . 2.2 Testing the Eect of Constraint Bounds 2.3 Test a Risk Model . . . . . . . . . . . . 2.4 Performance Measurement: Shadowing . 2.5 Evaluating a Trading Strategy . . . . . 2.6 Going Farther . . . . . . . . . . . . . . . 3 Generating Random Portfolios 3.1 The Command . . . . . . . . . . . . . 3.2 Valuation . . . . . . . . . . . . . . . . Simple Use . . . . . . . . . . . . . . . Collapsing Values . . . . . . . . . . . . Net Asset Value . . . . . . . . . . . . . Random Weights . . . . . . . . . . . . 3.3 Working with Random Portfolios . . . Small Selections . . . . . . . . . . . . . Evaluating Portfolios . . . . . . . . . . Summary . . . . . . . . . . . . . . . . 3.4 Exporting Random Portfolios . . . . . Writing monetary value . . . . . . . . Writing compact les . . . . . . . . . . 3.5 Combining Random Portfolio Objects 3.6 Unsatisable and Dicult Constraints 3.7 Adding a Utility Constraint . . . . . . 3.8 Going Farther . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4 Constraints 45 4.1 Summary of All Constraints . . . . . . . . . . . . . . . . . . . . . 45 Round Lots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45 4.2 Monetary Value of the Portfolio . . . . . . . . . . . . . . . . . . . 45 3
4 Long-only Portfolios . . . . . . . . . . . . . . . . Long-short Portfolios . . . . . . . . . . . . . . . . Limits on Assets . . . . . . . . . . . . . . . . . . max.weight . . . . . . . . . . . . . . . . . . . . . universe.trade . . . . . . . . . . . . . . . . . . . . lower.trade and upper.trade . . . . . . . . . . . . Number of Assets . . . . . . . . . . . . . . . . . . Number of Assets to Trade . . . . . . . . . . . . Number of Assets in the Portfolio . . . . . . . . . Threshold Constraints . . . . . . . . . . . . . . . Trade Thresholds . . . . . . . . . . . . . . . . . . Portfolio Thresholds . . . . . . . . . . . . . . . . Summary of Threshold Inputs . . . . . . . . . . . Forced Trades . . . . . . . . . . . . . . . . . . . . Positions . . . . . . . . . . . . . . . . . . . . . . . Portfolio constraints . . . . . . . . . . . . . . . . Trade constraints . . . . . . . . . . . . . . . . . . Forced constraints . . . . . . . . . . . . . . . . . Universe constraints . . . . . . . . . . . . . . . . Threshold constraints . . . . . . . . . . . . . . . Tolerance . . . . . . . . . . . . . . . . . . . . . . Summary of positions inputs . . . . . . . . . . . Linear Constraints . . . . . . . . . . . . . . . . . Building Constraints . . . . . . . . . . . . . . . . Bounds and lin.style . . . . . . . . . . . . . . . . Numerical Constraints: Risk Factors . . . . . . . Numerical Constraints: Market Capitalization . . Mixing Numerical and Categorical Constraints . Portfolio Constraints versus Trade Constraints . Net Constraints versus Gross Constraints . . . . Long-side Constraints and Short-side Constraints Looking at the Eect of the Constraints . . . . . Evaluating Un-imposed Constraints . . . . . . . Inspecting Linear Constraints . . . . . . . . . . . Count Constraints . . . . . . . . . . . . . . . . . Alpha (Expected Return) Constraints . . . . . . Variance Constraints . . . . . . . . . . . . . . . . Tracking Error (Benchmark) Constraints . . . . . Single Upper Bound . . . . . . . . . . . . . . . . Scaling . . . . . . . . . . . . . . . . . . . . . . . . Lower and Upper Bounds . . . . . . . . . . . . . Multiple Benchmarks . . . . . . . . . . . . . . . . Advanced Use . . . . . . . . . . . . . . . . . . . . Distance . . . . . . . . . . . . . . . . . . . . . . . Alternative prices . . . . . . . . . . . . . . . . . . Multiple distances . . . . . . . . . . . . . . . . . Sums of Largest Weights . . . . . . . . . . . . . . Cost Constraints . . . . . . . . . . . . . . . . . . Number of Positions to Close . . . . . . . . . . . Quadratic Constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
CONTENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47 47 49 49 50 50 51 51 52 52 52 53 53 54 54 55 56 56 56 56 57 57 57 58 59 61 62 63 63 63 64 65 66 66 67 69 70 71 71 71 71 72 72 72 73 73 74 75 75 76
4.3
4.4
4.5
4.6 4.7
4.8
4.13
CONTENTS Add Constraints to the Variance . . . . . Impose Constraint Bounds . . . . . . . . . Dummy Run . . . . . . . . . . . . . . . . Check for Benchmark . . . . . . . . . . . Constraints out of Utility . . . . . . . . . Actual Computation . . . . . . . . . . . . 4.18 Constraint Penalties and Soft Constraints 5 Details of Random Portfolio Examples 5.1 Performance Measurement: Static . . . 5.2 Testing the Eect of Constraint Bounds Static Utility . . . . . . . . . . . . . . . Caching Variances . . . . . . . . . . . . Dynamic Utility . . . . . . . . . . . . . 5.3 Test a Risk Model . . . . . . . . . . . . Ex Ante Volatility Comparisons . . . . . Ex Ante versus Realized Volatility . . . 5.4 Performance Measurement: Shadowing . 5.5 Evaluating a Trading Strategy . . . . . 5.6 Going Farther . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6 Optimizing Long-Only Portfolios 6.1 Required Inputs . . . . . . . . . . . . . . . . . . . . . . . Monetary Value . . . . . . . . . . . . . . . . . . . . . . . Utility . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.2 Examples for Passive Portfolios . . . . . . . . . . . . . . Minimize the Variance of the Portfolio . . . . . . . . . . Minimize Tracking Error . . . . . . . . . . . . . . . . . . 6.3 Examples for Active Portfolios . . . . . . . . . . . . . . Maximize the Information Ratio . . . . . . . . . . . . . The Information Ratio with a Tracking Error Constraint Maximize Benchmark-relative Information Ratio . . . . Mean-Variance Optimization . . . . . . . . . . . . . . . Mean-Volatility Optimization . . . . . . . . . . . . . . . Buy-Hold-Sell List . . . . . . . . . . . . . . . . . . . . . 6.4 Utility-free Optimization . . . . . . . . . . . . . . . . . . 6.5 Managing Cash Flow . . . . . . . . . . . . . . . . . . . . Injecting Money into a Portfolio . . . . . . . . . . . . . Extracting Money out of a Portfolio . . . . . . . . . . . 6.6 Asset Allocation . . . . . . . . . . . . . . . . . . . . . . 6.7 Going Farther . . . . . . . . . . . . . . . . . . . . . . . . 7 Optimizing Long-Short Portfolios 7.1 Required Inputs . . . . . . . . . . . . . . . . . . . . . . Monetary Value . . . . . . . . . . . . . . . . . . . . . . Utility . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.2 Examples . . . . . . . . . . . . . . . . . . . . . . . . . Maximize the Information Ratio . . . . . . . . . . . . Maximize Return with a Bound on the Variance . . . Minimize Variance Given a Long List and a Short List . . . . . . .
6 Mean-Variance Optimization . . . . Managing Cash Flow . . . . . . . . . Injecting Money into a Portfolio . . Extracting Money out of a Portfolio Money Constraints . . . . . . . . . . Real-Time Monitoring . . . . . . . . Going Farther . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
CONTENTS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 105 105 105 106 106 108 109 109 109 110 112 112 113 113 114 115 115 115 116 116 117 118 119 120 120 121 121 121 123 124 126 126 127 127 127 127 128 129 129 129
7.3
8 General Use 8.1 Setting Up Data . . . . . . . . . . . . . . Prices and Other Imports . . . . . . . . . Variance Matrix . . . . . . . . . . . . . . Adding a Benchmark to the Variance . . . 8.2 The Random Generation or Optimization 8.3 Post-Optimization . . . . . . . . . . . . . Explore the Trade . . . . . . . . . . . . . Export the Trade . . . . . . . . . . . . . . 9 Trading Costs 9.1 Background . . . . . . . . . . . . . . 9.2 Specifying Costs . . . . . . . . . . . Linear Costs . . . . . . . . . . . . . . Nonlinear Costs . . . . . . . . . . . . Polynomial Costs . . . . . . . . . . . 9.3 Power Laws . . . . . . . . . . . . . . 9.4 On Scaling Costs Relative to Utility 9.5 Costs Due to Taxes . . . . . . . . . . 9.6 Going Farther . . . . . . . . . . . . . 10 Practicalities and Troubleshooting 10.1 Easy Ways to Be Wrong . . . . . . . Data Mangling . . . . . . . . . . . . Input Mangling . . . . . . . . . . . . 10.2 Suppressing Warning Messages . . . 10.3 Cheatsheets . . . . . . . . . . . . . . Implied Ranges . . . . . . . . . . . . Threshold Inputs . . . . . . . . . . . Positions Inputs . . . . . . . . . . . . 10.4 Troubleshooting . . . . . . . . . . . . Utility Problems . . . . . . . . . . . Portfolio Problems . . . . . . . . . . 10.5 S Language Problems and Solutions Creating Matrices . . . . . . . . . . Debugging . . . . . . . . . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
. . . . . . . . . . . . . .
11 Special Instructions 131 11.1 Special Instruction 1: Long-only when shorts exist . . . . . . . . 131 11.2 Special Instruction 2: Benchmark in long-short optimization . . . 131
CONTENTS 12 Adjusting Optimization Speed 12.1 Staying at a Given Solution . 12.2 Reducing Time Use . . . . . . 12.3 The Optimization Process . . 12.4 Improving Quality . . . . . . 12.5 Testing Optimization Quality 13 Utility 13.1 Maximum Information Ratio 13.2 Mean-Variance Utility . . . . 13.3 Mean-Volatility Utility . . . . 13.4 Minimum Variance . . . . . . 13.5 Maximum Expected Return . 13.6 Minimum Distance . . . . . . 13.7 Going Farther . . . . . . . . . and Quality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7 133 133 134 135 136 136 139 139 140 141 141 141 142 142 143 143 144 145 147 148 149 150 150 152 153 154 155 156 161 161 163 163 163 163 164
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
14 Advanced Features 14.1 Multiplicity . . . . . . . . . . . . . . . . 14.2 Alpha and Variance Tables . . . . . . . 14.3 Variance Constraints . . . . . . . . . . . 14.4 Expected Return Constraints . . . . . . 14.5 Multiple Utilities . . . . . . . . . . . . . 14.6 Utility Tables . . . . . . . . . . . . . . . 14.7 Multiplicity Examples . . . . . . . . . . Dual Benchmarks . . . . . . . . . . . . . Benchmark-relative Utility and Absolute Rival Variance Forecasts . . . . . . . . . Multiple Time Periods . . . . . . . . . . Credit Risk . . . . . . . . . . . . . . . . Multiple Scenarios . . . . . . . . . . . . 14.8 Compact Variance Objects . . . . . . . The Variance List . . . . . . . . . . . . . 15 Dregs 15.1 Portfolio Probe Constituents 15.2 The Objectives . . . . . . . . 15.3 Writing C or C++ Code . . . 15.4 Bug Reporting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Variance Constraint . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
CONTENTS
List of Tables
2.1 2.2 4.1 Simple analyses of the MACD funds. . . . . . . . . . . . . . . . . Performance measurement using initial positions. . . . . . . . . . Summary of constraints. . . . . . . . . . . . . . . . . . . . . . . . 20 28 46
10.1 Implied ranges of arguments. . . . . . . . . . . . . . . . . . . . . 127 10.2 The meaning of threshold inputs. . . . . . . . . . . . . . . . . . 127 10.3 The column order for the positions argument. . . . . . . . . . . 128 14.1 Arguments for multiple variances and expected returns. . . . . . 144 14.2 Utilities codes for the utility table. . . . . . . . . . . . . . . . . . 150
10
LIST OF TABLES
List of Figures
1.1 1.2 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 2.10 2.11 2.12 2.13 2.14 2.15 7.1 Some suggested routes through the document. . . . . . . . . . . . Possible route for utility-free optimization. . . . . . . . . . . . . . MACD funds compared to random portfolios: 2007. . . . . . . . MACD funds compared to random portfolios: 2007 and 2008. . . Mean-variance utility for calendar year 2007. . . . . . . . . . . . Approximate minimum volatility. . . . . . . . . . . . . . . . . . . Mean-variance utility from overlapping 60-day windows. . . . . . Shrinkage versus factor model estimates of volatility for 2007 Q4. Shrinkage versus factor model estimates of volatility for 2008 Q4. Quarterly correlations shrinkage versus realized: long-only. . . . . Quarterly correlations shrinkage versus realized: dollar neutral. . Quarterly correlations shrinkage versus realized: 120/20. . . . . . Performance measurement: 200% turnover for 2007. . . . . . . . Performance measurement: 200% turnover for 2007-2008. . . . . MACD ecacy for Q1 of 2008. . . . . . . . . . . . . . . . . . . . MACD ecacy for the rst half of 2008. . . . . . . . . . . . . . . MACD ecacy for the year 2008. . . . . . . . . . . . . . . . . . . 15 16 20 21 22 23 23 25 25 26 26 27 28 28 30 30 31
11
12
LIST OF FIGURES
Chapter 1
Orientation
This chapter has diverse aims: It provides a brief overview of Portfolio Probe functionality. It explains what software you need in order to run Portfolio Probe. It suggests a route through the rest of this document, given a task and a state of mind. It presents the typographic conventions of the document.
1.1
Overview of Functionality
The two primary aims of Portfolio Probe are: To generate random portfolios (the random.portfolio function). To optimize a portfolio (the trade.optimizer function). The two functions have the same inputs (except for saying how many random portfolios you would like, and whether you want random portfolios or random trades). All of the rest of Portfolio Probe is support for these two tasks.
1.2
Necessary Tools
You need to choose a language in which to run Portfolio Probe. It can be one of three: R, which can be downloaded for free via: http://www.r-project.org/ There are some commercial distributions of R as well. S-PLUS, sold by TIBCO:
http://spotre.tibco.com/ 13
14
CHAPTER 1. ORIENTATION C or C++. You can call Portfolio Probe functionality in a program that you write.
The last option of using C is not recommendedit requires considerable eort, and likely has little or no benet. S-PLUS and R are versions of the S language, and Portfolio Probe has been written to work with either version of S. This document assumes you are using S (as opposed to using C code). Portfolio Probe uses only very general features of S so it should run the same in any version of R or S-PLUS. When this document says S, it means either R or S-PLUSthe term S should not be construed to mean only S-PLUS. Some of the examples explicitly assume R is being usedthe same eect would be done slightly dierently in S-PLUS. Programming experience is not mandatorywhatever your objective, there is likely to be an example provided in this manual that is close to your case. The present document assumes knowledge of S to the level of Some Hints for the R Beginnera brief, structured introduction which can be found in the Tutorials section of http://www.burns-stat.com/. Commands beyond that level are included and explained. While it is reasonably easy to start using Portfolio Probe, there is a lot of room to grow. Portfolio Probes exibility and the power of S can carry you a long way.
1.3
If Portfolio Probe was installed in the default place, then Portfolio Probe is loaded into an R session on Windows with: > library(PortfolioProbe) An analogous statement is used under Linux. It is possible that the lib.loc argument to library may be required. In S-PLUS it will depend on the particular installation, but something similar is likely.
1.4
Road Map
We suggest that there is one of two frames of mind that you are likely to have: Conceptual: primarily wanting to understand the task Operational: primarily wanting to do the task With the choice of two tasks, that produces four possible routes through the document. Figure1.1 is a graphical view of the suggestions. Note that not all chapters appear in the maps. A lot of the document can be reserved for reference as the need arises. In particular if you are only generating random portfolios, you can safely ignore several of the chapters. If you are only interested in utility-free optimization, then Figure 1.2 shows a possible route through this document. This supposes that you will impose a turnover constraint rather than providing trading costs for the assets.
15
Conceptual
Chapter 2: Examples
Operational
Chapter 2: Examples Chapter 8: General Use Chapter 3: Generate Random Chapter 4: Constraints Chapter 5: Example Code Chapter 10: Practicalities
Generate Random
Chapter 3: Generate Random Chapter 4: Constraints Chapter 8: General Use Chapter 5: Example Code Chapter 10: Practicalities
Optimize
Chapter 9: Trade Cost Chapter 4: Constraints Chapter 13: Utility Chapter 9: Trade Cost
16
CHAPTER 1. ORIENTATION
Chapter 4: Constraints
17
1.5
Typography Conventions
Computer commands or pieces of commands are written in this font. For example, a variance argument is written as variance whenever it is the argument itself that is referred to. Entire commands are written in the same font. Commands are introduced by > (which is the prompt in the S language) so that any output can be distinguished from the input. An example is: > rep(0, 6) [1] 0 0 0 0 0 0 The user types rep(0, 6) (followed by return), and the next line is the response from S. Commands may span more than a single linethe second and subsequent lines of a command are introduced by +. For example: > op <- trade.optimizer(prices, varian, gross.value=1e6, + long.only=TRUE) The second line of the command starts with long.only (the + is not typed by the user, but rather by S). There is no output in this example. S code note The only catch with multi-line commands is that it needs to be clear to S that the command is incomplete. In this example the command needs a closing parenthesis. Occasionally a fragment of code is written, in which case there are no introductory prompts. In addition to S code notes, there are boxes which contain cautions and notes.
18
CHAPTER 1. ORIENTATION
Chapter 2
2.1
Suppose we have a particular fund that we would like to evaluate. We can generate a number of random portfolios that obey the constraints of the fund. The fund manager might have selected any one of the random portfolios that we generate. We can compare the selection the fund manager did make with the selections of the proverbial monkeys. The static method of performance measurement is to create a set of random portfolios that satisfy the funds constraints at the start of the period, assume that we hold these portfolios throughout the time period, and compare the return of the actual portfolio to the distribution of returns from the random portfolios. The information that we need in order to apply this method is: the constraints under which the fund operates While it would be best if we knew the constraints exactly, approximations of the constraints can be used. For the examples of this section two funds were created by starting with a specic portfolio at the start of 2007 and using the constraints: long-only the number of assets in the portfolio can be no more than 20 the maximum weight for any asset is 10% Every day throughout 2007 and 2008 an optimal trade was made according to an MACD signal (the default from the MACD function in the TTR R package 19
20
CHAPTER 2. RANDOM PORTFOLIO EXAMPLES Figure 2.1: MACD funds compared to random portfolios: 2007.
0.04
Density
0.00
0.01
0.02
0.03
40
60
Table 2.1: Simple analyses of the MACD funds. 200% turnover 25.8% 91 -32.7% 3 400% turnover 28.6% 96 -24.2% 31
[Ulrich, 2009]). Note that this is meant neither to praise nor to pick on MACD it is merely an easy way of producing expected returns that you can replicate yourself if you wish. The dierence between the two funds is that one has a target of 200% annual turnover (buys plus sells) while the other has a 400% turnover target. Figure 2.1 shows the results for 2007. Both funds did reasonably well. The situation changes if we extend the analysis through 2008Figure 2.2 displays those results. Table 2.1 shows the numerical values for the analyses. The percentile shows where the return of the actual fund falls among the returns of the random portfolios. One failing of this analysis is that the random portfolios are not guaranteed to obey the constraints at the end of the period. Even so, the results are likely to be useful, and the eort to create the analysis is minimal. The more elaborate analysis in Section 2.4 overcomes the constraint violation problem as well as being substantially more sensitive. The number of random portfolios that were generated was 1000. This is not enough to make the distribution plots completely smooth, but is more than
2.2. TESTING THE EFFECT OF CONSTRAINT BOUNDS Figure 2.2: MACD funds compared to random portfolios: 2007 and 2008.
0.06
21
Density
0.00
0.02
0.04
40
30
20
10
2.2
Constraints are put on portfolios as a form of insurance. We are willing to give up some up-side in order to be protected from down-side. In general, we are striking that bargain blindlywe dont know the up-side we are forgoing, and we dont know the extent of the protection we are getting. Random portfolios allow us to examine the trade-o over historical periods. For the examples of this section, the constraints that are imposed are: long-only no more than 30 assets no asset with a weight greater than 10% and some times volatility no more than some value Figure 2.3 shows the distribution of mean-variance utility with risk aversion 2 for portfolios that are held throughout 2007. The blue line indicates the distribution for portfolios that have the additional constraint that the volatility is no more than 8% (as measured ex ante at the beginning of the year). We see that we lose a lot of up-side with the volatility constraint. But this is merely a snapshotit tells us what happened to have occurred, but it doesnt indicate what we can expect to occur.
22
CHAPTER 2. RANDOM PORTFOLIO EXAMPLES Figure 2.3: Mean-variance utility for calendar year 2007.
3.0
Density
0.0
0.5
1.0
1.5
2.0
2.5
0.5
0.0
0.5
1.0
1.5
2.0
Suppose that we are interested in utility over a quarter (or, for convenience, 60 trading days). We can move that 60-day window one day at a time and get dierent answers. We have two years of data (2007 and 2008) in our example, so we have roughly 440 windows. There are at least two approaches. One is to form the portfolios with their constraints at the start of 2007 and never change them. That is easy to do, but probably doesnt answer the question that we really have in mind. The second approach forms random portfolios at the start of each window and uses these to calculate the utility that is realized for the window. Often the constraints will be static throughout, even though the portfolios change with each window. In this case, imposing a static volatility constraint is a problem we start by looking at what the 8% volatility constraint looks like in the broader context. Figure 2.4 shows the minimum (ex ante) volatility that is achievable throughout the time period we are using. Clearly we cant always impose a constraint that volatility is no more than 8% since the minimum possible is greater than 8% for most of the period. The 8% constraint on the rst day is approximately 120% of the minimum volatility. So well use 120% of the minimum volatility as the upper limit on volatility when it is constrained. Figure 2.5 compares the rolling distributions of utility with and without the volatility constraint. We see bimodal distributions. There is what might be considered a normal mode and a disaster mode. In normal mode, the volatility constraint reduces the up-side about the same amount as the down-side. In disaster mode, the volatility constraint is very useful. So this looks quite good for the volatility constraint. Note, though, that this is quite a short time period from which to make generalizations.
23
Q1
Q2
Q3 2007
Q4
Q1 2008
Q2
Q3
10
24
2.3
We have a wide choice of risk models. If we could compare results of risk models on a large number of realistic portfolios, we could make informed decisions about the best risk models for particular tasks. Random portfolios can provide those realistic portfolios in a convenient form. This section merely provides a brief taster of what is possible. The BurStFin R package contains two functions for estimating variance matrices (see page 110). One estimates a statistical factor model, the other shrinks the sample variance towards the equal correlation matrix. Each function was given 250 days of returns and used default values of all arguments. Since there are only 70 assets, this is a rather atypical case for these estimators. Often there are more assets than time points. Even so, the amount of shrinkage towards equal correlation ranged from about 17% to about 32%. One thousand random portfolios were generated at the start of each of eight quarters. The constraints were: long-only no more than 30 assets in the portfolio 5% maximum weight for each asset the sum of the 4 largest weights is no more than 15% Throughout this section the last three constraints are maintained, but the rst is sometimes changed. Figure 2.6 compares the ex ante estimates of volatility from the two models with data up to the start of the fourth quarter of 2007. The estimates agree very well. Figure 2.7 is a similar plot, but for the fourth quarter of 2008. We would like to see high correlations between the estimated and realized volatilities for the portfolios. Figure 2.8 shows the correlation for each quarter between the shrinkage estimate of volatility and the realized volatility. The red dashed lines represent 95% condence intervals for the correlation as found by a statistical bootstrap. The correlation between the factor model and realized is very similar. As we need to expect, the predicted level of volatility in the last quarter of 2008 is severely lower than the realized volatilitythe actual volatilities tended to be more than twice the predicted. Two more sets of random portfolios were generated. One set was long-short dollar neutral (zero net value). The other was 120/20 long-short. In both cases the same constraints of 30 assets, 5% maximum weight and 15% sum of the 4 largest weights were imposed. Figure 2.9 shows the quarterly correlations between the ex ante shrinkage estimate and the realized volatility for the dollar neutral portfolios. Figure 2.10 shows the correlations for the 120/20 portfolios. Random portfolios give you the tools to select the best variance estimators for the types of portfolios that you hold, and allow you to see how good that is.
25
Figure 2.6: Shrinkage versus factor model estimates of volatility for 2007 Q4.
12.5
13.0
13.5
14.0
14.5
15.0
15.5
Figure 2.7: Shrinkage versus factor model estimates of volatility for 2008 Q4.
30 Shrink to equal correlation 22 24 26 28
22
24
26
28
30
26
07Q2
07Q3
07Q4
08Q1
08Q2
08Q3
08Q4
07Q2
07Q3
07Q4
08Q1
08Q2
08Q3
08Q4
27
0.55 07Q1
0.65
0.75
07Q2
07Q3
07Q4
08Q1
08Q2
08Q3
08Q4
2.4
Section 2.1 presents the static method of performance measurement. While that method is more powerful than peer groups and benchmarks, we can often do even better. If we know some or all of the positions of the portfolio at the beginning of the test period, random portfolios can give us a much more sensitive test of performance. The information that we need for this method is: the constraints under which the fund operates some or all of the positions at the start of the time period the known or approximate turnover of the fund throughout the period With this method we imitate the trading of the fund throughout the period. We get one random portfolio by means of a series of random trades starting with what we know about the initial portfolio. We then repeat that a number of times. Lets revisit the example we saw in Section 2.1. We know the full portfolio at the start of the period, and we know the typical daily trading. We use this knowledge to get the distribution of returns with zero skill. Figure 2.11 shows the results for the year 2007 with 200% turnover, while Figure 2.12 shows the 2007-2008 results for 200% turnover. The strategy is very clearly outperforming chance. The results are even more extreme for 400% turnover. Table 2.2 shows the numerical results.
28
10
15
20
25
38
36
34
32
Table 2.2: Performance measurement using initial positions. 200% turnover 25.8% 100 -32.7% 99 400% turnover 28.6% 100 -24.2% 100
29
2.5
It is quite common to perform a backtest of a strategy. However, the backtest itself only shows how much the strategy makes or loses in the given situation. It doesnt provide any information on whether it is luck or skill. Random portfolios can be used to help decide that. The previous section discussed performance measurement using the shadowing method. We can do the same thing to test a trading strategy. One key dierence is that we are doing it ex ante rather than ex post. Plus, we know the constraints and the trading exactly. Another dierence is that with performance measurement we are interested in the particular portfolio that was actually held at the start of the period, but with evaluating a strategy we want the performance to be good no matter where we start. Operationally the real dierence is that when evaluating a trading strategy we create a set of random portfolios at which to start and we see how the strategy does during the time period starting at each of those portfolios. That is, we repeat the shadowing process a number of times. In the example we create 20 initial portfolios at the beginning of 2008, and see how the MACD strategy performs after 1 quarter, 2 quarters and the whole year. We have the constraints: long-only no more than 20 names in the portfolio no asset weight greater than 10% 200% turnover target Figure 2.13 shows how good MACD was for these constraints during the rst quarter of 2008. It has close to zero value on averagethat is, it is neither good nor bad. However, for a specic starting portfolio it tends to either signicantly outperform or signicantly underperform. If the strategy truly had no eect, then the percentiles would fall roughly along the diagonal line. That the percentiles tend to be zero or one hundred is an indication of the high power of the technique. (Each strategy is compared to only 100 random runs, but adding more random runs would at most move the percentiles slightly.) Figure 2.14 shows how the strategy did for the rst half of the year. It did very wellthe strategy outperformed all random runs for each of the starting portfolios. The picture for the whole year (Figure 2.15) is almost at the other extreme. For the whole year the strategy signicantly underperformed with almost all of the starting points.
2.6
Going Farther
Chapter 3 discusses the function that generates random portfolios. Chapter 4 discusses the constraints that may be imposed. Chapter 5 shows the commands that created the examples that were displayed in this chapter.
30
20
40
60
80
100
Theoretical quantiles
20
40
60
80
100
Theoretical quantiles
31
20
40
60
80
100
Theoretical quantiles
32
Chapter 3
3.1
The Command
To generate random portfolios you give random.portfolio the number of random portfolios that you want to generate, the basic information for the problem, and the constraints that you would like. There is also the out.trade argument which controls whether it is the random portfolio (the default) or the trade which is output. At a minimum you need to specify the vector of prices and the amount of money in the portfolio. One possibility is:
> randport1 <- random.portfolio(prices=prices, long.only=TRUE, + gross.value=1e6) Of course this is not very interesting. You are likely to want more than one random portfolio, and to have non-trivial constraints imposed. S code note The notation 1e6 means 1,000,000, that is, a one followed by six zeros. The prices (always required) needs to be a vector of positive numbers that has names which identify the assets in the problem. Here is an example of the rst few values of a suitable price vector: > head(pricevec) stockA stockB stockC stockD stockE stockF 27.63 19.46 11.67 5.79 5.15 20.99 33
34
The assets named in prices dene the universe of assets for the problem. In examples the prices argument is usually given a vector called pricesin reality the name of the vector can be whatever you like. The other two pieces of basic information are the variance matrix and the vector of expected returnsneither of these are required. You only need to give these when there is a pertinent constraint. To generate 100 random portfolios that have country and sector constraints, no more than a 4% tracking error and no more than 55 assets, the following command would do: > randport2 <- random.portfolio(100, prices, varian, + long.only=TRUE, gross.value=1e6, + bench.constraint = c(spx=.04^2/252), + port.size=55, lin.constraints=cntrysect.constraint, + lin.bounds=cntrysect.bounds) S code note The rst three arguments in the call that creates randport2 do not need to have the name of the argument specied because they are all in the order of the arguments in the denition of the function. In contrast the call that creates randport1 uses the argument name in all cases. If there is any doubt, then it is safest to give the argument by name. The examples so far assume that there is no existing portfolio (or that it doesnt matter). The existing argument gives the current portfolio. > randport3 <- random.portfolio(100, prices, varian, + long.only=TRUE, gross.value=1e6, + bench.constraint = c(spx=.04^2/252), + existing=current.portfolio, + port.size=55, lin.constraint=cntrysect.constraint, + lin.bounds=cntrysect.bounds) Sometimes it is more convenient to have the trades rather than the portfolios. If you want the trades, just set the out.trade argument to TRUE: > randtrade3 <- random.portfolio(100, prices, varian, + long.only=TRUE, gross.value=1e6, + bench.constraint = c(spx=.04^2/252), + existing=current.portfolio, + port.size=55, lin.constraint=cntrysect.constraint, + lin.bounds=cntrysect.bounds, out.trade=TRUE) S code note The full name of the out.trade argument must be given. This is unlike almost all other arguments where only enough of the rst portion of the name needs to be given to make it unique among the arguments to the function. (For
3.2. VALUATION
35
a full explanation of argument matching in S, see [Burns, 1998] page 19 or [Burns, 2009].) If the existing argument is not given or is NULL, then it doesnt matter which value out.trade hasthe output is the same in either case. The result of a call to random.portfolio is a list where each component of the list is a portfolio (or trade). The object has a number of attributes including a class attribute ("randportBurSt"). Here is a small example: > random.portfolio(2, priceten, gross.value=1e5, + long.only=TRUE, port.size=3, max.weight=.5) [[1]] stockA stockB stockJ 1337 1481 6484 [[2]] stockB stockF stockH 1274 2382 3100 attr(,"call") random.portfolio(number.rand = 2, prices = priceten, gross.value = 1e+05,long.only = TRUE, port.size = 3, max.weight = 0.5) attr(,"timestamp") [1] "Thu Oct 29 11:36:03 2009" attr(,"class") [1] "randportBurSt" seed attribute begins: 1 -1142929704 1716596987 -285978235 Each component of the list is a portfolio (or trade), which is a vector giving the number of asset units (shares, lots, contracts) for each asset that appears.
3.2
Valuation
Random portfolios are seldom interesting in their own right, generally it is necessary to manipulate them to get the information we want. This section and the next discuss several common tasks.
Simple Use
The valuation method for random portfolios can give a variety of results relating to the monetary content of the random portfolios. The simplest usage is to give a vector of prices in addition to the random portfolio object: > head( valuation(ranport, pricevec) [[1]] stockB stockC stockJ 38.92 50461.08 -49500.00 [[2]] stockB stockI stockJ -28625.66 50497.42 -20871.84 , 2)
36
What we get is the amount of money in each position in each portfolio. Any vector of prices (that contains the assets in the random portfolio object) can be used. It can be the price vector used to create the random portfolios, or some other price vector. A matrix of prices can be given as well. The columns should correspond to assets: > head( [[1]] 20091012 20091013 20091014 [[2]] valuation(ranport, pricemat) , 2) stockB stockC stockJ 38.92 50461.08 -49500 38.34 52882.52 -46875 36.20 53574.36 -48750
stockB stockI stockJ 20091012 -28625.66 50497.42 -20871.84 20091013 -28199.07 50358.50 -19765.00 20091014 -26625.10 52650.68 -20555.60 The price matrix we used looks like: > pricemat[, 1:4] stockA stockB stockC stockD 20091012 27.63 19.46 11.67 5.79 20091013 29.47 19.17 12.23 5.85 20091014 27.86 18.10 12.39 5.43
Collapsing Values
Often we dont care about the individual positions, but we do care about the gross value of the portfolios. Use the collapse argument to get this: > valuation(ranport, pricevec, collapse=TRUE) [1] 100000.00 99994.92 99992.25 99993.29 This has length 4 because that is the number of portfolios in ranport. Perhaps the most useful is to use collapse and a matrix of prices: > valuation(ranport, pricemat, collapse=TRUE) [,1] [,2] [,3] [,4] 20091012 100000.00 99994.92 99992.25 99993.29 20091013 99795.86 98322.57 101966.21 106510.98 20091014 102360.56 99831.38 104858.49 107995.59 The result is a matrix with as many rows as the price matrix and as many columns as there are portfolios in the random portfolio object. It is possible to get the net value rather than the gross: > valuation(ranport, pricemat, collapse=TRUE, type="net") [,1] [,2] [,3] [,4]
3.2. VALUATION 20091012 1000.00 999.92 999.93 999.99 20091013 6045.86 2394.43 3872.21 -2285.54 20091014 4860.56 5469.98 2364.57 -1377.59 The type argument is only used when collapse is TRUE.
37
> valuation(randport.dollar.neutral, pricemat, collapse=TRUE, + type=nav) # no leverage > valuation(randport.dollar.neutral, pricemat, collapse=TRUE, + type=nav, cash = some.value) > valuation(randport.120.20, pricemat, collapse=TRUE, + type=nav, cash = 0)
Random Weights
If you want to have the random portfolios expressed in terms of weights rather than asset units, then you can use the valuation function with the weight argument set to TRUE. For example:
stockB stockC stockJ 20091012 0.0003892000 0.5046108 -0.4950000 20091013 0.0003841843 0.5299070 -0.4697089 20091014 0.0003536518 0.5233887 -0.4762577 [[2]] stockB stockI stockJ 20091012 -0.2862711 0.5049999 -0.2087290 20091013 -0.2868016 0.5121764 -0.2010220 20091014 -0.2667007 0.5273961 -0.2059032 This works whether prices is just a vector or a matrix.
38
3.3
Small Selections
You can use head to get the rst few random portfolios, and tail to get the last few. These are generic functions in R. Their random portfolio methods return an object that retains the class and other attributes of the original object. These functions can be useful to inspect the portfolios to see if they look reasonable without printing hundreds or thousands of portfolios to the screen. They can also be used to test commands, such as the example immediately below.
Evaluating Portfolios
The sister function to random.portfolio is trade.optimizer. It can be of interest to see some of the values that the optimizer would return for each of the random portfolios. The randport.eval function does that: for each of the random portfolios (or trades) in the object it nds what the optimizer says about it. You can select which components of the output of trade.optimizer to keep (using the keep argument). The result is a list as long as the random portfolio object and each component of that list is a list containing the kept components. Here is a small example of keeping the portfolio variances: > randport.eval(head(randport4, 3), keep=var.values) [[1]] [[1]]$var.values [1] 382.3576 [[2]] [[2]]$var.values [1] 147.6476 [[3]] [[3]]$var.values [1] 134.6368 In this case where we are returning only one number per portfolio, it makes more sense to coerce this to a numeric vector: > unlist(randport.eval(head(randport4, 3), keep=var.values), + use.names=FALSE) [1] 382.3576 147.6476 134.6368 Keep in mind that these values are ex ante predictionsthey may or may not have much relation to realized variance. note In randport.eval the optimizer is called using the same names of objects as was used when the random portfolio object was originally created. Objects with these names must be visible at the time that randport.eval is used. If any of
39
these objects has changed, then it is the current value rather than the original value that is used. caution Additional arguments or changes to arguments may be given to randport.eval so that what the optimizer does is not exactly what random.portfolio did. If you are making a change to an argument, then you need to use the exact same abbreviation (if any) as in the original call to random.portfolio. There is a FUN argument to randport.eval that, if given, applies that function to each of the portfolio objects that are created. For example, we could do: > randport.eval(head(randport4, 3), FUN=summary) Or perhaps a more useful command along the same lines: > do.call("rbind", randport.eval(randport4, + FUN=function(x) summary(x)$number.of.assets)) S code note The command: > do.call("rbind", some.list) is equivalent to the command: > rbind(some.list[[1]], some.list[[2]], ..., + some.list[[length(some.list)]])
Summary
The summary method for random portfolios shows how many assets are in the portfolios, and the number of times each asset appears in a portfolio: > summary(randport5) $port.size 7 8 9 10 1 32 165 802 $count.assets stockC stockD stockA stockB stockE stockF stockI 1000 1000 987 975 973 972 972 stockG stockH stockJ 968 961 960 This shows us that out of the 1000 portfolios, 802 contained all 10 assets, 165 had 9 assets, 32 had 8 assets and 1 had 7 assets. We also see that stockC and stockD were both in all of the portfolios while stockJ was only in 960 of them.
40
3.4
The deport function will write les containing the result of random.portfolio. The simplest use is: > deport(randport2) [1] "randport2.csv" This writes a comma-separated le where the columns each correspond to one of the assets that appear in the object and the rows correspond to the portfolios or trades. There are arguments that allow you to switch the meaning of rows and columns, and to give a universe of assets (which must include all of those appearing in the object). See the help le for details.
41
3.5
You may want to combine some random portfolio objects. Suppose you have objects named rp1, rp2 and rp3 resulting from calls to random.portfolio. You would like these to be in one object as they all have the same constraints (or perhaps they have slightly dierent constraints but you want them all in the same analysis). The c function will put them all together: > rp.all <- c(rp1, rp2, rp3) But not all is well: > deport(rp.all) Error in deport(rp.all) : no applicable method for "deport" > summary(rp.all) Length Class Mode [1,] 45 -none- numeric [2,] 45 -none- numeric [3,] 45 -none- numeric [4,] 45 -none- numeric ... Even though rp.all is basically correct, it doesnt have the class that the other objects have. Without the class, generic functions like summary, deport and valuation dont work as expected. > class(rp.all) <- class(rp1) > deport(rp.all) [1] "rp.all.csv" Once the class is put on the object, we can operate as usual. Almost. If you want to use randport.eval, then you need the call attribute as well. In that case, you could give the big object all of the attributes of one of the original objects: > attributes(rp.all) <- attributes(rp1)
3.6
Not all sets of constraints can be achieved. Obviously there are no portfolios that satisfy a variance that is smaller than zero (or even smaller than the minimum variance given the other constraints). If you set random.portfolio such a task, it is bound to fail. There is a trade-o between returning quickly when asked the impossible and being successful when asked the merely dicult. There is, of course, a default value for this trade-o, but you can adjust it for specic circumstances. There are a number of control arguments that say how hard to work. In terms of impossible constraints, the most important is init.fail. This says how many separate attempts to make before quitting when there have been no portfolios successfully found.
42
For each attempt there is a trio of arguments that control how hard to work within the attempt. iterations.max gives the maximum number of iterations before stopping. fail.iter is the maximum number of consecutive iterations allowed that fail to make progress. miniter gives the minimum number of iterations allowed even if fail.iter says to quit. (Of course if a portfolio is found that satises all the constraints, then the attempt is declared successful and stops no matter what the value of miniter.) The remaining argument of this ilk is gen.fail. Lets start with the problem that this argument solves. Suppose you have set a dicult problem for random.portfolio and you want 1000 portfolios. Suppose further that the rst attempt was successful (so clearly the problem is not impossible) but the next one million attempts fail. As far as you are concerned you are waiting forever. gen.fail times the number of portfolios requested is the maximum number of failed attempts allowed. In our example you requested 1000 and the default value of gen.fail is 4, so it would stop after 4000 failures and return the one random portfolio it successfully found (and warn you that it didnt do so well with your request). Note that it is seldom obvious whether a specic set of constraints is easy to satisfy, dicult to satisfy or impossible.
3.7
The random.portfolio function does not allow a constraint on the utility, but random.portfolio.utility does. If computation time is of concern, then it can be better to just use random.portfolio with constraints on the variance and expected returns. However, this may not be quite what you want. There is not much dierence between the functions in terms of how they are used. The key dierence is the objective.limit argument. The objective is the negative of the utility. So if you want the utility to be at least 0.6, then you want the argument: objective.limit = -0.6 The meaning of gen.fail is the same, but the other control arguments are those used with optimization. There are two forms of failure: the objective does not achieve objective.limit the objective is okay, but there are other broken constraints The objfail.max argument controls how many of the rst type are allowed. If objfail.max=1 (the default) and the rst try does not achieve the objective limit, then an error is triggered. The calls look like: rp1 <- random.portfolio(100, the.prices, ...) ru1 <- random.portfolio.utility(100, -0.6, the.prices, ...)
43
3.8
Going Farther
Tasks that you might want to undertake: Chapter 4 discusses how to specify the constraints. To review common mistakes, see Section 10.1 on page 121. Chapter 5 shows the commands used for examples of random portfolio tasks.
44
Chapter 4
Constraints
This chapter covers the constraints that can be imposed for generating random portfolios and optimization. random.portfolio and trade.optimizer use the exact same set of constraint arguments.
4.1
Table 4.1 lists the possible constraints along with the arguments used to achieve them. In addition to these explicit constraints, there is the implicit constraint of trading integer numbers of asset units.
Round Lots
Trading is only done in integer numbers of asset units except when there is an existing position that is non-integral. Thus if the prices are given for lots as opposed to shares, then round lotting is automatic.
4.2
This section discusses the arguments: gross.value net.value long.value short.value turnover long.only allowance 45
46
CHAPTER 4. CONSTRAINTS
Arguments gross.value net.value long.value short.value allowance turnover long.only max.weight universe.trade lower.trade upper.trade ntrade port.size threshold forced.trade positions tol.positions lin.constraints lin.bounds lin.trade lin.abs lin.style lin.direction alpha.constraint var.constraint bench.constraint dist.center dist.style dist.bounds dist.trade dist.utility dist.coef sum.weight limit.cost close.number
Section 4.2
turnover (buys plus sells) no short values if TRUE maximum weight in the portfolio per asset restrict assets to be traded lower and upper bounds on the the number of asset units to trade number of assets to trade number of assets in the portfolio threshold constraints on trade and portfolio trades that must be done otherwise available constraints expressed in monetary terms linear (and/or count) constraints on the portfolio and/or the trade
4.2 4.2 4.3 4.3 4.3 4.4 4.4 4.5 4.6 4.7
4.8 4.9
bound on expected return of the portfolio bound on variance of the portfolio bound on squared tracking error
4.13
maximum of the sum of a specied number of the largest weights allowable range of costs number of positions to close
47
While there is no single monetary argument that needs to be given, it is mandatory that the monetary value of the portfolio be constrained somehow. All of the monetary arguments are in the currency units that prices uses. In all cases it is sucient to only give turnover. The argument: turnover = 12000 says that the buys plus sells of the trade can not exceed 12,000 currency units. While this makes most sense when there is an existing portfolio, that is not necessary. The turnover can, of course, be constrained even when other monetary constraints are given. The turnover can be expressed as an interval: turnover = c(11000, 12000) When only one number is given, the implied lower bound is zero. How the other monetary arguments are used largely depends on whether or not the portfolio is long-only.
Long-only Portfolios
If you want long-only portfolios, then you need to set the long.only argument to TRUE (the default is FALSE). You can state the amount of money in the resulting portfolio by giving the gross.value argument. Ultimately this needs to be a range of allowable values. You can give the range explicitly with a vector of two numbers: gross.value = c(999900, 1e6) Alternatively you can give a single number: gross.value = 1e6 When a single number is given, this is taken to be the upper boundthe lower bound is computed via the allowance argument. The default allowance is 0.9999, that is, one basis point away. So (by default) these two specications of gross.value are equivalent. In general there is no problem with a constraint this tightthe key thing is how wide the range is relative to the prices of the assets. There will be a warning if the interval is seen to be too narrow. In the case of long-only portfolios, net.value and long.value are synonyms for gross.value, so you can give any one of these three.
Long-short Portfolios
The arguments gross.value, net.value, long.value and short.value control the value of the portfolio. To be clear: The long value is the amount of money in positive positions. The short value is the amount of money in negative positionsthis is meant to be a positive number, but the absolute value of negative numbers is taken for
48
CHAPTER 4. CONSTRAINTS
the short value. The gross value is the sum of the long and short values. The net value is the long value minus the short value. There are two minimal sets of these arguments: gross.value and net.value long.value and short.value Here are some examples: gross.value = 1e6, net.value = c(200, 3000) # OK long.value = 6e5, short.value = 5e5 # OK gross.value = 1e6, net.value = c(0, 2e5), long.value=6e5 # OK gross.value = 1e6, long.value = 6e5 # not OK, neither pair These four arguments are logically each of length twogiving an allowable range. If they only have length one, then the second value is computed by multiplying the given value by the allowance argument. The default value for allowance is 0.9999, that is, one basis point awayyou may want to alter this depending on how important the tightness of these constraints is to you, and on the size of the portfolio relative to the prices of the assets. The allowance computation is unlikely to be what is desired for net.value, so it is recommended that net.value always have length two. Section 7.4 on page 106 gives more details about constraining the value of the portfolio. There are two cases that are of particular interest: dollar neutral portfolios and portfolios in the genre of 120/20. Dollar Neutral Portfolios A dollar neutral portfolio implies that the net value is zero. This is a case where the rule that net.value should only be given as an interval might be relaxed. The argument: net.value = 0 translates into an interval that is symmetric around zero and the radius of the interval is the gross value times one minus allowance. Even so it is probably better to explicitly set your interval for the net value. 120/20 and 130/30 Portfolios The simplest way to get these is to give constraints just like the stated aim: long.value = 1.2 * NAV, short.value = 0.2 * NAV or possibly a range can be given: long.value = c(1.2, 1.25) * NAV, short.value = c(0.2, 0.25) * NAV, net.value = c(.999, 1.0) * NAV
49
4.3
Limits on Assets
This section discusses the arguments: max.weight enforce.max.weight universe.trade lower.trade upper.trade
max.weight
One typical use of the max.weight argument is: max.weight = 0.05 This limits the maximum weight of each asset to 5% of the gross value of the portfolio. The weight is the absolute value of the monetary value of the position divided by the gross value. (So even in long-short portfolios the absolute weights always sum to 1.) The other typical use is to give max.weight a named vector that states the maximum weight for each asset. Assets that are not named are not restricted. Example If you want to allow a few assets to have larger weights, then you can create a vector of all the pertinent assets. Suppose you want almost all assets to have weight no more than 5% and a few to have weight no more than 7.5%. Then you could create the appropriate vector by: > maxw.spec <- rep(0.05, length=length(prices)) > names(maxw.spec) <- names(prices) > maxw.spec[spec.assets] <- 0.075 The maxw.spec vector would then be given as the max.weight argument. A related argument is the control argument enforce.max.weight. When this is TRUE (the default) and if there are positions in the existing portfolio that will break the maximum weight constraint, then forced trades are created to make them conform to the maximum weight. caution There are circumstances in which the max.weight argument does not guarantee that the maximum weight in the nal portfolio is obeyed if the weight is too large in the existing portfolio. A warning may not be given (since max.weight merely limits the extent of trading of the assets). One case is if the control argument enforce.max.weight is FALSE. When this argument is TRUE (the default), then forced trades are automatically built
50
CHAPTER 4. CONSTRAINTS
to make the positions conform to their maximum weights. This is done for the maximum of the range of the gross value. If the range for the gross value is large and the resulting portfolio has a gross value substantially smaller than the maximum allowed gross, then some maximum weights could be broken. It is also possible that a maximum weight is broken by enough that the trade can not be forced to be large enough. In this case, the trade is forced to be as large as possible if enforce.max.weight is TRUE. If the maximum weight is the same for all assets, then you can ensure that it is obeyed by using sum.weight (well, either its obeyed or you see a warning about it). To just constrain the largest weight, the sum.weight argument (see Section 4.14) would look like: sum.weight = c("1"=.1) This restricts the largest weight to 10%.
universe.trade
The universe.trade argument should be a vector of character strings containing some of the asset names. Those will be the only assets allowed to trade. If this argument is NULL (the default), then there is no constraint on the assets to trade (except possibly for benchmarks).
51
The process of constraining the liquidity merely involves computing the maximum amount of each asset that should be traded; setting the upper.trade argument to this amount; and setting the lower.trade argument to the negative of that amount. In this example it is assumed that ave.daily.volume is a vector of numbers that has names which are the names of the assetssimilar to the prices vector. These two vectors need not have the same assets. If there are assets in the daily volume vector (and hence in liquidity) that are not in prices, then these will be ignored. If prices has assets that are not in liquidity, then the missing assets will not be limited by the upper.trade and lower.trade arguments. caution Make sure that the units for the volume are the same as those for the prices. An easy mistake would be to try to limit trading to 10% of daily volume, but instead limit it to 10 times daily volume because the volume is in shares while the prices are in lots. Another sort of liquidity constraint is to limit the amount of assets in the portfolio to n days of average volume. This is more complicated since we want the bounds on the portfolio, but here we have constraints on the trade. This is a case where using positions is an easier and safer approach. See page 55 for an example of getting this constraint using positions.
4.4
Number of Assets
This section discusses the arguments: ntrade port.size Often these two constraints should be thought of as convenient shortcuts to threshold constraints (page 52).
52
CHAPTER 4. CONSTRAINTS
states that the number of assets traded needs to be between 4 and 25, inclusive. A minimum number to trade is most likely to be useful when threshold constraints (Section 4.5) are used as well. Otherwise trading just one unit of an asset counts. The minimum number to trade is more useful for generating random portfolios than in optimizing.
4.5
Threshold Constraints
This section discusses the argument: threshold The threshold argument controls two types of threshold constraintstrade thresholds and portfolio thresholds. Threshold constraints may also be specied with positions (page 54). Note that in any one problem you can only declare threshold constraints with one of these arguments (though both arguments can be used in one problem).
Trade Thresholds
Trade threshold constraints force trades to be at least a certain amount of an asset if it is traded at all. The argument: threshold=c(ABC=4, DEF=23)) requests that at least 4 units (lots perhaps) of asset ABC and at least 23 units of DEF be bought or sold if they are traded at all. This is not the same as forcing a trade of a certain sizethat is discussed in Section 4.6. When the threshold argument is a vector (or a one-column matrix), then the constraint is taken to be symmetric. Another way of stating the constraint given above would be: threshold=rbind(ABC=c(-4,4), DEF=c(-23,23))) where what we are giving to threshold looks like:
4.5. THRESHOLD CONSTRAINTS > rbind(ABC=c(-4,4), DEF=c(-23,23))) [,1] [,2] ABC -4 4 DEF -23 23
53
Give a two-column matrix when you want dierent constraints for buying and selling for at least one of the assets. threshold=rbind(ABC=c(-5,4), DEF=c(0,23))) The command above states that if ABC is sold, then at least 5 units should be sold. If ABC is bought, then at least 4 units should be bought. If DEF is bought, then at least 23 units should be bought. There is no threshold constraint if DEF is sold.
Portfolio Thresholds
A portfolio threshold constraint states the minimum number of units of an asset that should be held in the portfolio. For example, you may prefer to have no lots of ABC if less than 7 lots are held. Portfolio thresholds are specied similarly to trade thresholds except they are in the third and fourth columns of the matrix instead of the rst and second. Here is an example:
> thresh.m1 <- cbind(0, 0, rbind(ABC=c(-7, 7), DEF=c(-5, 8))) > thresh.m1 [,1] [,2] [,3] [,4] ABC 0 0 -7 7 DEF 0 0 -5 8 If thresh.ml is given as threshold, then there should be at least 7 units of ABCeither long or shortif it is in the portfolio at all. Also DEF should be at least 5 short or at least 8 long or not in the portfolio. In this case there are no trading threshold constraints since the rst two columns are all zero.
54
CHAPTER 4. CONSTRAINTS
4.6
Forced Trades
This section discusses the argument: forced.trade The forced.trade argument is a named vector giving one or more trades that must be performed. The value gives the minimal amount to trade and the names give the assets to be traded. For example: forced.trade = c(ABC=-8, DEF=15) says to sell at least 8 units (lots) of ABC and buy at least 15 units of DEF. If you wanted to buy exactly 15 units of DEF, then you would say: forced.trade = c(ABC=-8, DEF=15), upper.trade=c(DEF=15) Trades may also be automatically forced if the existing portfolio breaks maximum weight constraintssee page 49). The positions argument can also force trades.
4.7
Positions
This section discusses the arguments: positions tol.positions The positions argument does not embody any constraints that cant be achieved with other arguments. It exists because the constraints can sometimes be more conveniently expressed via positions. The constraints that positions can do are: max.weight universe.trade lower.trade upper.trade threshold forced.trade The key dierence between positions and these other constraints is that positions is expressed in monetary value. In examples we will assume the currency unit is dollars, but it is really the currency unit that the prices vector is in. The positions argument takes a matrix with rows representing assets, and 2, 4 or 8 columns. Not all assets in the problem need to be present in positions but (by default) there will be a warning if not.
4.7. POSITIONS
55
Portfolio constraints
The rst two columns of positions contain portfolio constraints. Column 1 is the minimum amount of money that is allowed in the portfolio for each of the assets. Column 2 is the maximum amount of money. So this includes the max.weight constraint. If both positions and max.weight are given, then whichever is the stronger for each asset is the actual constraint. n days of daily volume in portfolio On page 50 there is an example of imposing liquidity constraints on the trade. Here we want to impose liquidity constraints on the portfolio. Suppose that we want to restrict positions in the portfolio to be no more than 8 days of average volume. Since positions is expecting monetary value rather than the number of asset units, we rst need to transform to money:
> liquid.value <- ave.daily.volume[names(prices)] * prices > any(is.na(liquid.value)) # trouble if this is TRUE There are a couple things to note here. We are assuming that the volume and the prices refer to the same asset unitswe dont want one to be in shares and the other in lots. Well see on page 56 why the occurrence of missing values would be upsetting. Now we are ready to build our matrix to give to the positions argument. If we have a long-only portfolio, then the matrix can be: > posmat1 <- cbind(0, 8 * liquid.value) If we have a long-short portfolio, then it would be: > posmat2 <- 8 * cbind(-liquid.value, liquid.value) Lets do one more supposition. Suppose that we have a long-only portfolio and we want no position to be larger than $300,000 as well as having the liquidity constraint. You might be tempted to impose that with a command like: > posmat1[, 2] <- min(3e5, posmat1[, 2]) # WRONG Note that there is no warning from thisS has no way of knowing that you are doing something you dont really want to do. What you do want to do is: > posmat1[, 2] <- pmin(3e5, posmat1[, 2]) # right The min function returns just one number which is the minimum of all the numbers it sees. The pmin function does a minimum for each element, which is what we want in this instance.
56
CHAPTER 4. CONSTRAINTS
Trade constraints
Trade constraints are imposed with the third and fourth columns of positions. If you want to impose trade constraints without imposing portfolio constraints, then you can just make the rst two columns innite. For example, if we want selling of each asset to be limited to $5000 and buying of each asset to be limited to $8000, then we could create a matrix like: > posmat3 <- cbind(rep(-Inf, length(prices)), Inf, + -5000, 8000) > head(posmat3) [,1] [,2] [,3] [,4] [1,] -Inf Inf -5000 8000 [2,] -Inf Inf -5000 8000 [3,] -Inf Inf -5000 8000 [4,] -Inf Inf -5000 8000 [5,] -Inf Inf -5000 8000 [6,] -Inf Inf -5000 8000 > dimnames(posmat3) <- list(names(prices), NULL) These constraints are essentially identical to lower.trade and upper.trade except they are expressed in money rather than asset units, and it is possible to impose forced trades with the positions constraints.
Forced constraints
A trade is forced either if the range for the portfolio does not include the current position, or if the trading range does not include zero. This substitutes for using forced.trade. Forced trades from positions cause the output of trade.optimizer to have a component named positions.forced which will show the forced trades created.
Universe constraints
If there is a missing value (NA) in any of the rst four columns of positions, then the corresponding asset is not allowed to trade. (It is an error to have a missing value in the fth through eighth columns.) These constraints could be achieved by using the universe.trade argument. Be careful not to let missing values stray in by accident. The summary of optimized portfolios tells how many assets positions forces not to trade.
Threshold constraints
The 5th through the 8th columns of positions are for threshold constraints. All four of these columns need to be given if any are given. Note also that if these columns are given, then the threshold argument can not be given. You can put null constraints in columns that you dont want to have inuence. The most extreme possibility in this regard is if you want to use positions to only impose a portfolio threshold on a long-only portfolio. Suppose you want only positions that have at least $20,000, then you could build the positions matrix like:
57
> posmat4 <- cbind(rep(-Inf, length(prices)), Inf, -Inf, Inf, + 0, 0, 0, 2e4) > head(posmat4) [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [1,] -Inf Inf -Inf Inf 0 0 0 20000 [2,] -Inf Inf -Inf Inf 0 0 0 20000 [3,] -Inf Inf -Inf Inf 0 0 0 20000 [4,] -Inf Inf -Inf Inf 0 0 0 20000 [5,] -Inf Inf -Inf Inf 0 0 0 20000 [6,] -Inf Inf -Inf Inf 0 0 0 20000 > dimnames(posmat4) <- list(names(prices), NULL)
Tolerance
Imagine a situation where asset ABC is restricted to be no more than $50,000 in the portfolio. At time 1 the optimizer likes ABC and buys enough that it is close to that upper limit. From time 1 to time 2 ABC dutifully does well, so now it is above the $50,000 limit. Assuming the limit remains the same for time 2, then the optimizer will force a sale of some ABC. So we are performing a trade (probably of a trivial amount) for a rather arbitrary reason. If there is a trade threshold constraint on selling ABC, then it is even worse we are eectively forcing a sale of at least the threshold amount. The tol.positions argument solves this problem. If tol.positions is zero (the default), then the constraints are taken as stated. If tol.positions is positive, then assets that violate portfolio constraints in positions by a value less than tol.positions are not forced to trade. In our example if the value of the position of ABC at time 2 is $50,203, then at least $203 worth of ABC will be forced to be sold if tol.positions is less than $203. But there will not be a forced trade if tol.positions is greater than $203. Note that this tolerance only applies to assets that are already in the existing portfolio.
4.8
Linear Constraints
CHAPTER 4. CONSTRAINTS
These arguments create linear constraints on the portfolio and/or the trade. These arguments can also impose count constraintsa topic that is discussed in the next section. Portfolio Probe makes a distinction between numeric constraints and constraints based on categories. An example of the rst is a bound on twice the value of asset ABC minus 1.5 times the value of asset DEF minus the value of asset GHI. An example of the second is bounds on the value from each country in the asset universe. (Most other optimizers will expand such constraints into a matrix of zeros and onesyou could do this also, but the method used here is more compact as well as easier for the user.)
Building Constraints
To impose linear constraints you must specify at least two argumentsyou can accept default values of other arguments. These two key arguments are: lin.constraints lin.bounds The lin.constraints argument gives the numbers or categories for each asset for each constraint. lin.bounds provides the lower and upper bound for each (sub)constraint. Because the functions are picky about these arguments, it is best to create them with the build.constraints function. The build.constraints function takes a vector, a matrix, a factor or a data frame containing the information for the lin.constraints argument. The vector or matrix can be character, numeric or logical. A data frame is required if you have a mixture of types. In this example, we are giving a character matrix with two columnsone for countries and one for sectors: > cons.obj <- build.constraints(cbind(country=countryvec, + sector=sectvec)) > names(cons.obj) [1] "lin.constraints" "bounds" The result of build.constraints is a list with two components. The rst component, lin.constraints, is suitable to give as the lin.constraints argument, the bounds component is the template for an object to give as lin.bounds.
59
The lin.constraints object must have column names, so build.constraints will add column names if they are not already there. (The column names are used to keep track of which bounds go with which constraint.) > cons.obj$lin.constraints[1:3,] country sector ABC "Spain" "media" DEF "France" "retail" GHI "Italy" "energy" > cons.obj$bounds lower upper country : France -Inf Inf country : Italy -Inf Inf country : Spain -Inf Inf sector : energy -Inf Inf sector : media -Inf Inf sector : retail -Inf Inf sector : telecom -Inf Inf The typical use of the bounds component is to create a separate object out of it, and then change any of the innite values desired.
60
CHAPTER 4. CONSTRAINTS
If on the other hand, we want bounds in terms of monetary value, then we could do: > > > > > bounds.csv <- cons.obj$bounds bounds.csv[1:2, ] <- c(1e5, 2e5, 4e5, 5e5) bounds.csv[3, 2] <- 2e5 bounds.csv[5,] <- c(3e5, 6e5) bounds.csv lower upper country : France 1e+05 4e+05 country : Italy 2e+05 5e+05 country : Spain -Inf 2e+05 sector : energy -Inf Inf sector : media 3e+05 6e+05 sector : retail -Inf Inf sector : telecom -Inf Inf
Now the arguments would be: lin.constraints = cons.obj$lin.constraints, lin.bounds = bounds.csv, gross.value = 1e6, lin.style = "value" In this case we need to give the lin.style argument because we are using a non-default value for it. It is also possible to mix styles: > bounds.csmix <- bounds.csw > bounds.csmix[1:3, ] <- bounds.csv[1:3, ] > bounds.csmix lower upper country : France 1e+05 4e+05 country : Italy 2e+05 5e+05 country : Spain -Inf 2e+05 sector : energy -Inf Inf sector : media 0.3 0.6 sector : retail -Inf Inf sector : telecom -Inf Inf So the arguments become: lin.constraints = cons.obj$lin.constraints, lin.bounds = bounds.csv, gross.value = 1e6, lin.style = c("value", "weight") There is no need for any particular pattern of nite bounds. In this example, the only sector we are bounding is media. Once a bounds matrix has been set up, it can be used when building new constraint objects. Suppose that we want to change from sectors to industries, we can build new constraints like: > cons.obj2 <- build.constraints(cbind(country=countryvec, + industry=industvec), bound=bounds.csw)
61
In the cbind command, we are assuming that countryvec and industvec have the same assets in the same order. A safer approach would be: > ci.inam <- intersect(names(countryvec), names(industvec)) > cbind(country=countryvec[ci.inam], + industry=industvec[ci.inam]) The bounds component of cons.obj2 will have the same bounds for the countries as bounds.csw and will have innite bounds for the industries. A bounds object that is actually used may contain extraneous boundsfor example bounds for sectors when only countries are being constrained. However, it is an error not to give bounds for all of the constraints represented in lin.constraints. Only categorical constraints have been discussed so far. We now look at numerical constraints.
62
instead of > cons.obj3$bounds[] <- c(0.9, 0.95) The dierence is that the rst of these will result in a plain vector while the second retains the matrix structure of the original object. We want that matrix structure. We would now use the arguments: lin.constraints = cons.obj3$lin.constraints, lin.bounds = cons.obj3$bounds Constraining more than one risk factor is straightforward. > cons.obj4 <- build.constraints(cbind(Risk1=beta1, + Risk2=beta2)) > head(cons.obj4$lin.constraints) Risk1 Risk2 stockA 1.3119461 0.8972907 stockB 0.9886379 0.9784834 stockC 1.1688637 0.7135961 stockD 0.8160228 1.1195767 stockE 1.0312180 1.0847337 stockF 1.2067453 1.0977745 Now just create the bounds that you want and use the arguments as before.
63
> cons.objmix <- build.constraints(data.frame(Risk1=beta1, + Risk2=beta2, Country=country)) > head(cons.objmix$lin.constraints) Risk1 Risk2 Country stockA 1.3119461 0.8972907 Australia stockB 0.9886379 0.9784834 Singapore stockC 1.1688637 0.7135961 Japan stockD 0.8160228 1.1195767 Australia stockE 1.0312180 1.0847337 Japan stockF 1.2067453 1.0977745 Australia > cons.objmix$bounds lower upper Risk1 -Inf Inf Risk2 -Inf Inf Country : Australia -Inf Inf Country : Japan -Inf Inf Country : Singapore -Inf Inf Yet again the next step is to specify the bounds and then use the arguments.
64
CHAPTER 4. CONSTRAINTS
The lin.abs argument is replicated to have length equal to the number of columns in the lin.constraint object. As with similar arguments, lin.abs can be a vector: lin.abs = c(TRUE, TRUE, FALSE) Lets look at the four possible combinations of the values of lin.trade and lin.abs. These are: portfolio, net (not absolute) (the default) portfolio, gross (absolute) trade, net (not absolute) trade, gross (absolute) For long-only portfolios the rst two are the same since all positions are positive. An extreme example is to enforce all four types on the same constraint. You want to name the constraints properly so that you can keep track of them: > con.4c <- build.constraints(cbind(c.tn=countryvec, + c.ta=countryvec, c.pn=countryvec, c.pa=countryvec)) The result of the above command can then be used like: lin.constraints = con.4c$lin.constraints, lin.trade = c(TRUE,TRUE,FALSE,FALSE), lin.abs = c(FALSE,TRUE)) The lin.abs argument will automatically be replicated to be: FALSE, TRUE, FALSE, TRUE. Of course we could have given all four values.
65
Of course, lin.direction gets replicated to be as long as the number of columns in the lin.constraints object, and can be given as a vector: lin.direction = c(1, -1, 0, 0) When a short-side constraint is in eect (lin.direction=-1), then it is the absolute value of the short values that is usedyour constraints should be nonnegative.
upper realized nearest violation -2e+05 -296632 -3368 NA 2e+05 180767 19233 NA -3e+04 -75494 -24506 NA 7e+05 600624 -624 NA 3e+05 206917 -6917 NA 5e+05 399552 NA -448 0e+00 -24099 24099 NA 1e+05 92422 7578 NA 0e+00 -76340 -23660 NA 4e+05 394621 5379 NA 4e+05 304686 -4686 NA 4e+05 400398 NA 398
This contains a matrix where each row has information on a sub-constraint. The nal column indicates the size of violationsif all of the elements in this column are NA, then there are no violations. A negative violation means that the realized value is below the lower bound, and a positive violation means that the realized value is above the upper bound. The next to last column gives proximity of the realized value to the nearest bound (if there is not a violation). The same convention in terms of sign is useda negative number means the realized is closer to the lower bound. The last two columns have exactly one missing value for each row. In this example the sizes of the violations are relatively trivial. However, they would not be trivial in terms of their aect on the objective if you were optimizingany violation means that the optimizer has been concentrating on getting a feasible solution (a solution that merely satises the constraints) and may not be near the best feasible solution. At times there is a non-zero penalty that is of trivial size relative to the utility, you need not worry in such cases.
66
CHAPTER 4. CONSTRAINTS
4.9. COUNT CONSTRAINTS Country : Germany Country : UK $lin.trade [1] FALSE $lin.abs [1] FALSE $lin.style [1] "value" $lin.direction [1] 0 0 100 100 2000 27.57 175.39 -27.57 -75.39 NA NA
67
An example for optimization is: > tail(summary(op.obj6)) $valuation.trade.fraction.of.gross gross net long short 0.05179854 0.02020499 0.03600176 0.01579677 $constraints.realized $constraints.realized$linear lower upper realized nearest violation Cntry : France -300.0 0.00 -297.6500 -2.350e+00 NA Cntry : Germany 0.0 100.00 11.4600 -1.146e+01 NA Cntry : UK 100.0 2000.00 782.1500 -6.822e+02 NA Beta 0.9 0.95 0.9001 -1.060e-04 NA $lin.trade [1] FALSE $lin.abs [1] FALSE $lin.style Cntry Beta "value" "weight" $lin.direction [1] 0 Notice that if a linear argument is the same for all constraints, then just one value is printed. On the other hand if an argument has more than one value (lin.style in the example above), then all values are printed including names of the constraints.
4.9
Count Constraints
This section continues the description of arguments: lin.constraints lin.bounds lin.trade lin.abs lin.style
68 lin.direction
CHAPTER 4. CONSTRAINTS
but with the restriction that lin.style="count". When lin.style is "count", then count constraintsuser-dened integer constraints are produced. This can only be used with a categorical column of lin.constraintsa count constraint on a numerical column will trigger an error. Lets consider some examples. Suppose that we have a long-short portfolio and we want an equal number of long positions and short positions from the UK, and we want one more French long position than French short position. We have no preference regarding Germany. Then we would create a bounds object like: > lbc1 Country : France Country : Germany Country : UK lower upper 0.5 1.5 -Inf Inf -0.5 0.5
Our call would then include the arguments: lin.bounds = lbc1, lin.style = "count" We can accept the default values of lin.abs and lin.trade because we want a net constraint on the portfolio. We can infer from the bounds object that the lin.constraints argument contains just one constraint that is called Country. Notice that in this context (lin.abs = FALSE, lin.direction = 0) long positions count as one but short positions count as negative one. Also notice that we are giving bounds that are between integersyou dont want to give bounds for count constraints that are integer. Now suppose that we want exactly one German position in the portfolio and we want one or two French positions. Our bounds object should now look like: > lbc2 Country : France Country : Germany Country : UK We would have arguments: lin.bounds = lbc2, lin.style = "count", lin.abs = TRUE We are adding lin.abs to the call because we want an absolute constraint. If we want exactly two long UK positions and we want no short French positions, then we need to do a bit more work. The extra work is because we now have two constraints (a long-side constraint and a short-side constraint) so we need two columns in lin.constraints, not the one column that we have been using. We start by building the appropriate object: lower upper 0.5 2.5 0.5 1.5 -Inf Inf
69
> lcc3 <- build.constraints(cbind(Country.longc=countryvec, + Country.shortc=countryvec)) Now we get the bounds matrix that we need: > > > > > lbc3 <- lcc3$bounds # lbc3 lbc3[3,] <- c(1.5, 2.5) lbc3[4, 2] <- .5 lbc3 lower upper -Inf Inf -Inf Inf 1.5 2.5 -Inf 0.5 -Inf Inf -Inf Inf
Country.longc : France Country.longc : Germany Country.longc : UK Country.shortc : France Country.shortc : Germany Country.shortc : UK Now well use arguments:
lin.constraints = lcc3$lin.constraints, lin.bounds = lbc3, lin.style = "count", lin.direction = c(1, -1) Logical values are allowed to dene constraints. Logicals are probably most likely to be used in count constraints. Here we have a somewhat contrived example (as if the others arent) where we want to lower the beta of the portfolio. We lower beta if we sell high beta stocks or buy low beta stocks. Well create a constraint that does more of this than its opposite in terms of asset count. We build the constraint object and the bounds: > > > > lcc4 <- build.constraints(cbind(HighBeta = beta > 1) lbc4 <- lcc4$bounds lbc4[1,1] <- 3.5 lbc4 lower upper HighBeta : FALSE 3.5 Inf HighBeta : TRUE -Inf Inf
Our arguments are: lin.constraints = lcc4$lin.constraints, lin.bounds = lbc4, lin.style = "count", lin.trade = TRUE
4.10
70
CHAPTER 4. CONSTRAINTS
The alpha.constraint argument bounds the expected return of the optimized portfolio. In its simplest use, it is merely a single number that gives the minimum for the expected return. For example: alpha.constraint = 1.1 An upper bound as well as a lower bound can be specied by giving a twocolumn matrix. For instance: alpha.constraint = cbind(1.1, 2.3) will restrict the expected return to be between 1.1 and 2.3. Note that if a benchmark is given, then this will constrain the expected return relative to the benchmark. This is just the usual expected return of the portfolio minus the expected return of the benchmark. Advanced use of alpha.constraint (chiey when more than one vector of expected returns is given) is discussed on page 147.
4.11
Variance Constraints
This section discusses the argument: var.constraint The var.constraint argument constrains the value of the variance. In its most common usage, it is merely a number giving the upper bound for the variance. Using the argument: var.constraint = 1.4 imposes an upper bound of 1.4 on the variance. The next most common usage is a two column matrix that gives a range of values that the variance is allowed to be in. For example: var.constraint = cbind(1.2, 1.4) says that the variance is to be at least 1.2 but no more than 1.4. The value given looks like: > cbind(1.2, 1.4) [,1] [,2] [1,] 1.2 1.4 More advanced use of this argumentpredominantly when multiple variances are givenis discussed on page 145.
71
4.12
This section discusses the argument: bench.constraint The bench.constraint argument is used to constrain (squared) tracking errors. There has to be information on not only the numerical bound that you want, but the benchmark as well. The benchmark needs to be an asset in the variance, but it doesnt need to be in the prices unless benchmark trading is allowed. Ultimately bench.constraint creates a variance constraintit is just a more convenient way of specifying some commonly desired constraints.
Scaling
Dont forget that this constraint is on squared tracking error, not tracking error. While tracking errors are almost always an annual number, your data probably arent annual. For example if your variance is on a daily scale, then you need to transform the constraint to the daily scale: bench.constraint = c(spx = .04^2 / 252)
72
CHAPTER 4. CONSTRAINTS
Multiple Benchmarks
If you want upper bounds on more than one benchmark, merely give a numeric vector with the names of the benchmarks: bench.constraint = c(spx = .04^2, R1000 = .05^2) species no more than a 4% tracking error from spx and also no more than a 5% tracking error from R1000. To get both upper and lower constraints for multiple benchmarks, you give a two-column matrix with multiple rows. For example: bench.constraint = mult.benchcon.lowup where > mult.benchcon.lowup [,1] [,2] spx 0.0004 0.0016 R1000 0.0009 0.0025 R2000 0.0016 0.0064
Advanced Use
When there are multiple variance matrices, then specifying benchmark constraints is more complicated. This is discussed on page 145.
4.13
Distance
This section discusses the arguments: dist.center dist.style dist.bounds dist.trade dist.utility dist.prices These arguments impose distance constraints. Lets start with a simple example. Suppose we have a target portfolio that has the following weights:
4.13. DISTANCE
73
Suppose we want the resulting portfolio to have weight distance no more than 0.5 (so the buys plus sells of the trade to get from the result to the target would be no more than 50% of the gross value of the portfolio). To impose that constraint we would use arguments: dist.center=targwt, dist.style="weight", dist.bounds=.5 The example above computes distance as the dierence in weights. The distance can alternatively be computed as the dierence in monetary values. These two are equivalent if the gross value is constrained to a narrow range (although the weight formulation takes slightly more time to compute). However, if the gross value is allowed a wide range, then there is a dierence: the value distance will favor a gross value close to the gross value of the target portfolio while the weight distance will be indierent to the gross value. For a value distance, you can give either the values of the assets in the target portfolio or the shares in the target portfolio. Using value might look like: dist.center=target.values, dist.style="value", dist.bounds=25000 while using shares might look like: dist.center=target.shares, dist.style="shares", dist.bounds=25000 The dist.trade argument takes a logical vector and allows you to constrain the distance of the trade rather than the portfolio. The dist.utility argument also takes a logical vectorTRUE values of this imply that you want to use it as the objective in optimization. For more on this, see Section 6.4.
Alternative prices
By default, distance is dened by the prices vector. You dont have to dene it that way. You can include a value for dist.prices that gives dierent prices to be used in the computations of distances. The dist.style argument takes a value with custom before the style when you want to use the dist.prices values. For example, you might have: dist.center=target.shares, dist.style="customshares", dist.prices=some.price.vector, dist.bounds=35000
Multiple distances
It is possible to have more than one distance constraint. In this case the dist.center argument needs to be a list. For example, you might have arguments that look like: dist.center=list(target1.wt, target2.share, target3.wt), dist.style=c("weight", "shares", "weight"), dist.bounds=c(0.2, 2e5, 0.3)
74
CHAPTER 4. CONSTRAINTS
Here we have three distances declaredthe rst and third in terms of weight and the second in terms of shares. If you want lower bounds as well as upper bounds on the distances, then you need to give dist.bounds a two-column matrix with each row corresponding to a distance. For example: dist.bounds=cbind(c(0.1, 1e5, 0.2), c(0.2, 2e5, 0.3)) Each of the arguments dist.style, dist.trade and dist.utility can either have length equal to the number of distances, or length one if the argument is the same for all arguments. For example, in the example above dist.trade and dist.utility both have their length one default value. The dist.prices argument can be a list of price vectors to match the dist.center argument.
4.14
This section discusses the argument: sum.weight Consider the constraint that the largest n weights should sum to no more than some limitthis is handled by the sum.weight argument. For example if you want the 5 largest weights to sum to no more than 40%, and the 10 largest weights to sum to no more than 70%, the argument would look like: sum.weight = c("5"=.4, "10"=.7)) The values in the vector given to sum.weight are the limits, the names of the vector are the numbers of weights in the sums. S code note The names within the c function must be inside quotes as they are not legal S object names. Another way of getting the same thing would be: > sumw.arg <- c(.4, .7) > names(sumw.arg) <- c(5, 10) Then sumw.arg would be given as the sum.weight argument. The argument: sum.weight = c("1" = .05) creates a constraint that says that the largest weight of any one asset is 5%. This is conceptually exactly the same as: max.weight = .05
75
However, there are some operational dierences. The key dierence is that using sum.weight ensures that either the constraint is satised or there is a penalty imposed for breaking the penalty. But it will be somewhat slower when sum.weight is used. See page 49 for the possibility of max.weight failing to enforce the constraint.
4.15
Cost Constraints
This section discusses the argument: limit.cost This is unlikely to be of use with optimization. limit.cost puts lower and upper constraints on the cost. This can be useful for mimicking actual trading with random portfolios. If this is given, then it must be a length two vector giving the allowable range of costs: limit.cost = c(2.3, 2.35) or possibly something like: limit.cost = opt.obj$results["cost"] * c(.99, 1.01)
4.16
This section discusses the argument: close.number The minimum and maximum number of positions to close in the existing portfolio can be specied with the close.number argument. If a single number is given, then exactly that number are to be closed. So: close.number = 3 is really shorthand for: close.number = c(3, 3) Note that this is dierent than other arguments where a single number does not imply a lower bound. To specify that at least 2 positions should be closed, but no more than 3, then say: close.number = c(2, 3) This sort of constraint is unlikely to be useful in optimization, but can be used to generate random portfolios that match what an actual trade has done.
76
CHAPTER 4. CONSTRAINTS
4.17
Quadratic Constraints
Quadratic constraints on the portfolio are not ocially supported, but can be imposed. Quadratic constraints on the trade are not possible. The acceptance of more than one variance means that trade.optimizer and random.portfolio can be tricked into allowing quadratic constraints on the portfolio. Here are the steps for creating quadratic constraints: Add the constraint matrices to the variance. Specify the constraint bounds. Make sure the constraints are not benchmark-relative. If doing optimization, tell the optimizer not to include the constraints in the utility.
77
If you are generating random portfolios (without a utility constraint) and you are not using a benchmark, then you can proceed to the actual computation. The intervening steps only apply if you are using a benchmark or a utility.
Dummy Run
The next two steps are aided by performing a dummy run of optimization something like: > qc.temp <- trade.optimizer(prices, varian3, + iterations=0, ...) It is best to include the variance constraints in this runwithout them the step regarding benchmarks can go wrong. When iterations.max is set to zero, then a minimal amount of optimization is performed. You will get a warning that this is not the same as no optimization. We dont care about optimization in this instance, we only care about the setup for optimization.
[,3] 2 500 0
In our example we want the last two columns to not have a benchmark. Hence we would do an operation like: > qcvtab <- qc.temp$vtable > attr(qcvtab, "benchmarks") <- c("spx", "", "") If you are generating random portfolios and you didnt include the variance constraints in the dummy run, then you will also want to change the utility.only row to be zero in the columns that pertain to your constraints. The argument: vtable = qcvtab should be given in the actual computation. You can learn more about the vtable argument on page 144.
78
CHAPTER 4. CONSTRAINTS
Actual Computation
Now we are ready for the actual computation. If we are generating random portfolios, well have a command like: > qc.rp <- random.portfolio(100, prices, varian3, + vtable = qcvtab, var.con = ...) Alternatively, if we are optimizing, wed do something like: > qc.opt <- trade.optimizer(prices, varian3, + vtable = qcvtab, utable = qc.utiltab, var.con = ...)
4.18
This section applies only to optimizationit can not apply to generating random portfolios. Virtually all of the constraints are enforced via penalties. That is, instead of the optimization algorithm minimizing the negative utility, it minimizes the negative utility plus penalties for whatever constraints are violated. Solutions that obey all of the constraints experience no penalties. The algorithm actively attempts to satisfy some of the constraintsin particular the constraints on the monetary value of portfolios are closely managed,
79
as are threshold constraints, and the constraint on the maximum number of assets traded is always obeyed. Tools are available in trade.optimizer to allow you to adjust the penalties so that most of the constraints (those not actively managed) can become soft constraints. This is done by making the penalties for those constraints small enough so that there will be a trade-o between the penalty for the constraint and the utility. Consider an example: > soft.op1$violated [1] "variance" "linear" attr(,"linear violations") [1] "Country" "Sector" "gross value"
There are violations of two linear constraints, the variance constraint, and the gross value. The default value of the penalty.constraint argument is 1000. You can change individual elements of the penalty by giving the penalty.constraint argument a named vector where the names are the items for which you want to change the penalty. You need to give the names exactlythey can not be abbreviated. The easiest approach is to change the penalty.constraint component of the output from the original optimization. > softpen2 <- soft.op1$penalty.constraint > softpen2[1:2] <- .1 > soft.op2 <- update(soft.op1, penalty.constraint=softpen2)
80
CHAPTER 4. CONSTRAINTS
Chapter 5
82
This function merely calls the (univariate) MACD function from TTR multiple times and saves the signal part of the output. The objects us.price and us.macd are of class xts. In order to avoid assuming that xts is available, we convert these to matrices: > us.pricemat <- as.matrix(us.price) > us.macdmat <- as.matrix(us.macd) (The only substantial change is that row names are created for the matrices.)
R Commands to Do It All
This describes one way of using the auxiliary les to reproduce the analyses. We assume that there is a directory called topdir. This has a subdirectory called test. If the variance caching is done, it will have another subdirectory called Varshr. The auxiliary les from the website are placed in topdir. R is started in test. The scripts include an object called PP LOC that you can change if you would like a dierent topology for the directories. In the R session start o by making sure you are in the right place. The result of: > getwd() should be ".../topdir/test". Then you want to install the regular functions and the graphics functions: > source("../pprobe functions01.R") > source("../pprobe graphFun01.R") Now you need to build the basic data: > source("../pprobe R data01.txt") This retrieves data from the internet and builds the basic data objects. This will fail if the rst asset ID is not available, but should work if others are missing. (This currently produces data for 69 stocks rather than the original 70BNI has been dropped.) If you are going to use the cached variance matrices, then execute the le that contains those commands: > source("../pprobe R varCache.txt") The static method of performance measurement is done with: > source("../pprobe R perfMeas static.txt") To test constraint bounds, do: > source("../pprobe R bounds.txt") Risk is explored with:
5.1. PERFORMANCE MEASUREMENT: STATIC > source("../pprobe R risk.txt") The shadowing method of performance measurement is done with: > source("../pprobe R perfMeas shadow.txt") Backtests are evaluated with: > source("../pprobe R varCache.txt")
83
5.1
The le pprobe R perfMeas static.txt gives the full set of commands for these examples. Step number one is to create the initial portfolio. This was done by optimizing a portfolio: > init.20.w10 <- trade.optimizer(us.pricemat[251,], + expected.return=us.macdmat[251,], port.size=20, + max.weight=.1, gross.value=1e6, long.only=TRUE + utility="maximum return", do.warn=c(novar=FALSE)) The 200% turnover path of optimal portfolios was created with: > ser.20.w10.t200 <- pp.serial.opt(init.20.w10, + pricemat=us.pricemat[-1:-251,], alphamat=us.macdmat, + frac.turnover=2/252, max.weight=.1, port.size=20, + do.warn=c(novar=FALSE, utility.switch=FALSE), + keep.port=TRUE) An object with 400% turnover was also created. Now, we generate random portfolios that obey the constraints at the start of the time period: > rp251.20.w10 <- random.portfolio(1000, us.pricemat[251,], + port.size=20, max.weight=.1, gross.value=1e6, + long.only=TRUE) We then calculate the random portfolio returns over the period: > rp251.20.w10.ret07 <- pp.simpret(valuation(rp251.20.w10, + us.pricemat[c(251, nrow(us.pricemat)-252),], + collapse=TRUE)) We also need to calculate the return of the optimal fund throughout 2007: > serret200.07 <- ser.20.w10.t200$value[251] / + ser.20.w10.t200$value[1] - 1
84
For this one time period and amount of turnover, the only thing left to do is the comparison. The percentile can be computed as: > round(100 * mean(rp251.20.w20.ret07 < serret200.07)) A visual representation can be done like: > plot(density(100 * rp251.20.w20.ret07)) > abline(v = 100 * serret200.07, col="blue") If you want to save plots, then you can open an appropriate graphics device before you perform the graphics commands and close the graphics device afterwards. For example: > > > > pdf(file="myplots.pdf") plot(density(100 * rp251.20.w20.ret07)) abline(v = 100 * serret200.07, col="blue") dev.off()
5.2
Static Utility
First, we create a price vector at a specic time: > price.20061229 <- us.pricemat[251, ] We create a return matrix from the price matrix: > us.retmat <- diff(log(us.pricemat)) We also need a specic variance matrix: > varshr.Q1 <- var.shrink.eqcor(us.retmat[1:250,]) The next step is to generate random portfolios with all the constraints except the volatility constraint: > rp30w10 <- random.portfolio(number.rand = 10000, + prices = price.20061229, gross.value = 1e+06, + long.only = TRUE, port.size = 30, max.weight = 0.1) The utility (with risk aversion 2) of these portfolios can now be computed for the year 2007: > rp30w10.mvu2.2007 <- pp.meanvarutil(valuation(rp30w10, + us.price[252:502,], collapse=TRUE), 2) The same two steps (generate portfolios, calculate utility) are now done with the volatility constraint:
85
> rp30w10v08 <- random.portfolio(number.rand = 10000, + prices = price.20061229, variance=varshr.Q1, + gross.value = 1e+06, long.only = TRUE, port.size = 30, + max.weight = 0.1, var.constraint=.08^2/252) > rp30w10v08.mvu2.2007 <- pp.meanvarutil(valuation(rp30w10v08, + us.price[252:502,], collapse=TRUE), 2)
Caching Variances
Many operations depend on a series of variance matrices. Variance matrices are sometimes reasonably large so the memory used by a few hundred of them can become a problem. One solution is to create a directory containing individual les of each variance that can each be loaded as needed. The solution shown here works for R, it would (probably) need to be adjusted some to work in S-PLUS. The code that creates the les of variances may be found in le pprobe R varCache.txt. Here is code to create the les: > for(i in 250:700) { + JJd <- rownames(us.retmat)[i] + JJn <- paste("varshr", JJd, sep=".") + assign(JJn, var.shrink.eqcor(us.retmat[seq(to=i, + length=250),])) + save(list=JJn, file=paste("Varshr/", JJn, ".rda", + sep="")) + rm(list=JJn) + cat("done with", JJd, "\n") + } The following fragment from a function shows how the variance les can be used: for(i in seq(along=dates)) { this.date <- dates[i] vnam <- paste(varprefix, this.date, sep="") attach(paste(vardir, "/", vnam, ".rda", sep="")) this.rand <- random.portfolio( variance=get(vnam), ...) detach() .... } In each iteration the le containing the desired variance matrix is attached, the variance is used, and then the le is detached. The function has a couple of arguments (vardir and varprefix) that allow some exibility.
Dynamic Utility
The rst dynamic utility object does not involve the variances. The following call sets the window size to 60 trading days and the risk aversion to 2, among other things:
86
CHAPTER 5. DETAILS OF RANDOM PORTFOLIO EXAMPLES > rp30w10.mvu2.dat <- pp.date.meanvarutil( + rownames(us.pricemat)[251:695], 60, 2, + us.pricemat, us.macdmat, max.weight=.1, + port.size=30, gross.value=1e6, long.only=TRUE, + vardir=paste(PP LOC, "Vardir", sep="/"))
Even though the variances arent really used, we still need to provide a valid location for the directory holding the variances because the function still pulls the variances in. The next command nds the minimum variance that can be achieved with the given constraints. We dont really care if we have the exact minimum or not, so in the interests of speed the number of iterations that the optimizer is allowed is reduced to 3: > rp30w10.minvar.dat <- pp.date.minvar( + rownames(us.pricemat)[251:695], us.pricemat, + max.weight=.1, port.size=30, gross.value=1e6, + long.only=TRUE, iteration=3, + vardir=paste(PP LOC, "Varshr", sep="/")) The command to create the dynamic utility with the volatility constraint is: > rp30w10vv20.mvu2.dat <- pp.date.var.meanvarutil( + rownames(us.pricemat)[251:695], + 1.44 * rp30w10.minvar.dat, 60, 2, us.pricemat, + us.macdmat, max.weight=.1, port.size=30, + gross.value=1e6, long.only=TRUE, throw.error=FALSE, + vardir=paste(PP LOC, "Varshr", sep="/")) We want a constraint that is 120% of the minimum volatility, but the constraint is actually on the variancethe square of 120% is 1.44.
5.3
The full set of commands for this task is in le pprobe R risk.txt. A preliminary step is to create a vector stating where in the price data the quarters end: > quarter.ends <- c(251, 312, 375, 438, 502, 563, + 627, 691, 755) > names(quarter.ends) <- paste(rep(2006:2008, + c(1,4,4)), Q, c(4,1:3), sep="") We create the template for a number of objects that well ll in: > empty8 <- vector("list", 8) > names(empty8) <- tail(names(quarter.ends), 8) We make a return matrix using the price matrix: > us.retmat <- diff(log(us.pricemat))
87
Next, we create a set of random portfolios for each quarter: > rp30s15.Q <- empty8 > for(i in 1:8) { + rp30s15.Q[[i]] <- random.portfolio(1000, + us.pricemat[quarter.ends[i], ], gross.value=1e6, + long.only=TRUE, port.size=30, max.weight=.05, + sum.weight=c("4"=.15)) + } What we want from the random portfolios is the predictions of their volatility. So we create an object to hold the predictions from each type of model: > > > + + + + + + + + + + + rp30s15.facexante.Q <- empty8 rp30s15.shrexante.Q <- empty8 for(j in 1:8) { rp30s15.facexante.Q[[j]] <- unlist(randport.eval( rp30s15.Q[[j]], additional.args=list(prices= us.pricemat[quarter.ends[j], ], variance=varfac.Q[[j]]), keep="var.values"), use.names=FALSE) rp30s15.shrexante.Q[[j]] <- unlist(randport.eval( rp30s15.Q[[j]], additional.args=list(prices= us.pricemat[quarter.ends[j], ], variance=varshr.Q[[j]]), keep="var.values"), use.names=FALSE) }
We generally think in terms of volatility, but it is actually (daily) variance that is saved into the objects.
88
advanced The commands above ensure that a subtle but important problem is avoided. The randport.eval function looks at the image of how the random portfolio object was created (the call attribute) and uses that to create the actual data objects that are used. The prices vector for the random portfolios was: us.pricemat[quarter.ends[i], ] In the current loop there is no i, so either an arbitrary i will be used or there will be an error if no i is found. We could have had our loop variable be i, which would have been ne in this case, but there is no guarantee that the loop variables correspond to the same thing. There is a check that will warn you if it detects that the current data are dierent from the original. If the data are largely static, then the check can fail to properly detect the problem. How the current loop has avoided the problem is to explicitly state the prices vector and variance matrix that we want to use by putting them in the additional.args argument. We can compare the volatility estimates in a plot like: > plot(100 * sqrt(252 * rp30s15.facexante.Q[[4]]), + 100 * sqrt(252 * rp30s15.shrexante.Q[[4]]))
5.4. PERFORMANCE MEASUREMENT: SHADOWING + + + + + } corshrreal[i] <- cor(rp30s15.shrexante.Q[[i]], rp30s15.realized.Q[[i]], method=spearman) corfacreal[i] <- cor(rp30s15.facexante.Q[[i]], rp30s15.realized.Q[[i]], method=spearman)
89
Actually this is computing the correlation between the predicted variance and the realized volatility. Since we are using Spearman (rank) correlations, it doesnt matter whether we use variance or volatilitythe ordering is unchanged when we convert from variance to volatility. Next we bootstrap the correlations and compute 95% condence intervals: > > > + + + + + + + > + corboot.Q <- rep(list(numeric(5000)), 8) names(corboot.Q) <- names(empty8) for(i in 1:5000) { this.bootsamp <- sample(1000, 1000, replace=TRUE) for(j in 1:8) { corboot.Q[[j]][i] <- cor(rp30s15.shrexante.Q[[j]][ this.bootsamp], rp30s15.realized.Q[[j]][ this.bootsamp], method=spearman) } } corshrreal.95ci <- sapply(corboot.Q, quantile, probs=c(.025, .975))
The condence intervals arent all that important, but they provide a nice example of showing how easy it is to perform statistical bootstrapping. The dollar neutral and 120/20 portfolios are done analogously. The key dierence is in the call to valuation to get the realized volatility. The type and cash arguments become importantthe assumption of how much cash there is makes a dierence to the returns and hence to the volatility.
5.4
The commands for this section are in le pprobe R perfMeas shadow.txt. The rst thing is to create the optimized run as shown in Section 5.1. Then the random runs that imitate the optimal run are done: > randser.20.w10.t200 <- pp.serial.rand(200, + init.20.w10, pricemat=us.pricemat[-1:-251,], + alphamat=us.macdmat, frac.turnover=2/252, + max.weight=.1, port.size=20, do.warn=c(novar=FALSE, + utility.switch=FALSE)) A plot can now be performed: > the.den <- density(100 * pp.simpret( + randser.20.w10.t200$value[c(1,252),])) > plot(the.den, main="", xlab="2007 return (percent)", + xlim=range(the.den$x, 100 * serret200.07)) > abline(v=100 * serret200.07, col="blue", lwd=2)
90
5.5
The code for this section is in le pprobe R tradeStrat.txt. The rst step is to generate a set of random portfolios (20 of them in this case) to serve as the starting points: > rp.20.w10.2008 <- random.portfolio(20, + us.pricemat[502,], gross.value=1e7, long.only=TRUE, + max.weight=.1, port.size=20) Doing the optimization and then the random mimicking for each of the starting portfolios is all done in the following function: > evalst.20.w10.t200.2008 <- pp.evalstrat(100, + rp.20.w10.2008, us.pricemat[-1:-502,], + us.macdmat, frac.turnover=2/252, max.weight=.1, + port.size=20, verbose=2, + do.warn=c(utility.switch=FALSE, novariance=FALSE)) The percentiles for a point in time can be plotted: > plot(ppoints(20) * 100, + sort(evalst.20.w10.t200.2008$percentile[61,]), + ylim=c(0,100), xlab="Theoretical quantiles", + ylab="Percentile") > abline(0,1, col="blue") You can see what such a plot would like with a purely random strategy with commands like: > plot(ppoints(20) * 100, sort(sample(0:100, 20, + replace=TRUE))) > abline(0, 1, col="blue")
5.6
Going Farther
You can modify the scripts to use your own data. You can modify the functions that the scripts use.
Chapter 6
6.1
Required Inputs
The trade.optimizer function needs a price vector. The prices argument needs to be a vector of prices with names that are the asset names. The assets in prices control the assets that are used in the problem. Other inputssuch as the variance and linear constraintsmay contain additional assets (which will be ignored). Other than prices there is no single argument that is required. However, there are two concepts that are required: the amount of money in the portfolio has to be constrained somehow, and there has to be something that provides a utility.
Monetary Value
The turnover or the amount of money in the portfolio needs to be constrained. Details can be found on page 45
Utility
At least one of the arguments: variance 91
92
must be given. Utility-free optimizationminimizing the distance to a target portfoliocan be done using dist.center and related arguments. See Section 6.4 to do this. Assuming distance is out of the picture: If expected.return is not given, then the utility is the minimum variance. If a variance matrix is not given, then the utility is the maximum expected return. If both are given, then the default utility is to maximize the information ratio. It can be safest to explicitly state the utility that you want. Optimizing a long-only portfolio without a variance or a distance is very unlikely to be a good idea.
6.2
This section contains some examples that build in complexity. Expected returns do not enter into optimization for passive portfolios, and they need not be given.
6.3. EXAMPLES FOR ACTIVE PORTFOLIOS > op.te2 <- trade.optimizer(prices, varian, + utility = "minimum variance", + long.only=TRUE, gross.value=1.4e6, benchmark="spx", + port.size=55, max.weight=0.05)
93
The op.te2 object represents a portfolio containing up to 55 assets that (approximately) minimizes the tracking error among all portfolios of size 55 in which assets have a maximum weight of 5%. If you are rebalancing an existing portfolio rather than creating a new one, then the command might be: > op.te3 <- trade.optimizer(prices, varian, + utility = "minimum variance", + long.only=TRUE, gross.val=1.4e6, benchmark="spx", + max.weight=0.05, existing=cur.port, ntrade=20) The command creating op.te3 is allowing up to 20 assets to be traded. It does not control the turnover, nor the number of assets in the resulting portfolio. Thus op.te3 can potentially have up to 75 assets in it if the current portfolio contains 55 assets. Suppose that we want to have a portfolio with 45 to 55 assets in it, and we want to hold the turnover to 100,000 (currency units, e.g., dollars). The following command will provide that: > op.te4 <- trade.optimizer(prices, varian, + utility = "minimum variance", + long.only=TRUE, gross.value=1.4e6, benchmark="spx", + max.weight=0.05, existing=cur.port, turnover=1e5, + port.size=c(45,55)) This command is a realistic possibility. The examples above could have been done without specifying the utility argumentsince there are no expected returns, the utility would default to minimum variance (but you would get a warning stating that utility was being modied). See page 98 for examples of dealing with cash ow. Page 150 has an example of minimizing against dual benchmarksfor example, when you have a prediction of the benchmark after an upcoming rebalancing.
6.3
Expected returns of the assets are required for optimization of active portfolios. (This is not absolutely truesee page 96 for an example.) Additionally, there needs to be a decision on how to balance expected return with risk.
94
natural quantity (in mean-variance land) to maximizewere seeking the best risk-adjusted return. A command to maximize the information ratio is: > op.ir1 <- trade.optimizer(prices, varian, + long.only=TRUE, expected.return=alphas, + gross.value=1.4e6) The alphas object is a vector of expected returns of the assets. When both variance and expected returns are given, the default objective is to maximize the information ratio. In this example 1.4 million currency units are put into the portfolio. The information ratio is traditionally put into annualized terms. If the variance and expected returns in your command are not annualized, then you can annualize the information ratio by multiplying the utility by the square root of the number that you would use to annualize the variance and expected returns. For example, for a daily frequency the annualized information ratio would be: > -op.ir1$utility.value * sqrt(252) The negative sign is because the optimizer always minimizes. When quantities are being maximized (information ratio or utility), then it really minimizes the negative of the quantity. Note that this is the information ratio you would get if all of your inputs were exact. The actual information ratio that you achieve is likely to be worse. We now turn to the case of maximizing the information ratio when there is a benchmark involved.
95
annualized and created from returns in percent, then the bench.constraint argument would have been: c(spx=4^2) note The benchmark argument is not given in the previous command. If it had been, then the information ratio that was optimized would be relative to the benchmark. A more realistic command might be: > op.ir3 <- trade.optimizer(prices, varian, + long.only=TRUE, expected.return=alphas, + gross.value=1.4e6, turnover=1e5, ntrade=15, + bench.constraint = c(spx=0.04^2/252), max.weight=.05, + existing=cur.port, port.size=60) This command includes the existing portfolio, has a limit on the turnover, and species that the nal portfolio should contain no more than 60 assets. You can easily add constraints to other benchmarks as welljust make the bench.constraint argument longer: > op.ir4 <- trade.optimizer(prices, varian, + long.only=TRUE, expected.return=alphas, + gross.value=1.4e6, turnover=1e5, ntrade=15, + bench.constraint = c(spx=0.04^2/252, newspx=0.05^2/252), + max.weight=.05, existing=cur.port, port.size=60) In this example we are constraining the tracking error to be no more than 4% against the current benchmark, and no more than 5% against our prediction of the benchmark after it is rebalanced. Minimizing relative to two or more benchmarks is, however, somewhat more complicatedthat is discussed on page 150. Also a little more complicated is to get a benchmark-relative utility but a constraint on the portfolio variancepage 152 shows how to do that.
96
You can use bothin which case benchmark changes the utility as usual, but there is also a constraint on the tracking error.
Mean-Variance Optimization
The default objective is to maximize an information ratio (whenever both variance and expected returns are given). If you want to perform a mean-variance optimization instead, then you need to specify the utility argument. The generic mean-variance problem is to maximize the expected return of the portfolio minus the risk aversion times the variance of the portfolio. Page 140 has more on the risk aversion parameter. A command to do a simple mean-variance optimization is: > op.mv1 <- trade.optimizer(prices, varian, + long.only=TRUE, expected.return=alphas, + gross.value=1.4e6, utility=mean-var, risk.aversion=.7) This command does not include a benchmark. As noted above, if the benchmark argument is given, then the mean and variance will be relative to the benchmark. If you do have a benchmark, an alternative approach is to do the optimization in the absolute sense and add a constraint on the tracking error. The tracking error constraint is placed on the optimization with a command like: > op.mv2 <- trade.optimizer(prices, varian, + long.only=TRUE, expected.return=alphas, + gross.value=1.4e6, utility=mean-var, risk.aversion=.7, + bench.constraint = c(spx=0.04^2/252)) For an explanation of the value given for bench.constraint, see page 71. In mean-variance optimization, the risk aversion parameter is quite important.
Mean-Volatility Optimization
Mean-volatility optimization is just like mean-variance optimization except that the value of the utility argument is: utility = "mean-volatility" and the risk.aversion argument will (usually) have a substantially dierent value.
Buy-Hold-Sell List
It is not absolutely necessary to create expected returns for an actively managed portfolio. Instead you can minimize the variance or tracking error using a list of assets that can be sold and a list of assets that can be bought. Suppose that buy.vec and sell.vec are vectors giving the assets that are allowed to be bought and sold. The rst thing to do is to set up the constraints that make sure nothing on the buy list can be sold, and nothing on the sell list can be bought.
6.4. UTILITY-FREE OPTIMIZATION > > > > lower.tr <- rep(0, names(lower.tr) <upper.tr <- rep(0, names(upper.tr) <length(buy.vec)) buy.vec length(sell.vec)) sell.vec
97
S code note These commands create two new vectors containing all zeros and with names that are the buy assets for one of them, and names that are the sell assets for the other. The rep function replicates its rst argument. Then these are used in the call to the optimizer. In this case we assume that we want to minimize tracking error relative to spx. > op.bhs <- trade.optimizer(prices, varian, + long.only=TRUE, gross.value=1.4e6, benchmark="spx", + existing=cur.port, lower.trade=lower.tr, + upper.trade=upper.tr, + universe.trade=c(buy.vec, sell.vec)) The universe.trade argument is restricting the assets that are traded to be only in the buy or sell list. Assets not named are constrained to not trade. You may wish to use the turnover argument to constrain turnover, and other arguments.
6.4
Utility-free Optimization
The idea of utility-free optimization is that you have an ideal target portfolio and you would like to get as close to that target as possible while still obeying all of your portfolio constraints. Expected returns are not used for this and you dont have to have a variance matrix. One of the simplest practical possibilities is a call like:
> op.d1 <- trade.optimizer(prices, dist.center=target.wts, + utility="distance", existing=current.portfolio, + turnover=30000, gross.value=1e6, long.only=TRUE) The key parts of this call are the dist.center argument that takes a representation of the target portfolio, the utility argument, and the turnover argument. The call is taking advantage of the default value of dist.style which is "weight". The target.wts object should be a named numeric vector that sums to 1, where the names are asset ids. Some sort of constraint on turnover is almost surely desirable. That could be accomplished using costs, but the turnover argument is a much easier option. Another constraint that is quite likely in this setting is a tracking error constraint. For this we do need a variance and the bench.constraint argument must be specied.
98
> op.d2 <- trade.optimizer(prices, dist.center=target.wts, + utility="distance", existing=current.portfolio, + turnover=30000, gross.value=1e6, long.only=TRUE, + variance=varian, bench.constraint=c(spx=0.04^2/252)) In this case we are specifying a maximum tracking error of 4% assuming the variance was produced with daily datasee Section 4.12 for more details. Section 4.13 has more information on the arguments that control distance.
6.5
When you either are required to raise cash or have cash to put into the portfolio, you can use optimization to nd a good trade. If there is signicant cash ow, then this process has the benet of keeping the portfolio closer to optimal than it otherwise would bethus reducing the need for large, periodic rebalances. The examples given below assume an active portfoliomerely eliminate the expected.return argument if yours is passive.
99
The formulations given for both injecting and extracting money are implicitly assuming that the optimizer will want to trade at least the amount requested. That doesnt have to be true. A safer approach would be to give turnover an interval, perhaps something like: turnover = cash.required * c(1, 1.01)
6.6
Asset Allocation
Asset allocation as we discuss it here means that we are interested in weights rather than prices and asset units (like shares or lots). We restrict asset allocation to long-only. trade.optimizer can handle such problems. The key step is to create a price vector that is all ones. > aa.prices <- rep(1, length(asset.names)) > names(aa.prices) <- asset.names You then select a gross value that is large enough to satisfy the accuracy that you desire. For example if getting the weight to the nearest basis point is good enough for you, then set the gross value to 10,000. The gross value should be given as an interval that has just one integer in it. The command: > aa.alt1 <- trade.optimizer(aa.prices, avar, + expected.return=aret, utility=mean-var, + risk.aversion=.02, long.only=TRUE, + gross.value=10000+c(-.5, .5)) performs a mean-variance optimization. You can get the weights from the portfolio optimizer object with: > valuation(aa.alt1)$weight A further example of this technique is the multiple scenario analysis on page 156.
6.7
Going Farther
Tasks that you might want to undertake: To add trading costs to your optimization command, see Chapter 9. To add constraints, see Chapter 4 on page 45. To review common mistakes, see Section 10.1. To improve your solution or examine the quality of solutions you are getting, go to Chapter 12 on page 133. To export your solution to a le, see Section 8.3. To perform optimizations with multiple variances, expected returns and/or benchmarks, go to Chapter 14.
100
Chapter 7
7.1
Required Inputs
The trade.optimizer function needs a price vector. The prices argument needs to be a vector of prices with names that are the asset names. The assets in prices control the assets that are used in the problem. Other inputssuch as the variance and linear constraintsmay contain additional assets (which will be ignored). Other than prices there is no single argument that is required. However, there are two concepts that are required: the amount of money in the portfolio has to be constrained somehow, and there has to be something that provides a utility.
Monetary Value
The turnover or the amount of money (both long and short) in the portfolio needs to be constrained. Details can be found on page 45
Utility
At least one of the arguments: variance 101
102
expected.return dist.center must be given. The dist.center argument and its mates are used to do utility-free optimization that is, to minimize the distance to an ideal target portfolio. This is likely to be more common for long-only portfolios, but there is no reason not to do it for long-short portfolios as well. If you do want to do it, see Section 6.4 (which assumes long-only). Assuming distance is out of the picture: If expected.return is not given, then the utility is the minimum variance. If a variance is not given, then the utility is the maximum expected return. If both are given, then the default utility is to maximize the information ratio.
7.2
Examples
This section presents some examples. Feel free to skip over those that dont apply to you. These examples do not include arguments for the cost of trading (see Chapter 9). Nor does it cover several of the constraints (see Chapter 4).
7.2. EXAMPLES
103
S code note These commands create two new vectors containing all zeros and with names that are the long assets for one of them, and names that are the short assets for the other. The rep function replicates its rst argument. These new vectors are then used as the lower.trade and upper.trade arguments:
104
CHAPTER 7. OPTIMIZING LONG-SHORT PORTFOLIOS > opt.list1 <- trade.optimizer(prices, varian, + gross.value=1.4e6, net.value=c(-1e4, 2e4), + lower.trade=long.zero, upper.trade=short.zero, + universe.trade=c(long.names, short.names), + ntrade=50)
The lower.trade and upper.trade arguments to trade.optimizer are limits on the number of units (e.g., lots or shares) that can be traded. Each asset can be given a dierent limit. If what is passed in is a vector with names, then any number of assets may be represented (in any order). So using long.zero as the lower.trade argument says that the assets named by long.zero can not have negative tradesthat is, can only be bought. The same logic applies to the upper.trade argument with short.zero. Another important piece of the command is setting universe.trade so that assets that are in neither long.names nor short.names wont trade at all. This example is a simple casethe lower and upper bounds can be more complicated when there is an existing portfolio. It is possible that using the positions argument (page 54) would be easier in such a case. If, for example, the existing portfolio has a long position in an asset that is on the short list, you may wish to force a sell in the asset that is at least as large as the existing position. The commands above will not do that you need to resort to either the forced.trade argument or the positions argument. > opt.list2 <- trade.optimizer(prices, varian, + gross.val=1.4e6, net.val=c(-1e4, 2e4), + lower.trade=long.zero, upper.trade=short.zero, + universe.trade=c(long.names, short.names), + ntrade=50, forced.trade=c(ABC=-25)) In this example we are forcing a sell of at least 25 units of asset ABC.
Mean-Variance Optimization
To do mean-variance optimization, you need to give both a variance and expected returns and set: utility = "mean-variance" You will want to specify the risk aversion as well. More details of the risk aversion parameter are on page 140. > opt.mv1 <- trade.optimizer(prices, varian, + expected.return=alphas, gross.value=1.4e6, + net.value=c(-1e4, 2e4), ntrade=55, max.weight=.05, + utility=mean-var, risk.aversion=3.7) Alternatively, you can do mean-volatility optimization by using: utility = "mean-volatility"
105
7.3
The optimizer can be used to select trades to take money out of the portfolio, or put money into it. The optimizer makes this process quite painless, even when you want a fair amount of control over the trade. The examples given below assume that the cash management is via the gross value. An alternative would be to control the longs and shorts directly. You would use the long.value and the short.value arguments for this more specic control.
106
7.4
Money Constraints
If you are looking for a control in the optimizer for leverage, there isnt one. The amount of leverage applied is determined by the fund managerleverage is external to the optimization. The decision on leverage comes rst, which determines the gross value and the net valuequantities that the optimizer does need to know. The constraints on the long and short values are not just redundant to the gross and net value constraints. Figure 7.1 shows that if we plot the gross and net value constraints, then these form a rectangle that is diagonal to the long value and short value axes. The constraints displayed in the gure are the gross value between 10 and 12, and the net value between -1 and 2. Constraints on the long value and the short value will be vertical and horizontal sections, so adding these change the shape of the allowable region.
7.5
Real-Time Monitoring
You may want to perform real-time optimization. The prices will change from minute to minute, and if your expected returns change continuously as well, then constantly updating the optimization could be valuable. Such a process is very easy to set up. Here is an example: > repeat{ + prices <- update.prices() # user-defined function + alphas <- update.alphas() # user-defined function + cur.port <- update.portfolio() # user-defined function + cur.opt <- trade.optimizer(prices, varian, + expected.return=alphas, turnover=5e4, ntrade=5, + existing=cur.port) + if(length(cur.opt$trade)) { + deport(cur.opt, what="trade") + } + } S code note In S, repeat performs an innite loop. Generally the body of a repeat loop will contain a test of when the loop should be exited. In this case there is no such testthe optimizations will continue to be performed until you explicitly interrupt the loop. It would be easy enough to use the date function to check the time of day and exit the loop after the close of trading. This example has three functions that you would need to write. These functions update the prices, the expected returns, and the positions in your
107
108
portfolio. If you are willing to assume that the suggested trades always happen, then you could merely have: cur.port <- cur.opt$new.portfolio The technique here might be characterized as looking for the next small thing the size of the trade and the maximum number of assets to trade should probably be small.
7.6
Going Farther
Tasks that you might want to undertake: To add trading costs to your optimization command, see Chapter 9 on page 115. To add constraints, see Chapter 4 on page 45. To review common mistakes, see Section 10.1 on page 121. To improve your solution or examine the quality of solutions you are getting, go to Chapter 12 on page 133. To export your solution to a le, see Section 8.3 on page 113. To perform optimizations with multiple variances and/or multiple expected return vectors, see Chapter 14. If you want a benchmark with a long-short portfolio, see Section 11.2.
Chapter 8
General Use
The core functionality of Portfolio Probe is generating random portfolios and optimizing trades. More detailed description of these activities can be found in other chapters. This chapter focuses on peripheral tasks. In particular there are hints about using the S language.
8.1
Setting Up Data
Before computations can be performed, there needs to be data in place on which to do the computations. Data that we may need include prices, existing positions in the portfolio, expected returns, variances, membership of assets in various groups such as countries and sectors, and so on.
110
strings. A data frame has the freedom to have dierent types of entries in dierent columnsone column could be numeric and another categorical. In this example read.table creates a data frame that has one column, which is numeric, and the asset names are used as the row names. This is then converted from a data frame into a matrix. Finally the drop function changes the object from being a one-column matrix into an ordinary vector. The reason that there is the intermediate step of converting to a matrix is that the row names are not preserved when a data frame is dropped to a vector. (There is a reason for thisyou can just think of it as an unfortunate fact.) Other data will be imported in a similar fashion. Using as.matrix is fairly likely, drop is only appropriate when there is a single column (or row) of data and a simple vector is desired.
Variance Matrix
A variance matrix is often used in Portfolio Probe. If your problem is an asset allocation with at most a few dozen assets, then a sample variance matrix as computed by the var or cov.wt S functions should do (though there may be better methods). For a large number of assets, a factor model or a shrinkage model will probably be more appropriate. You can use any variance matrix that you want. It can come from a commercial source or you can compute it yourself. If you need to compute the variance matrix yourself, there are functions in the BurStFin package that will do that. If you are using R on Windows, then you can get BurStFin with: > install.packages("BurStFin", + repos="http://www.burns-stat.com/R") You can get the source with: > download.packages("BurStFin", some directory, + type="source", repos="http://www.burns-stat.com/R") Once the package is installed for your version of R, you can do: > library(BurStFin) in each session in which you want to use its functionality (this command might be slightly more complicated if you are using S-PLUS). The factor.model.stat function in BurStFin creates a variance matrix using a factor model. The command to get a variance matrix can be as simple as: > retmat <- diff(log(price.history)) > varian <- factor.model.stat(retmat) Alternatively, a variance that shrinks towards the equal correlation matrix can be computed with: > varian <- var.shrink.eqcor(retmat)
111
Here price.history is a matrix of asset prices with the columns corresponding to the assets and the rows corresponding to times (the most recent time last). The calculation produces a matrix of returns which we call retmat in this case. (The vector that is used as the prices argument could be the nal row of the price history matrix.) The log function returns an object that looks like its input, but each element of the output is the natural logarithm of the corresponding element of the input. The diff function when given a matrix performs dierences down each column. The rst row of the result is the second row of the input minus the rst row of the input; the second row of the result is the third row of the input minus the second row of the input; et cetera. note The retmat object holds log returns (also known as continuously compounded returns). You may hear strong opinions on whether simple returns or log returns are preferred. In mean-variance optimization neither is entirely correct, however the results are likely to be quite similarespecially if daily or higher frequency returns are used.
caution Do not use daily returns when your data include assets from markets that close at dierent timesglobal data in particular. The asynchrony of the returns means that the true correlations are higher than those that will be estimated from the data. Thus optimizations will be distorted. Weekly is the highest frequency that should be used with data that are substantially asynchronous. (Note though that techniques do exist to adjust for asynchronysee [Burns et al., 1998].) caution There is the possibility of confusion with the word factor. In S an object that is of type factor is something that is categorical. For example, a vector of the sector or country of assets will (at some point at least) be a factor when used as a linear constraint. A factor model does not contain any factors in this sense. A large amount of the eort in factor.model.stat is dealing with missing values. The default settings are reasonable for long-only optimizations. Possibly a more appropriate matrix for benchmark-relative analyses and long-short optimization would be: > varian <- factor.model.stat(retmat, zero=TRUE) For assets with missing values this biases correlations toward zero rather than towards the average correlation as the previous command does.
112 advanced
The treatment of missing values can have a material impact on the optimization, and it can be benecial to specialize the estimation to your particular application. An easy way to do this is to return the factor model representation: > varfac <- factor.model.stat(retmat, out="factor") When the output argument is set to "factor", the result is a list with components named loadings, uniquenesses and sdev. You can modify these to correspond to an approach appropriate for you. Then you can create the variance matrix from the object: > varian <- fitted(varfac) This works because the result of factor.model.stat when it returns the factor representationvarfac in this casehas a class, and that class has a method for the fitted generic function. The result of this function is the variance matrix that is represented by the factor model.
8.2
The optimization process itself is covered in other chapters. The basics are covered in chapters that depend on what you are doing: Random portfolio generation: Chapter 3. Long-only portfolio optimization (active, passive or utility-free): Chapter 6. Long-short portfolio optimization: Chapter 7.
8.3. POST-OPTIMIZATION
113
Trading costs are discussed in Chapter 9, while Chapter 4 covers constraints. Chapters 10 and 14 have additional informationSection 10.1 on page 121 is particularly recommended. caution If in optimization a penalty is imposed because one or more constraints are broken, then the optimizer will produce a warning. Do not ignore the warning. It could well mean that there is a mistake in the specication. You can re-run the optimizationperhaps with an increased number of iterationsto see if the optimizer nds a solution that does not break any constraints. If that doesnt work, you should probably investigate if you have imposed constraints that dont allow a solution. Section 10.1 on page 121 lists some common mistakes. On the other hand, this method of treating constraints allows backtests to proceed even when the constraints that are imposed are infeasible for some of the time periods. The answers that break the constraints, while not optimal, may be (at least sometimes) reasonable.
8.3
Post-Optimization
This section is specic to optimization. If you are generating random portfolios, then the analogous discussion is Section 3.3. Once you have performed an optimization, there are two tasks remaining see what the trade is, and export the data so that the trade can be implemented.
> opti <- trade.optimizer(prices, varian, long.only=TRUE, + gross.val=1e6) > summary(opti) The summary function provides information on: the utility, costs, and penalties for broken constraints opening and closing positions, etc. valuations of the portfolio and trade realization of linear (and count) and distance constraints You can get some more specic information by printing the optimal portfolio object (which can be done by just typing its name): > opti
114
Alternatively, you could look at an individual component of the object, such as the trade: > opti$trade The prices that summary uses to value the portfolio are, by default, the prices used in the optimization. You can give summary dierent prices: > summary(opti, price=new.prices) You can also get the valuation of the individual assets as well as overall valuation using the valuation function: > valuation(opti) > valuation(opti, trade=TRUE) The default behavior of valuation when given a portfolio object is to give information on the portfolio. The trade can be examined instead by setting the trade argument to TRUE. When valuation is given a vector, the prices must be given: > valuation(opti$trade, price=new.prices) Prices are optional when portfolio objects are giventhe prices used during the optimization are used if none are specied.
Chapter 9
Trading Costs
This chapter covers trading costs. It is possible to constrain costs, so costs can be used when generating random portfolios. This is only likely when imitating a series of trades with random trades. Costs are a tricky part of optimization, but are extremely important.
9.1
Background
If there were any justication for the attitude that optimization is hard, it should be on account of trading costs. However, the problem of trading costs is fundamental to fund managementyou have to confront the issue whether or not you use an optimizer. There are numerous sources of trading costscommissions, bid-ask spread, and market impact are commonly considered. Less obvious costs can be such things as capital gains taxes if a position is sold, and the cost of imperfectly known expected returns. Costs need not be symmetric for buying and selling. You may, for example, want the cost of increasing positions to be more than the cost of decreasing them, because of increased liquidity risk and/or because of the implicit cost of needing to close the position again.
9.2
Specifying Costs
Costs are specied similarly in trade.optimizer and random.portfolio. When generating random portfolios, costs will be ignored unless limit.cost is given (or the utility is constrained). The specication of costs is very exible while simple costs can be given easily. To allow for asymmetry in costs, there are four cost arguments: long.buy.cost long.sell.cost short.buy.cost short.sell.cost 115
116
This allows dierences in the costs for long positions versus short positions, as well as dierences for selling and buying. If there is no dierence between buying and selling, and between long and short positions, then give only the long.buy.cost argument. caution If you give one of the cost arguments other than long.buy.cost but no others, then the named situation will be the only case where there is cost. That seems unlikely to be what you want. Minimally, costs must be given for all of the assets that are allowed to trade in the problem (if costs are given at all). Costs may be given for any number of assets as long as all the tradeable assets are included.
Linear Costs
Linear costs are given with a vector (or a one-column matrix) of costs. For example, to have costs of 30 basis points for all trades, the command would be: > opt1 <- trade.optimizer(prices, ..., + long.buy.cost = prices * .003) To have costs of 50 basis points for short sales and costs of 30 basis points for all other trades, the command is: > opt2 <- trade.optimizer(prices, ..., + long.buy.cost = prices * .003, + short.sell.cost = prices * .005) The ... in these commands and subsequent ones refers to additional arguments that you want in the optimization. The advantage of linear costs is that they are easy to think about and easy to write down. Linear costs are undoubtedly more popular than they otherwise would be because computational engines have generally been unavailable for more realistic models of cost.
Nonlinear Costs
Costs are unlikely to really be linear. It can be quite useful to have non-integer exponents in the trading cost functions. This is allowed via the cost.par argument. cost.par is a vector giving the exponents for the cost function. Consider: > cost.poly [,1] [,2] [,3] [,4] ABC 0.00 0.01 0.02 0.00 DEF 0.03 0.02 -0.01 0.01 > trade.optimizer( ..., long.buy.cost=cost.poly) > trade.optimizer( ..., long.buy.cost=cost.poly, + cost.par=0:3)
117
These two calls to trade.optimizer are equivalent (though there would be a slight computational eciency advantage to the rst one). Polynomial costs are discussed below. A more telling use of cost.par is: > trade.optimizer( ..., long.buy.cost=cost.poly, + cost.par=c(0, 1, 1.5, 2.34)) In this case if ABC trades 30 lots and DEF trades -25 lots, then the cost for ABC will be: 0 + .01(30) + .02(301.5 ) = 3.586 The cost for DEF will be: When cost.par is used, there is a restriction that all of the cost matrices must have the same number of columns as cost.par. In actuality there is no loss of generality as columns of zeros can be added to matrices as required, and cost.par can be the union of all of the desired exponents. Well see in the next section that in a long-short portfolio when a trade makes an asset go from long to short (or vice versa) that there is a choice of having the intercept counted in both instances or just the rst. There is no such choice when using cost.parit doesnt know about zero being the intercept and the full cost is always used. .03 + .02(25) .01(251.5) + .01(252.34) = 17.952
Polynomial Costs
This subsection describes creating polynomial costs. Using the more general cost.par argument is usually preferred, so feel free to skip this subsection. Polynomial costs are given by a matrix. The order of the polynomial is one less than the number of columns in the matrix. The rst column contains the intercept, the second column has the linear coecients, the third column has the quadratic coecients, etc. caution Do not forget the intercept. > WRONG <- cbind(linear.coefs, quad.coefs) > okay.costs <- cbind(0, linear.coefs, quad.coefs) Because it is easy to forget the intercept, a warning is issued if there are non-zero values in any intercept. If you really do have non-zero elements in cost intercepts and you dont want to see the warning about it, then you can set the do.warn argument to c(cost.intercept.nonzero = FALSE). Only the rst part of the name needs to be givenenough needs to be given so that it is unambiguous relative to the menu of names that can be given do.warn. Consider this example matrix given as the costs:
118 > cost.poly [,1] [,2] [,3] [,4] ABC 0.00 0.01 0.02 0.00 DEF 0.03 0.02 -0.01 0.01
If ABC trades 30 lots and DEF trades -25 lots, then the cost for ABC will be: 0 + .01(30) + .02(302) = 18.3 The cost for DEF will be: Note that the matrix needs to have enough columns to encompass the highest order polynomiallower order polynomials will have zeros for their higher order coecients. caution You should make sure that each polynomial that you give makes sense throughout the allowable trading range of the asset. Negative costs, for instance, are often not reasonable. (Though there are special circumstancessuch as crystallizing a loss for tax purposeswhere negative costs are appropriate.) If your costs are just interceptsthere is a xed cost for trading any of an asset, then you need to give a two-column matrix with the second column containing all zeros: > intercept.only <- cbind(intercepts, 0) This is most sensible if the intercepts are not all equalotherwise it is almost the same as constraining the number of assets to be traded. If the portfolio is long-short, then there can be more than one type of trading cost for a single asset. For example, an asset that is long in the existing portfolio can be sold heavily enough that it becomes short. There is then the issue of whether or not both intercepts should be counted in the trade. The doubleconst control parameter determines which is done. If doubleconst is FALSE (the default), then only the rst intercept is used. If doubleconst is TRUE, then both intercepts are counted. .03 + .02(25) .01(252 ) + .01(253) = 150.53
9.3
Power Laws
[Grinold and Kahn, 2000] present an argument that the trading cost per unit of an asset should be commission plus half the bid-ask spread plus some constant times the square root of the quantity: the amount to be traded divided by the average daily volume. The last term involving the square root comes from an inventory model. They suggest that the constant should be on the order of the daily volatility of the asset. Sample code implementing this model might look like: > cost.mat <- cbind(commission + spread[names(prices)] / 2, + volatility[names(prices)] / sqrt(ave.daily.volume[
9.4. ON SCALING COSTS RELATIVE TO UTILITY + names(prices)])) > > trade.optimizer(..., long.buy.cost=cost.mat, + cost.par=c(1, 1.5))
119
The rst command creates a two-column matrix. The rst column relates to the items that are linear (commission and spread), the second column contains the cost involving the square root of the trade volume. When this is used in an optimization, the exponents for these columns are 1 and 1.5, respectively. The second coecient is 1.5 (where naively you may expect it to be 0.5) because the square root rule says that the cost per share depends on the square root of the number of shares traded. (If you think in terms of calculus, we are taking the integral of the cost function.) [Almgren et al., 2005] empirically found a power of 0.6 to t a set of trades on large US stocks. Since a square root is a power of 0.5, their ndings suggest that trading costs grow slightly faster as the size of the trade increases. If you were to use this power law, then the cost.par in the example above would be 1 and 1.6 instead of 1 and 1.5.
9.4
There are (at least) three things we want in the costs that we declare: The cost of a larger trade in an asset is appropriate relative to a smaller trade of that asset. The cost of trading one asset is well-scaled relative to the cost of trading the other assets. The costs are appropriately scaled in light of the utility that is used. The proper scaling of the costs relative to the utilitywhether that is expected returns (and variance), or distanceis the point where the science gets a bit fuzzy. This is also a key issue controlling the merit of the optimization. If the costs given the optimizer are too small, then excessive trading eats up performance. If the given costs are too large, then performance is hurt via lost opportunity. Perfection is not necessary for the optimization to be of value, but attempting perfection is probably not such a bad mission. The added utility of a trade needs to compensate for the transaction cost. Thus the costs should correspond to the expected returns over the expected holding period. To be more specic, suppose that we think it will cost 50 basis points to trade a certain asset. If we are using daily data and our expected return for this asset is 10 basis points, then we will have to have a holding period of 5 days in order to break even. To amortize the cost, it (that is, the 50 basis points) should be divided by the number of days that we expect to hold this asset. If we expect to hold it for 2 days, the optimizer would see a cost of 25 basis points. If we expect to hold it 10 days, the optimizer would see a cost of 5 basis points. (The statement of this example might seem to imply that volatility is of no concernit is of concern.) Lets revisit the example from Section 9.3:
120
CHAPTER 9. TRADING COSTS > cost.mat <- cbind(commission[names(prices)] + + spread[names(prices)] / 2, + volatility[names(prices)] / sqrt(ave.daily.volume[ + names(prices)])) > > cost.amort <- cost.mat / expected.holding[names(prices)] > > trade.optimizer(..., long.buy.cost=cost.amort, + cost.par=c(1, 1.5))
This adds a command which divides (what might be called) the raw costs by how long we expect to hold each asset. The result is the amortized costs. The holding period should be in the same time units as the expected returns. If we have daily returns, then an asset that we expect to hold 25 days should have 25 as its expected holding period. However that same asset should have an expected holding period of about 0.1 when we are using annualized returns. That the expected returns are noisy suggests that the costs should be increased from this rational value to reduce trading. There is another sense of scaling costs. By default trade.optimizer sums the costs from all of the trading and then divides by the gross value of the portfolio. This is the natural thing to do as the expected returns and the variance are also scaled by the gross value when the utility is computed. However, you do have the option to scale by the trade value or not to scale the costs at allthis is controlled by the scale.cost argument.
9.5
Taxes present a cost structure that is entirely dierent from trading costs resulting from the market. In particular the cost will be dierent depending on whether you are buying or selling. More detail on costs due to taxes can be found in [diBartolomeo, 2003] and its references. Several optimizers implement piecewise linear cost functions. While this choice is likely to be due to mathematical tractability, costs (and benets) from taxes really are piecewise linear in some circumstances. The tax liability often depends on the length of time an asset has been held, and there may be several holding periods for a particular asset. Portfolio Probe does not have piecewise linear costs, but you can approximate them with nonlinear costs. Taxes also have a sort of time decay similar to options. Their eect changes as the end of the tax period gets closer.
9.6
Going Farther
Additional steps might be: To look for trouble spots with Chapter 10 on the facing page
Chapter 10
10.1
The phrase garbage in, garbage out has gone slightly out of fashion, but it still has currency. What follows are a few possibilities for garbage.
Data Mangling
The prices are not all in the same currency. It doesnt matter which currency is used, but there needs to be only one. 121
122
There are prices for the wrong quantity. For example, the optimization is thought of in terms of lots, but the prices for some or all of the assets are for shares. There are stock splits, rights issues, etc. that have not been accounted for. Stock splits and rights issues that are not caught can mean that your existing holdings are wrong in the optimization. A missed split in the price history degrades variance estimates that are based on returns created from the price history. The variance is of prices, not returns. The variance matrix of the returns of the assets is needed. If the variance is of the prices, the results will be seriously wrong. (The functions in BurStFin that create variance matrices from data give a warning if prices are used.) The variance and/or expected returns are not properly scaled. The scaling that is chosen does not matter, but it needs to be consistent. The (time) scales of the variance and the expected returns should matchfor example they can both be annualized, or both be daily. You also need such things as benchmark constraints to be scaled to match the variance. Costs need to account for the time scale of the expected returns, and whether or not the expected returns are in percent. The variance matrix is singular. While the Portfolio Probe optimization algorithm will happily handle singular matrices, the results need not be useful. If there are more assets than time points in the returns data, then a factor model or shrinkage estimator rather than a sample variance should be used. This problem is certainly more serious for long-short portfolios, but is problematic for long-only portfolios as well. If the true variance really is singularfor example cash is one of the assets (see page 156)then you should get good results. The problem is if the variance matrix says that there are portfolios that are essentially riskless when they are not. The variance matrix is not symmetric. The optimizers inherently assume that the variance is symmetric. They will get confused if this is not the case. If what you are using as the variance is not symmetric, then you can get a version that is with: > varian.sym <- (varian + t(varian)) / 2
123
Global data should not be used at a higher frequency than weekly unless asynchrony adjustments are made. In addition to dierent opening hours, asynchrony can be caused by assets that do not trade often. The illiquid assets will have stale prices. A model that adjusts for asynchrony due to dierent opening hours is presented in [Burns et al., 1998]. The expected return for a benchmark is not equal to the weighted average of the expected returns of its constituents. This problem is pointed out in [Michaud, 1998]. Minor violations are unlikely to have much aect, but dierences do distort the optimization. The signs of covariances in the variance matrix are reversed for assets that are intended to be short. While this could produce valid results if properly done, it is confusing and totally unnecessary. Portfolio Probe was developed with long-short portfolios in mindbizarre manipulations can be eliminated.
Input Mangling
While many mistakes are caught by the software, it is quite possible to confuse the inputs. One check is to see if the optimizer is using the objective that you intend. You can do this by: > opt.1$objective.utility [1] "mean-variance, risk aversion: 0.02" The name of an argument is inadvertently dropped. For example, you add a starting solution to a command, but forget to name it start.sol so the optimizer thinks it is a dierent argument. Most arguments are checked for validity, but in some cases the checks may not be good enough. You confuse the meaning of start.sol with existing. The start.sol argument should be one or more solutions that are used as starting values for the trade in optimization. The current portfolio should be given in the existing argument. You confuse the meaning of the lower.trade or upper.trade argument. If you habitually abbreviate the lower.trade argument to lower, you may begin to think of the bound as being on the nal portfolio rather than on the trade.
124
Bounds for linear constraints are not properly scaled. Values in lin.bounds need to match the corresponding value in lin.style. If the bounds are in values but the style is in weight, then there probably isnt a real constraint. Alternatively if the bounds are in weights and the style is in value, then the constraint is likely to be impossible. You do not give the lin.abs or lin.trade argument when needed, or reverse the meaning. If the bounds are set for absolute constraints but lin.abs is not given, then the constraints may be inconsistent. Perhaps even worse, they may not be inconsistent and give an answer that on the surface seems okay. You give just one trading cost argument, but it is the wrong argument. If you want costs to be the same whether the position is long or short, and whether you are buying or selling, you need to give the long.buy.cost argument. If you give a dierent argument, then the costs correspond to the named situation only and the costs in other situations are zero. There will be a warning if you do this.
10.2
While warning messages help avoid mistakes, they can be annoying (and counterproductive) if they warn about something that you know you are doing. The do.warn argument of random.portfolio and trade.optimizer can be used to suppress most warnings. Here is an example of its use: do.warn = c(utility.switch=FALSE, value.range=FALSE) Note that abbreviation of the names is allowed as long as the abbreviation is unique among the choices. Some warnings are never suppressedthese are more likely to be mistakes. You should reformulate the command to avoid such warnings whether or not the original is a mistake. The default is TRUE for all entries except converged (which defaults to FALSE since non-convergence of the optimization is seldom of concern). Here are the possible entries for do.warn: benchmark.long.short: A benchmark is used in a long-short portfolio. While this can be correct, it often is not. See Section 11.2. bounds.missing: There are rows missing from the lin.bounds object that should logically be there. The missing bounds are taken to be innite. This is almost certainly caused by something going wrong, or at least sloppy use.
125
converged: (optimization only) Convergence is declared if more than fail.iter consecutive iterations fail to improve the solution. If the maximum number of iterations is performed before this condition occurs, then there is no convergence for the run. When stringency is positive, the convergence is a little more complicated. cost.intercept.nonzero: One or more of the cost arguments have nonzero values for the intercept. (This only applies when polynomial costs are given.) This may well be correct, but it is easy to forget the intercept column in the cost matrix. dist.prices: One or more of the assets that are in prices are not in (a component of) dist.prices and hence are assumed to be zero. dist.style: Since it is easy to make a mistake between giving shares or values to the dist.center argument, a check is made to see if what is stated in dist.style ts into the gross value range that is given or inferred. dist.zero: One or more of the assets in (a component of) dist.center have a non-zero value but a price of zero in dist.prices. extraneous.assets: One or more objects contain data on assets that are not in prices and hence not of use in the computation. ignore.max.weight: One or more assets in the existing portfolio break their maximum weight constraint; and either maximum weights are not forced to be obeyed, or the trade can not be forced to be large enough. See page 49. infinite.bounds: All bounds in lin.bounds are innite, meaning there are no bounds at all. max.weight.restrictive: The vector of maximum weights is very restrictive. That is, the sum of (a subset of) the maximum weights is only slightly more than 1. neg.dest.wt: (optimization only) There is at least one negative destination weight in the utility table. This is almost surely wrong unless the problem is outside typical portfolio optimization. neg.risk.aversion: (optimization only) There is at least one negative risk aversion value. Risk aversion is traditionally thought to be nonnegative, but you could have a special situation. no.asset.names: One or more objects do not have asset names, and hence require that they be in the same order as the assets in prices. Some objects are allowed to not have asset names in order to facilitate quick experimentsasset names are always recommended for production situations. noninteger.forced: There is a forced trade that is constrained such that the trade must be non-integer. This could be something you want to do, but can mean that something is wrong with the specication of constraints.
126
nonzero.penalty: (optimization only) The penalty is non-zero, meaning that not all constraints are satised. notrade: The formulation does not allow any trading. If you really want no trading, then the recommended approach is to set ntrade = 0. novariance.optim: (optimization only) An optimization (other than with a distance) is requested but no variance is given. This can be a reasonable operation for some long-short situations, but is generally suspicious. penalty.size: (optimization only) The risk aversion parameter is large relative to the values in penalty.constraint. In this case the optimizer may have problems getting the constraints to be obeyed. positions.names: The asset names in the positions argument are not the same as the universe of assets in the problem. random.start: (random portfolios only) The start.sol argument is given, which is ignored in random.portfolio. start.sol is sometimes confused with existing (the current portfolio). randport.failure: (random portfolios only) Fewer than the requested number of random portfolios were generated. start.noexist: The start.sol argument is given, but existing is not. This is sometimes what is wanted, but it might indicate that the meaning of start.sol is confused with existing. turnover.max: The maximum turnover is too large to have an eect. If you want to specify a minimum turnover but no maximum turnover, then this warning is avoided by setting the maximum to Inf. utility.switch: (optimization only) The utility (either given or default) is switched to another utility. value.range: One or more of the ranges for monetary value (gross.value etc.) is narrow relative to the prices. variance.list: A compact variance is given, which means that there is no way to check that the assets are the same (and in the same order) as the assets in prices. zero.iterations: The iterations.max argument can be set to zero, but this is not the same as doing no optimization. To get the start.sol trade as the nal trade, set funeval.max to zero or one.
10.3
Cheatsheets
Implied Ranges
Table 10.1 lists the true meaning of a single number for those arguments that represent a range of values. Note that close.number is the outlier in that its single value means exactly that value.
10.4. TROUBLESHOOTING Table 10.1: Implied ranges of argument input gross.value x net.value see Section 4.2 long.value x short.value x turnover x ntrade n port.size n alpha.constraint x var.constraint x bench.constraint x limit.cost (must give the range) close.number n arguments. meaning [x * allowance, x ] [x * allowance, x ] [x * allowance, x ] [0, x ] [0, n] [1, n] [x, Inf) [0, x ] [0, x ] [n, n]
127
Table 10.2: The meaning of threshold inputs. threshold constraint 1 column 2 columns 3 columns trade sell x1 x1 x1 trade buy x1 x2 x2 portfolio short 0 0 NA portfolio long 0 0 x3
4 columns x1 x2 x3 x4
Threshold Inputs
Table 10.2 shows the meaning of the threshold argument when given a matrix with each of the allowable number of columnsa plain vector is the same as a one-column matrix. The subscript of the values in the table indicate the column of the value. So x1 is from the rst column and x2 is from the second column, and so on. A three-column matrix can only be given in the case of a long-only portfolio.
Positions Inputs
Table 10.3 indicates the allowable number of columns for objects given as the positions argument, and the meaning of those columns. See Section 4.7 for more details.
10.4
Troubleshooting
Utility Problems
The utility stays the same when I change the risk aversion. There are at least two possibilities:
128
Table 10.3: The column order for the positions argument. meaning 2 columns 4 columns 8 columns min portfolio value x x x max portfolio value x x x min trade value x x max trade value x x trade sell threshold value x trade buy threshold value x portfolio short threshold value x portfolio long threshold value x
1. The objective is neither mean-variance nor mean-volatilityit could be maximizing the information ratio, minimizing variance or maximizing return. Check the objective.utility component of the output. 2. You are passing in a value for the utable argument. In this case you need to set force.risk.aver to TRUE in order for the risk.aversion argument to override the value that is already in the utility table. Im minimizing the variance, and the utility is exactly zero. The problem is almost certainly that the objective is to maximize the information ratio. You can explicitly set the objective. This does not occur if neither the expected.return nor utable arguments are given. It can happen if you give a vector of expected returns that are all zero.
Portfolio Problems
The optimizer is suggesting trades in which some of the assets trade a trivial amount. Use the threshold argument see Section 4.5. The optimal portfolio has positions that are very small. Use the threshold argument see Section 4.5. Im building a portfolio and the optimizer is having problems getting the monetary value of the portfolio right. One possibility is that the max.weight argument is too small. If there are only a few assets allowed in the portfolio, the sum of allowable maximum weights can be less than 1. The error of max.weight being too small will be caught in simple cases. However, it is possible for the test to be fooled by a combination of restrictions on trading. Another possibility is that the ranges of the money values (gross.value, etc.) are too narrow.
129
The optimizer is not performing a trade that I am insisting upon via the lower.trade or upper.trade argument to trade.optimizer. Use the forced.trade argumentsee Section 4.6. What if I dont have prices? You can do an asset allocation and eectively get weights back. See page 99. Im trying to generate random portfolios with constraints I know work, but it fails. You can set the S language seed to some number and try the call to random.portfolio again. For example: > set.seed(123)
10.5
This section provides a little help with some of the aspects of the S language that you might be most likely to bang your head against. Some hints for the R beginner on the Tutorials page of the Burns Statistics website might also be of help.
Creating Matrices
There are a number of places in this document where we create a matrix. This process generally involves either cbind or rbind. cbind as in column bind and rbind as in row bind. That sounds simple enoughand it is as long as you pay attention to the details: > cbind(A=c(1, 2)) A [1,] 1 [2,] 2 > cbind(A=1, 2) A [1,] 1 2 Both of these forms can be useful, but they give matrices that are oriented in dierent directions.
Debugging
[Burns, 2009] contains a slightly expanded introduction to debugging, and there are a number of additional sources. Here is something to get you started. Suppose we do something and get an error: > update(soft.op1, expected.return=new.alpha) Error in trade.optimizer(prices = priceten, variance = varten: Object "new.alpha" not found
130
Now in this instance it is clear that the problem is the new.alpha that we tried to use. Perhaps we mistyped the name, or we havent created it yet. But the error could be very much more obscureit could be unable to nd an object that weve never heard of, and possibly in a function that we didnt know we were using. You can always do the following to get a sense of where the problem happened: > traceback() 5: trade.optimizer(prices = priceten, variance = varten, ... 4: eval(expr, envir, enclos) 3: eval(call, parent.frame()) 2: update.default(soft.op1, expected.return = new.alpha) 1: update(soft.op1, expected.return = new.alpha) Sometimes just seeing the traceback can be enough to understand what went wrong. Other times, no. For those other cases we can actually go into any of those function calls and see the state at the time of the error. But we can only do that if the ground was prepared before the error happened. Generally it is easy enough to reproduce the error. We want to set the error option to save the contents of the function calls and not just what the calls were: > options(error=dump.frames) > update(soft.op1, expected.return=new.alpha) Error in trade.optimizer(prices = priceten, variance = varten: Object "new.alpha" not found > debugger() Message: Error in trade.optimizer(prices = priceten, : Object "new.alpha" not found Available environments had calls: 1: update(soft.op1, expected.return = new.alpha) 2: update.default(soft.op1, expected.return = new.alpha) 3: eval(call, parent.frame()) 4: eval(expr, envir, enclos) 5: trade.optimizer(prices = priceten, variance = varten Enter an environment number, or 0 to exit Selection: 5 Browsing in the environment with call: trade.optimizer(prices = priceten, variance = varten Called from: debugger.look(ind) Browse[1]> Here we have selected frame 5 (the call to trade.optimizer) and at this point the objects that that function has created are available to us to explore. You can do: Browse[1]> ls() to see what objects are there. When you want to exit, just type c (as in continue): Browse[1]> c
Chapter 11
Special Instructions
This chapter contains instructions for how to overcome specic problems. For example, how to do something that the program perceives as an error. In general, these are longer versions of error or warning messages that you might receive.
11.1
It is an error to declare the portfolio to be long-only when there are short positions in the existing portfolio. If you have short positions currently and you want to create a long-only portfolio, then you should do the following: Declare long.only=FALSE Use positions to set the long-only constraint for the portfolio. For the last item you could alternatively use forced.trade and lower.trade, but positions is the easier approach. The positions argument could be built like: > posarg <- cbind(rep(0, length(prices)), Inf) > dimnames(posarg) <- list(names(prices), NULL) Then used like: positions = posarg
11.2
Naively using a benchmark with a long-short portfolio is probably not what you want to do. Creating a benchmark in Portfolio Probe (and elsewhere) is equivalent to being short the benchmark in the amount of the gross value. If you have a portfolio in the genre of 120/20, then a benchmark is conceptually ne but operationally wrong. If you use the benchmark argument, you 131
132
will be short 140% of the benchmark with a 120/20 portfolio while you want to be short 100%. Whether you have a benchmark in the traditional sense or in the sense that you have a set of liabilities, there are at least two choices: Put the portfolio of liabilities as a short asset in the portfolio and stop it from being traded. Use a variance matrix that is relative to the liabilities. There are two functions in the BurStFin package that may be of use in this situation. The var.add.benchmark function allows you to add a portfolio of assets to a variance matrix. var.relative.benchmark transforms a variance matrix to be relative to one of its assets.
Chapter 12
12.1
A special type of optimization is to just return an object where the solution is the same as the starting value. At rst glance this seems like a useless operation, but in fact there are many reasons to do thishere are a few: See the cost of a solution 133
134 CHAPTER 12. ADJUSTING OPTIMIZATION SPEED AND QUALITY See constraint violations of a solution Get expected returns, variances and so on of random portfolios (this is what randport.eval does) Ensure that another program is doing the same problem Examine the utility of your current portfolio, or of a proposed trade trade.optimizer makes this easy. Give the trade that you want to test as the start.sol argument, and set funeval.max to zero or one. An asset allocation example is: > aa.gs1 <- trade.optimizer(aprice, avar.medium, + aret.medium, gross.value=ten.thou, + long.only=TRUE, utility="mean-variance", + risk.aversion=.05, start.sol=e60b40, funeval=0) > valuation(aa.gs1)$weight equities bonds 0.6 0.4 > aa.gs1$utility.value [1] 2.08 > aa.gs1$alpha.value [1] 6.4 > aa.gs1$var.value [1] 169.6 Note that setting: iterations.max = 0 is not the same. This still allows the pre-iteration and post-iteration operations and the result could be substantially dierent from the starting solution. If the existing portfolio is where you want to stay, then you can use the argument: ntrade = 0
12.2
You may want to reduce the time that an optimization takes because you are doing a large number of optimizations. The only viable way to reduce time is to reduce the iterations.max argument. For a lot of problems 5 iterations is likely to optimize enough that backtests will give the right indication. Zero iterations will almost surely be too few, but even one iteration may not be so terrible as to be useless.
135
12.3
This section briey describes the optimization algorithm so that the suggestions for improving quality in the next section will make sense. The optimization process can be broken into three levels: sub-algorithms iterations runs The algorithm for trade.optimizer is generally described as a genetic algorithm. That is a simplication. Each iteration performs several sub-algorithms some of them genetic (plus simulated annealing), and others can be described as greedy algorithms. A run is a series of iterations. The run ends when too many consecutive iterations fail to nd an improvement. That number of failures is controlled by the fail.iter argument, which has a default of zero. So, by default, any iteration that fails to improve will cause the run to end. If fail.iter is 1, then it takes two consecutive iteration failures to end the run. The default is to perform a single run. Multiple runs are done if the stringency argument is positive. The stringency criterion is satised if the best trade has been found again (in dierent runs) stringency times. For example, suppose stringency is 1. After each run we have the best found so far. If the result of the new run is the same (the same assets traded, the same number of units traded) as the best found previously, then the stringency criterion is met and the process stops. If stringency were 2, then it would take three runs with the same result to end the process. When stringency is positive, there are three types of runs: initial runs nal runs non-convergence run Initial runs perform the optimization as stated. The number of initial runs is determined by the runs.init control argument. If the stringency hasnt been satised in the initial phase, then the nal runs are started. The key dierence is that only assets that were involved in trades in the initial phase are eligible to trade in this phase. The number of nal runs is controlled by runs.final. If the stringency is still not satised at the end of the nal runs, then one more run is done. This run starts with the best solution found in any of the previous runs. The restriction to trading as in the nal runs remains in place. The maximum number of iterations allowed in this run is a multiple of the maximum number of iterations for the other runsthat multiple is given by the nonconverge.mult control argument.
12.4
Improving Quality
The easiestand generally most eectiveapproach to improving quality is to set stringency to 1. You can improve quality by increasing iterations.max and fail.iter. If the converged component of the result is FALSE, then it is probably the case that the best solution has not been found. You can restart the optimization from where it left o: > opt1 <- trade.optimizer(prices, varian, ...) > opt2 <- trade.optimizer(prices, varian, + start.sol=opt1, ...) For more thorough optimization, increase iterations.max and fail.iter:
> opt3 <- trade.optimizer(prices, varian, start.sol=opt1, + iterations=100, fail=10, ...) Another way of doing this same thing would be: > opt3 <- update(opt1, start.sol=opt1,iterations=100, + fail=10) S code note If an object is a list and has a component named call (that is an image of the command that created the object), then the update function will recompute the call and change any arguments that are given in the call to update. The update function can be used on the results of trade.optimizer and random.portfolio. A natural thing to do if you want a quality solution is to increase the maximum number of iterations to something like 100 or 1000. That is not the best approach. For very large problems 100 iterations might be useful, but increasing the number of runs and increasing the number of iterations slightly is usually a better approach.
12.5
The optimization algorithm is random (pseudo-random to be technical). You get a consistent answer for any particular problem because the same random seed is used by default. You can get dierent paths (and hence probably dierent answers) by setting the seed argument to NULL. One way of getting a new optimization is: new.optobj <- update(orig.optobj, seed=NULL) You can create a list of a number of solutions for comparison with commands similar to:
137
opt.list <- vector("list", 10) for(i in 1:10) opt.list[[i]] <- update(orig.opt, seed=NULL) summary(unlist(lapply(opt.list, function(x) x$results["objective"])), digits=7) summary(unlist(lapply(opt.list, function(x) trade.distance(x, orig.opt))), digits=7)
Here weve created an empty list, and lled it up with solutions to the problem. Finally we looked at the distribution of the achieved objective, and the distribution of trade distances from the original solution. S code note Some care needs to be used when doing seed=NULL in a call to update. If seed was used in the original call (to trade.optimizer), then the result will be to use the default value of seed (which is xed) rather than creating a new random seed. The update function creates its own call to the function and then evaluates that call. Saying arg=NULL removes arg from the call if it was in the original call.
Chapter 13
Utility
Utility is ignored in random portfolios (but see Section 3.7), however it is central to optimization. The descriptions of the utilities use a few entities: The weight vector w. The vector of expected returns . The variance matrix V. The transaction costs C(w). In long-only portfolios all of the elements of w are non-negative and they sum to 1. In long-short portfolios the weights may be negative, and it is the absolute values of the weights that sum to 1that is, the weight of an asset is its value in the portfolio divided by the gross value of the portfolio. C (w ) is actually an abuse of notation for the transaction cost. Transaction cost obviously depends on the existing portfolio. Cost is not strictly a function of weight unless the gross value for the portfolio is constant.
13.1
This utility (which is the default if there is both a variance and expected returns) is specied with the argument: utility = "information ratio" or utility = "exocost information ratio" The generic information ratio problem is to maximize: T w wT V w
140
This inherently assumes zero costs. There are two approaches (at least) to incorporating costs. The rst is to subtract the trading cost from the expected value: T w C(w) wT V w This is what you get with: utility = "information ratio" The second approach is to have a separate term for costs: T w wT V w C(w)
You get this second version with: utility = "exocost information ratio"
13.2
Mean-Variance Utility
Mean-variance utility is invoked with: utility = "mean-variance" With mean-variance utility we maximize: T w wT V w C(w) where is the risk aversion parameter and w satises all of the constraints. Many implementations have a one-half in the variance term, meaning that the risk aversion parameter in those cases is a factor of two dierent than here. Also in some optimizers the parameter used divides the variance rather than multiplies itin which case it is a risk tolerance parameter. The mean-variance risk aversion parameter is invariant to annualization. As long as both expected.return and variance are for the same time scale, it doesnt matter what time scale it is. However risk aversion is aected when expected.return and variance are for returns that are put into percent. When percent returns are used, the risk aversion is divided by 100. [Kallberg and Ziemba, 1983] classify risk aversion greater than 3 as very risk averse, 1 to 2 as moderate risk aversion, and less than 1 as riskythese numbers would be divided by 100 for data representing percent returns. Some additional sense of suitable values for the risk aversion parameter may be found in [Burns, 2003b]. All of the above are focused on long-only portfolios. In long-short portfolios the variance is a smaller quantity so it takes a larger risk aversion to have the same eect. In general risk aversion is likely to be larger in long-short situations than for long-only.
141
13.3
Mean-Volatility Utility
Mean-volatility utility is invoked with: utility = "mean-volatility" With mean-volatility utility we maximize: T w wT V w C(w) where is the risk aversion parameter and w satises all of the constraints. The mean-volatility risk aversion does depend on annualization. For example the risk aversion for annual data is 252 times the risk aversion for daily data. But volatility aversion is invariant to whether or not the data are in percent.
13.4
Minimum Variance
Minimum variance utility is specied with: utility = "minimum variance" The minimum variance utility minimizes: wT V w + C(w) over the w that satisfy the constraints. This utility is used when: The utility argument is set to be "minimum variance". The expected.return argument is not given (or is NULL) and distance is not being minimized. The utility is either mean-variance or mean-volatility and risk.aversion is Inf.
13.5
The maximum expected return utility is specied via: utility = "maximum return" With this utility we maximize: T w C(w) where w must satisfy the constraints. This utility is used when: The utility argument is set to be "maximum return".
142
The variance argument is not given (or is NULL) and distance is not being minimized. The utility is mean-variance or mean-volatility and risk.aversion is 0. In the last of these items, the stated utility in the output remains mean-variance or mean-volatility, but a look at the denitions shows that it is really maximum expected return.
13.6
Minimum Distance
This is specied with: utility = "distance" This minimizes the distance from the portfolio to the specied target portfolio plus the trading costs. More details are found in Section 6.4.
13.7
Going Farther
Chapter 14 discusses utilities where multiple variances and/or expected return vectors are given.
Chapter 14
Advanced Features
This chapter will be of use if you have or want: Multiple variances Multiple expected return vectors Multiple benchmarks Compact variances (seldom recommended) If you merely want constraints on multiple benchmarks, that is easy and is discussed on page 72.
14.1
Multiplicity
The computation that is performed is really controlled by three entities: the alpha table, the variance table and the utility table. In the standard case of no multiplicity, these entities can safely be ignored. When you do have multiplicity, then understanding what these objects are saying is a good idea. If you understand them, then you can check to make sure that it is doing what you want done. If it isnt doing what you want automatically, then you can tell it to do what you do want. If you are generating random portfolios but not optimizing, then you can skip parts of this chapter. The recommended route in this case is: Section 14.2 Alpha and Variance Tables Section 14.3 Variance Constraints Section 14.4 Expected Return Constraints Table 14.1 describes arguments to trade.optimizer (and random.portfolio) that are useful when dealing with multiplicity. All of these arguments have defaultsyou only need to give values for these if the default behavior is not what you want. 143
144
Table 14.1: Arguments for multiple variances and expected returns. Argument Description vtable gives the variance-benchmark combinations atable gives the expected return-benchmark combinations utable describes all the combinations to get utilities quantile states which quantile of the multiple objectives to use dest.wt gives weights to the multiple objectives if desired
14.2
The purpose of the alpha table is to state what combinations of expected return vectors and benchmarks will be used in the problem. Likewise the variance table identies variance-benchmark combinations that will be used. An alpha table is a matrix with two rows and an arbitrary number of columns. In simple cases the alpha table will look like: > optobj$atable [,1] alpha 0 benchmark -1 attr(,"benchmarks") [1] "" Or if a benchmark is given, it will look something like: > optobj$atable [,1] alpha 0 benchmark 500 attr(,"benchmarks") [1] "spx" The rst row gives the zero-based index of the expected return. If there is only one set of expected returns, then this is always zero. If there are no expected returns, this is negative one. The second row gives the zero-based index of the corresponding benchmark. A negative number in this row means there is no benchmark. There is a benchmarks attribute which is a character representation of the benchmarks. When you are passing an object in as the atable argument, the second row of the matrix itself is overwritten with the information in the benchmarks attribute (which must be present). Variance tables are essentially the same as alpha tables. The rst row, of course, corresponds to variances rather than alphas. The other change is that there is a third row which states if this combination is only used in the utility (value 1) or if it is used in constraints as well (value 0). This information is used to more eciently generate random portfolios. As with alpha tables when you pass an object in as the vtable argument, the second row is overwritten with the information given in the benchmarks attribute of the object.
145
Here is an example of the alpha table and variance table from an optimization that has two variance matrices, three expected return vectors, and no benchmarks: > opt1$atable [,1] [,2] [,3] alpha 0 1 2 benchmark -1 -1 -1 attr(,"benchmarks") [1] "" "" "" > opt1$vtable [,1] [,2] variance 0 1 benchmark -1 -1 utility.only 1 1 attr(,"benchmarks") [1] "" "" Here are the tables from an optimization with the same variance and expected returns but with two benchmarks: > opt2$atable [,1] [,2] [,3] [,4] [,5] [,6] alpha 0 1 2 0 1 2 benchmark 8 8 8 9 9 9 attr(,"benchmarks") [1] "Index1" "Index1" "Index1" "spx" "spx" "spx" > opt2$vtable [,1] [,2] [,3] [,4] variance 0 1 0 1 benchmark 8 8 9 9 utility.only 1 1 1 1 attr(,"benchmarks") [1] "Index1" "Index1" "spx" "spx" note The values in the alpha table, variance table and utility table are zero-based because these entities are sucked directly into C where indexing is zero-based rather than one-based as in the S language.
14.3
Variance Constraints
This section discusses the argument: var.constraint Simple use of this argument is discussed on page 145. This argument also handles problems that are too complex for the bench.constraint argument. The var.constraint argument can be:
146
a plain vector (with one or more values) with or without names a onecolumn matrix with or without row names a two-column matrix with or without row names Plain vectors are equivalent to one-column matrices. One-column matrices are equivalent to two-column matrices where the rst column is all zeros. The names on vectors or row names of matrices specify the zero-based index of the column of the variance table. When there are no names, then the rst column(s) are implied (in order). So all six of the following are specifying the same constraints: var.constraint var.constraint var.constraint var.constraint var.constraint var.constraint where: > vc.vec [1] 2.4 1.5 > vc.1mat [,1] [1,] 2.4 [2,] 1.5 > vc.2mat [,1] [,2] [1,] 0 2.4 [2,] 0 1.5 > vc.vec.nam 0 1 2.4 1.5 > vc.1mat.nam [,1] 0 2.4 1 1.5 > vc.2mat.nam [,1] [,2] 0 0 2.4 1 0 1.5 All of these are saying the variance-benchmark combination in the rst column of the variance table has an upper bound of 2.4 and the variance-benchmark combination in the second vtable column is to be no more than 1.5. If you were to give the equivalent of vc.vec.nam directly, you would do: var.constraint = c(0=2.4, 1=1.5) A very similar command means something quite dierent: = = = = = = vc.vec vc.1mat vc.2mat vc.vec.nam vc.1mat.nam vc.2mat.nam
147
means that the (rst) variance is constrained to be at most 1.5 but no less than 2.4. (In this case you will get an error, but you can not depend on getting an error when you confuse the direction of matrices.) Two-column matrices give lower bounds as well as upper bounds on the variance. Lower bounds are most likely to be useful in generating random portfolios, but could potentially be of interest in optimization as well. Consider the matrix: > varcon.m1 <- rbind(c(1.2, 1.4), c(1.3, 1.5)) > varcon.m1 [,1] [,2] [1,] 1.2 1.4 [2,] 1.3 1.5 Suppose that there are three columns in vtable (perhaps the variance argument is a three-dimensional array with 3 slices in the third dimension). Then the command: var.constraint = varcon.m1 would say that the rst variance-benchmark combination is restricted to be between 1.2 and 1.4, the second variance-benchmark is restricted to be between 1.3 and 1.5, and the third is unrestricted. Now lets add some row names. > varcon.m2 <- varcon.m1 > dimnames(varcon.m2) <- list(c(2, 0), NULL) > varcon.m2 [,1] [,2] 2 1.2 1.4 0 1.3 1.5 The command: var.constraint = varcon.m2 says that the third variance-benchmark is to be between 1.2 and 1.4, the rst variance-benchmark is restricted to be between 1.3 and 1.5, and the second variance-benchmark is unconstrained. Some examples with multiple variances can be found on page 153.
14.4
148
Simple use of this argument is discussed on page 69. Use of this argument is analogous to the advanced use of var.constraint (page 145) except that a one-column matrix is a lower bound rather than an upper bound, and indices refer to expected return-benchmark combinations (columns of atable) rather than to variance-benchmark combinations. So for instance: alpha.constraint = c("1"=5.6, "0"=2.3) is equivalent to alpha.constraint = ac.2mat.nam or alpha.constraint = ac.2mat where: > ac.2mat.nam [,1] [,2] 1 5.6 Inf 0 2.3 Inf > ac.2mat [,1] [,2] [1,] 2.3 Inf [2,] 5.6 Inf
14.5
Multiple Utilities
This section does not pertain to generating random portfolios (unless a utility constraint is imposed). When there are multiple columns in the alpha or variance tables, there are usually multiple utilities for any given trade. In order to do an optimization, we need to combine that set of numbers into just one number. A common approach is to get a min-max solution. This process consists of always picking out the worst answer for each tradewe then try to get the best of these worst values. Another approach would be to take the median of the values for a trade. This represents an optimistic attitude, while min-max is very pessimisticminmax assumes that nature will do its worst against us. The quantile argument allows us to choose either of these, or something in between. The useful range of values for quantile is 0.5 (the median) to 1 (the maximumyielding the min-max solution). Each of the individual utilities for a trade is called a destination. What goes into a destination is under user control. For example, if some destinations are deemed to be more important than others, they can be given more weight with the dest.wt argument.
149
There are two sorts of destination weight. Each utility has a weight within its destination. The within destination weights can be used to create a weighted average of utilities within a single destination; they can also be used to modify the utility as demonstrated in the dual benchmark example which starts on page 150. The weights in the dest.wt argument, on the other hand, are weights among the destinations which are used along with the quantile argument to control the selection of the objective from among the destinations. For example in scenario analysis (see page 156) dest.wt could be the probability assigned to each scenario. note The answer when dest.wt is not given is, in general, slightly dierent than when dest.wt is given and all of the weights are equal. The actual problem that is done is mostly specied by the utility table.
14.6
Utility Tables
The utility table is a matrix that gives the combinations of variances, expected returns and benchmarks that are to be used and what to do with them. Here is the utility table from our example of two variances and three expected returns: > opt1$utable alpha.spot variance.spot destination opt.utility risk.aversion wt.in.destination [,1] [,2] [,3] [,4] [,5] [,6] 0 1 2 0 1 2 0 0 0 1 1 1 0 1 2 3 4 5 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
The rst row is the expected returnthis is the zero-based index of the columns of atable. That is, it indicates a particular expected return-benchmark combination. (But note that this is the zero-based location of the distance for columns corresponding to distance utilities.) The second row is similar to the rst except that it is columns of the vtable. The third row is the zero-based index of the destination for the utility with this combination of alpha and variance. The utility for the combination is computed, multiplied by its weight in the destination (the sixth row of the utility table) and added to whatever is there. That is, a destination may hold a weighted average of utilities. If a destination is a negative number, the utility is not placed into a destinationthese are combinations that are presumably to be used in a constraint. There must be at least one zero in the destination row, and the destinations must be numbered from zero upwards with no integers skipped. The fourth row is an indicator for the type of utility to be performed. The allowable codes are given in Table 14.2.
150
CHAPTER 14. ADVANCED FEATURES Table 14.2: Utilities codes for the utility table. Code 0 1 2 3 4 5 6 Meaning mean-variance information ratio exocost information ratio minimum variance maximum return mean-volatility distance
The fth row holds the risk aversion parameter for the cases where it is usedthe value is ignored in the other cases. When the utable argument is given, then the risk aversion in the utility table is used rather than using the value of the risk.aversion argument unless force.risk.aver is set to TRUE. The example utility table above species that six utilities are to be computed and each put into a dierent destination. Each combination of expected return and variance is used, and the information ratio is the utility in all cases.
14.7
Multiplicity Examples
This section provides a few examples of using multiple variances, expected returns and benchmarks.
Dual Benchmarks
Dual benchmarks can be used, for example, to incorporate predictions about a future rebalancing of the benchmark. Due to the relative movements that have occurred since the last rebalance, we may have predictions of what assets will enter and exit the benchmark and with what weight. The portfolio can be optimized against both benchmarksthis allows the portfolio to be rebalanced to some extent against the new benchmark while still having protection relative to the current one. This can mean a cheaper rebalance if it is done before others do their rebalancing. If our benchmark is spx and our prediction of the post-rebalance benchmark is newspx, then minimizing variance relative to the two benchmarks can be done as: > opts1 <- trade.optimizer(prices, varian, + long.only=TRUE, gross.value=1e6, existing=cur.port, + utility = "minimum variance", + benchmark=c("spx", "newspx"), quantile=1) Often a min-max solution is found for dual benchmarks. To get such a solution, set quantile to 1. The min-max solution says that we want to minimize the worst tracking error. It is typical (though not always true) that the utility in the optimal solution is the same for both benchmarks:
14.7. MULTIPLICITY EXAMPLES > sqrt(252 * opts1$var.values) [1] 0.009029673 0.009029442 > opts1$utility.values [1] 3.235515e-07 3.235350e-07
151
In this example the tracking errors are the same at 90 basis points, and the utilities are equal as well. A reasonable complaint about this optimization is that it treats both benchmarks equally even though the new benchmark is speculative while the current one is known precisely. It is probably more reasonable to insist on a smaller tracking error against the current benchmark. A dierence in the tracking errors is easily achieved with the addition of another argument. The utility table needs to be changed slightly from its default. So we recover the utility table from the original optimization, make the change, then do a new optimization: > utab1 <- opts1$utable > utab1 [,1] [,2] alpha.spot 0 0 variance.spot 0 1 destination 0 1 opt.utility 0 0 risk.aversion 1 1 wt.in.destination 1 1 > utab1[6,1] <- 1.4 > utab1 [,1] [,2] alpha.spot 0.0 0 variance.spot 0.0 1 destination 0.0 1 opt.utility 0.0 0 risk.aversion 1.0 1 wt.in.destination 1.4 1 All that we have done is change the weight-in-destination value for the rst (current) benchmark from 1 to 1.4. Now the optimization yields: > opts2 <- trade.optimizer(prices, varian, + long.only=TRUE, gross.value=1e6, existing=cur.port, + utility = "minimum variance", + benchmark=c("spx", "newspx"), quantile=1, + utable=utab1) > sqrt(252 * opts2$var.val) [1] 0.008227913 0.009735447 > opts2$utility.val [1] 3.761031e-07 3.761069e-07 The utility values are still equal, but now the utility represents something different for the current benchmark than for the new benchmark, so the tracking
152
errors are dierentnow they are 82 basis points for the current benchmark and 97 for the new benchmark. There is not a limit on the number of benchmarks that you can usejust add the names to the vector given as the benchmark argument. If you have an active portfolio, you may want to put on constraints relative to each benchmark. This is done precisely the same as with a single benchmark constraint. Adding a second benchmark constraint is just like adding any other constraintit does not create multiple utilities. An example is given on page 95.
153
The utility table we created says to maximize the information ratio. We need to put a name on the object given as the variance constraint because we want to constrain the variance-benchmark combination given in the second column of the variance table (not the rst column).
> varmult <- array(c(vstat[vnam, vnam], vfund[vnam, vnam], + vmacro[vnam, vnam]), c(length(vnam), length(vnam), 3), + list(vnam, vnam, NULL)) S code note A three-dimensional array is a generalization of a matrixinstead of a dim attribute that has length 2, it has a length 3 dim. Likewise, the dimnames is a list that has three components. We need each of the variance matrices to be in a slice of the third dimension. In the example above, we are ensuring that the matrices we are pasting together all correspond to the same assets in the same order by subscripting with vnam. You need to decide which quantile to optimize. The best quantile to use probably depends on how the variance matrices relate to each other and on the type of optimization that you are performing. Once this decision is made, the optimization is performed as with a single variance: > opts.m1 <- trade.optimizer(prices, varmult, + long.only=TRUE, gross.value=1e6, expected.return=alphas, + bench.constrain = c(spx=.05^2/252), quantile=.7) This command is maximizing the information ratio with a tracking error constraint as is done on page 94. In this case the tracking error is constrained relative to each of the variance matrices. The information ratio that is being optimized is the 70% quantile of the set of information ratios using each of the variances.
154
If you are doing a variance constraint, for instance maximizing the information ratio in a long-short portfolio as on page 103, then the approach is slightly dierent. You need to make the variance constraints argument the proper length yourself: > opts.m2 <- trade.optimizer(prices, varian, + expected.return=alphas, gross.value=1e6, + net.val=c(-1e4, 2e4), ntrade=55, max.weight=.05, + var.constraint=rep(.04^2/252, 3)) If the value given as the variance constraint werent replicated, then only the rst variance would be constrained. The var.constraint argument is the more general. Benchmark constraints actually create variance constraintsthe bench.constraint argument only exists for convenience. (When more than one variance is given, bench.constraint puts the constraint on each of the variances.) More control can be gained over variance constraints by putting names on the vector given. The names need to be the zero-based index of the columns of the variance table (see page 145). For example to constrain the rst variance to a volatility of 4% and the third to a volatility of 5%, the command would be: > opts.m3 <- trade.optimizer(prices, varian, + expected.ret=alphas, gross.val=1e6, + net.val=c(-1e4, 2e4), ntrade=55, max.weight=.05, + var.constraint=c("0"=.04^2/252, "2"=.05^2/252)) S code note The names given inside the c function need to be put in quotes because they are not proper names of S objects. S would try to interpret them as numbers if they were not in quotes. If any element of the var.constraint argument is named, then they all need to be. When there are no names, the columns of the variance table are used in order. A standard use of multiple variances is to optimize with one, and check the tracking error or volatility with the other. One good reason to evaluate the solution on a variance not used in the optimization is because of the bias created in the optimization processthe tracking error or volatility that is achieved in the optimization is a downwardly biased estimate of what will be realized. Using multiple variances in the optimization will reduce the amount of bias in any particular variance matrix, though some bias will still remain.
155
Credit Risk
The variance matrices that we have talked about have all been measures of market risk. There are other sources of riskcredit risk, for examplethat could be included in an optimization. For more on credit risk, see for example [Gupton et al., 1997]. While well speak of credit risk here, keep in mind that other forms of risk could be used instead or in addition. Suppose that we have var.market and var.credit that represent the two types of risk. While it may be possible to scale the credit risk so that it is comparable to the market risk, it may be easier to accept that they are dierent. If the two forms of risk are not comparable, then you can perform the optimization with the market risk as you normally would and put a constraint on the credit risk. When deciding how to constrain the credit risk, the rst thing to do is explore its limits. One end of the range is when credit risk is ignored entirely in the optimization, the other end is when the credit risk is minimized with no concern for the market risk. > op.cred0 <- trade.optimizer(prices, var.market, ...) > trade.optimizer(prices, var.credit, ..., + funeval=0, start.sol=op.cred0)$var.value [1] 26.02844 > trade.optimizer(prices, var.credit, + utility="minimum variance", ...)$var.value [1] 11.92467 So the range of interest is about 12 to 26. If we decided to constrain the credit risk to 18, then we would do: > var.mult <- array(c(var.market[anam, anam], + var.credit[anam, anam]), c(length(anam), + length(anam), 2), list(anam, anam, NULL)) > op.cred1 <- trade.optimizer(prices, var.mult, ..., + var.constraint=c("1"=18)) The rst command creates the three dimensional array containing the two types of risk (taking care to make sure the data line up properly). The second command does the optimization. The variance constraint is a named vector where the name is the zero-based index to the third dimension of the variance array. In this case the name is 1 because we are constraining the second variance. But op.cred1 is not the optimization that we are aiming for. The credit risk is used in the utility as well as being constrained. We need to tell the optimizer not to use credit risk in the utility by passing in a utable argument. We can make a simple change to the utility table from the optimization weve just done. > op.cred1$utable [,1] [,2] alpha.spot 0 0 variance.spot 0 1 destination 0 1 opt.objective 1 1
156
CHAPTER 14. ADVANCED FEATURES risk.aversion 1 1 wt.in.destination 1 1 > uticred <- op.cred1$utable > uticred[3,2] <- -1 > uticred [,1] [,2] alpha.spot 0 0 variance.spot 0 1 destination 0 -1 opt.objective 1 1 risk.aversion 1 1 wt.in.destination 1 1 > > op.cred2 <- trade.optimizer(prices, var.mult, ..., + var.constraint=c("1"=18), utable=uticred)
The change we made was to say that the column with credit risk in it shouldnt go into a destination. One indication that we are doing it right is that the utility.values component of op.cred2 has length one, while it is length two for op.cred1. Since we only want one variance in the utility, the utility that we want will have length one.
Multiple Scenarios
Scenario analysis creates some hypotheses about what might happen in the future, and then tries to pick the best course of action. The assumption is that the future will look like one of the scenarios, or perhaps some combination of scenarios. Our rst step is to create some data. We will create a medium scenario, a crash scenario and a prosperous scenario. (The probability that these numbers are reasonable is close to zero.)
> anam4 <- c("equities", "bonds", "commodities", "cash") > acor.medium <- matrix(c(1, .2, .1, .2, 1, .1, .1, .1, 1), + 3, 3) > acor.medium equities bonds commodities equities 1.0 0.2 0.1 bonds 0.2 1.0 0.1 commodities 0.1 0.1 1.0 > acor.crash <- matrix(c(1, -.3, .3, -.3, 1, -.2, .3, + -.2, 1), 3, 3) > acor.crash [,1] [,2] [,3] [1,] 1.0 -0.3 0.3 [2,] -0.3 1.0 -0.2 [3,] 0.3 -0.2 1.0 > acor.prosper <- matrix(c(1, .4, .3, .4, 1, .2, .3, + .2, 1), 3, 3)
157
> acor.prosper [,1] [,2] [,3] [1,] 1.0 0.4 0.3 [2,] 0.4 1.0 0.2 [3,] 0.3 0.2 1.0 > avol.medium <- c(20, 8, 12) > avol.crash <- c(35, 16, 20) > avol.prosper <- c(15, 5, 7) > avar.medium <- rbind(cbind(t(acor.medium * avol.medium) * + avol.medium, 0), 0) > avar.crash <- rbind(cbind(t(acor.crash * avol.crash) * + avol.crash, 0), 0) S code note The last command creating avar.crash does several things at once. First (in the center of the command) it multiplies the correlation matrix times the volatility vector which multiplies each row of the correlation matrix by the corresponding volatility. Then it transposes the matrix and multiplies again by the volatility vectorthis is now multiplying what originally were the columns of the correlation matrix by their corresponding volatilities. Then it binds on a column of all zeros (for cash), and nally binds on a row of all zeros. Below you can see the result of all this manipulation once the asset names are put onto the variance matrix. avar.prosper <- rbind(cbind(t(acor.prosper * avol.prosper) * avol.prosper, 0), 0) dimnames(avar.medium) <- list(anam4, anam4) dimnames(avar.crash) <- list(anam4, anam4) dimnames(avar.prosper) <- list(anam4, anam4) avar.medium equities bonds commodities cash equities 400 32.0 24.0 0 bonds 32 64.0 9.6 0 commodities 24 9.6 144.0 0 cash 0 0.0 0.0 0 > avar.crash equities bonds commodities cash equities 1225 -168 210 0 bonds -168 256 -64 0 commodities 210 -64 400 0 cash 0 0 0 0 > avar.prosper equities bonds commodities cash equities 225.0 30 31.5 0 bonds 30.0 25 7.0 0 commodities 31.5 7 49.0 0 cash 0.0 0 0.0 0 > aret.medium <- c(8, 4, 5, 3) > + > > > >
CHAPTER 14. ADVANCED FEATURES aret.crash <- c(-20, 6, -9, 3) aret.prosper <- c(25, 3.5, 8, 3) names(aret.medium) <- anam4 names(aret.crash) <- anam4 names(aret.prosper) <- anam4 aret.crash equities bonds commodities -20 6 -9 > aret.prosper equities bonds commodities 25.0 3.5 8.0
Since we are doing asset allocation, there are no prices for the assets and we want the answer to be the weights of the assets. Hence we want to create a price vector that is all ones and set the gross value to a useful value. > aprice <- rep(1, 4) > names(aprice) <- anam4 > aprice equities bonds commodities cash 1 1 1 1 > ten.thou <- 10000 + c(-.5, .5) > op.medium <- trade.optimizer(aprice, avar.medium, + aret.medium, gross.value=ten.thou, long.only=TRUE, + utility="mean-variance", risk.aversion=.02) > op.medium$new.portfolio / 100 equities bonds commodities cash 27.86 20.83 28.69 22.62 Above we have performed a mean-variance optimization with the medium data, and then shown the weights of the resulting portfolio in percent. (The variance matrix is not positive denite but the optimizer doesnt carenote, though, that no more than one zero variance should be in a single variance matrix.) In order to perform simultaneous optimization, we need to put the three variance matrices into a three-dimensional array, and the expected return vectors into a matrix: > avar.all <- array(c(avar.crash, avar.medium, avar.prosper), + c(4, 4, 3)) > dimnames(avar.all) <- list(anam4, anam4, NULL) S code note A three-dimensional array is a generalization of a matrixinstead of a dim attribute that has length 2, it has a length 3 dim. Likewise, the dimnames is a list that has three components. We need each of the variance matrices to be in a slice of the third dimension. (We also need the order of assets within each matrix to be the same, but that is already true.)
159
> aret.all <- cbind(crash=aret.crash, medium=aret.medium, + prosper=aret.prosper) > aret.all crash medium prosper equities -20 8 25.0 bonds 6 4 3.5 commodities -9 5 8.0 cash 3 3 3.0 Min-Max Solution The most common approach to simultaneous optimization is to maximize the minimum utilityalso known as Pareto optimality. To clarify: each allocation will produce a utility for each scenario, the optimizer only pays attention to the worst utility for an allocation (with no regard to which scenario produces that utility); it then nds the allocation that does best with respect to the worst utility. The aim, then, is to never do really badly. Before we do the actual optimization, we need to do some ground work. > op.0 <- trade.optimizer(aprice, avar.all, aret.all, + gross.value=ten.thou, long.only=TRUE, + utility="mean-variance", iterations=0) # (warning message omitted) > utab3 <- op.0$utable > utab3 [,1] [,2] [,3] [,4] [,5] [,6] [,7] [,8] [,9] alpha.spot 0 1 2 0 1 2 0 1 2 variance.spot 0 0 0 1 1 1 2 2 2 destination 0 1 2 3 4 5 6 7 8 opt.utility 0 0 0 0 0 0 0 0 0 risk.aversion 1 1 1 1 1 1 1 1 1 wt.in.destination 1 1 1 1 1 1 1 1 1 The default behavior is to get all of the combinations of expected returns and variancesin this case we have three of each so there are 9 combinations. For our scenario analysis, though, we want only the combinations where the expected returns and the variances matchthree combinations. In order to get the behavior that we want, we need to provide a suitable matrix as the utable argument. We could build a matrix from scratch, but it is slightly easier to start with the one that is wrong. There are three changes that we need to make to this matrix. We need to select the three columns that we want, we need to reset the destinations so that they are numbered from zero through one less than the number of combinations, and we should specify the risk aversion parameter that we want. > > > > utab3 <- utab3[, c(1, 5, 9)] utab3["destination", ] <- 0:2 utab3["risk.aversion", ] <- 0.02 utab3
CHAPTER 14. ADVANCED FEATURES [,2] 1.00 1.00 1.00 0.00 0.02 1.00 [,3] 2.00 2.00 2.00 0.00 0.02 1.00
At this point we are ready to do the actual optimization. In addition to needing the utable argument, the quantile argument needs to be 1 in order to get a min-max solution: > op.minmax <- trade.optimizer(aprice, avar.all, aret.all, + gross.value=ten.thou, long.only=TRUE, + utility="mean-variance", utable=utab3, quantile=1) > op.minmax$new.portfolio / 100 equities bonds cash 0.91 34.82 64.27 A look at the utility values shows typical behavior of a min-max solutionthe solution has the same utility for two of the scenarios. General Simultaneous Optimization Min-max optimization protects against the very worst outcome. However, it may really be buying too much insurance. We can change the quantile argument to a number that is less than 1 (but only numbers that are at least one-half are sensible): > op.q7 <- trade.optimizer(aprice, avar.all, aret.all, + gross.value=ten.thou, long.only=TRUE, + utility="mean-variance", utable=utab3, + quantile=.7) > op.q7$new.portfolio / 100 equities bonds cash 0.76 32.70 66.54 In this case (of only three utility values) the quantity it is minimizing is a weighted average of the two worst utilities. For this example there is very little change from the min-max solution (and it seems to go more conservative rather than less conservative). We can adjust the risk aversion but when utable is given, we need to use the force.risk.aver control argument. By default the risk aversion in the utility table that is passed in is taken as what is wanted. > op.q7r1 <- trade.optimizer(aprice, avar.all, aret.all, + gross.value=ten.thou, long.only=TRUE, + utility="mean-variance", utable=utab3, quantile=.7, + risk.aversion=.01, force.risk.aver=TRUE) > op.q7r1$new.portfolio / 100 equities bonds cash 1.52 65.41 33.07
161
14.8
Usually the full variance matrix is given. In the case of multiple variances, a three-dimensional array is given where the third dimension represents the dierent variance matrices. Computationally, full matrices are the fastest, so the preferred format is full matrices as long as the memory of the machine on which the optimization is being done is large enough. This will almost certainly be the case unless the universe of assets is very large, or there is a large number of variance matrices. caution Unlike when a full matrix is given, there is no mechanism when giving compact variances for the optimizer to ensure that the assets are in the correct order within the variance. The user needs to take full responsibility that the asset order in the variances is the same as in the prices argument. This includes benchmarks, which enjoy some automatic treatment when full matrices are given. In addition to full matrices, there are three formats supported: simple factor models. These have formula T + where is a matrix of loadings with dimensions that are the number of assets by the number of factors, and is a diagonal matrix containing the specic variances. This will most likely be the result of a statistical factor model. full factor models. These have formula T + where is a matrix of loadings with dimensions that are the number of assets by the number of factors, is the variance matrix of the factors among themselves, and is a diagonal matrix containing the specic variances. vech. This is the lower triangle of the variance matrix stacked by column. So there is the variance of the rst asset and the covariances of the rst asset with all of the other assets, then the variance of the second asset and the covariances of the second asset with assets 3 and onward, etc.
162
vartype: Always required. This is a vector of integers with length equal to the number of variance matrices represented. The integers indicate the type of representation for each matrixthere is no restriction on the combinations of types that may be used. The codes for the formats are: full matrix : 0 simple factor model : 1 full factor model : 2 vech: 3 nvarfactors: Required if any representation is a factor model (either simple or full). When given, this must have length equal to the length of vartype. The elements of this vector that correspond to a factor model must contain the number of factors for that model. varoffset: Never required, but may add some safety. This is an integer vector containing the oset for the start of each variance matrix within the vardata component. The rst element is zero, the second is the length of the data for the rst variance matrix, the third is the combined lengths of the rst and second variance matrices, etc. The length of this vector is the number of matrices given, so the element that would be the total length of vardata is not given. Here is an example of putting an object produced by factor.model.stat(from the BurStFin package) into this format: > varfac <- factor.model.stat(retmat, out="fact") > vdata <- c(t(varfac$sdev * varfac$loadings), + varfac$uniqueness * varfac$sdev^2) > vlist <- list(vardata=vdata, vartype=1, + nvarfactors=ncol(varfac$loadings)) The vdata object is a vector of the pertinent numbers in the correct order. Note that the loadings are rst multiplied by the standard deviations, then this matrix is transposed. Likewise the uniquenesses need to be scaled by the squared standard deviations. Since this is producing a simple factor model, the type of variance is 1. Finally, the number of factors is given as the number of columns of the loadings matrix. The vlist object is ready to be passed in as the variance argument. caution Be careful when annualizing a factor model representation. You want each element of the actual variance to be multiplied by some numbersuch as 252 but not every number in the representation is multiplied by the same thing.
Chapter 15
Dregs
This chapter contains topics that dont seem to belong anywhere else.
15.1
The pprobe.verify function is useful for seeing if the installation on a particular machine worked okay. It is far from a full test suiteit merely checks that all of the functions are present, and that one particular problem optimizes okay. This can also be used to see which version of Portfolio Probe is present.
15.2
The Objectives
The objective function of an optimization has a numeric value as its result, and a set of inputs that can be changed. The job of the optimizer is to nd the combination of inputs that achieves the best result for the objective function. By convention optimizers nd the minimum of objective functions. The optimizer in trade.optimizer follows the convention. The objective in trade.optimizer is the sum of two quantities: the negative of the utility (except minimum variance and minimum distance dont take the negative), and the penalty for broken constraints.. The penalty for a broken constraint is the appropriate element of penalty.constraint times a measure of how broken the constraint is. The penalties for all of the constraints are summed to get the penalty. The same process is done for generating random portfolios. The dierence is that the utility part is zero when random.portfolio is usedso there is only the penalty part that is minimized.
15.3
The C code that underlies the optimization and random portfolio generation can be incorporated into your own C or C++ programs. There are quite a few arguments to the C functions for optimization, and some of the arguments involve a number of variables. Thus it is far from trivial to write the program calling these functions. 163
164
Speed of execution is not a good reason to go completely to Cthe execution of the S code takes a fraction of a second and so is negligible for practical problems. Perhaps the most likely reasonable reason to stay all in C is to do real-time optimization within a C or C++ environment. However, doing realtime optimization in S is quite easy as shown on page 106. If you are determined, there is a function in the package that can help you write C code. The Cfrag.list function will write a le that has the C initialization of variables given in an S language list. The list that you would want to do this for is an intermediate result of an optimization. For example: > op.clist <- trade.optimizer(prices, varian, ..., + intermediate="Clist") > Cfrag.list(op.clist[-1:-3], file="test.c") The test.c le now holds a fragment of C code initializing most of the variables that are passed into the C function that does the optimization. This can be of use in two wayseither to initialize some variables that will always be the same (or at least mostly the same in your application), or to test your implementation as you are writing it. To create a C program, you essentially need to recreate the trade.optimizer function. The S language code for trade.optimizer is the documentation for how to write the C code.
15.4
Bug Reporting
Send email to [email protected] to report any bugs that you nd. A report should include: The operating system (including the version) that you are using. The version of R or S-PLUS that you are using. Give the version component (optimization) or attribute (random portfolios). For example: > op$version C.code S.code "portgen BurSt 1.17" "trade.optimizer 023" > attr(randport, "version") C.code S.code "randport BurSt 1.17" "random.portfolio 010" A full description of the problemwhat happens, and what you wanted to happen. If S creates an error, then give the results of the call: > traceback() This displays the state that S was in when the error occurred.
165
If possible, send the data for a small problem that reproduces the bug. If the bug is sporadic and uses random.portfolio, then also give the random seed from an object where the bug is observed: > attr(ranport, "seed") A reward is given to the rst to report a bug in the code or documentation. Ideas for improving the ease of use or functionality are always welcome.
166
Bibliography
[Almgren et al., 2005] Almgren, R., Thum, C., Hauptmann, E., and Li, H. (2005). Equity market impact. Risk. [Burns, 1998] Burns, P. (1998). S Poetry. http://www.burns-stat.com. [Burns, 2003a] Burns, P. (2003a). On using statistical factor models in optimizing long-only portfolios. Working paper, Burns Statistics, http://www.burnsstat.com/. [Burns, 2003b] Burns, P. (2003b). Portfolio sharpening. Working paper, Burns Statistics, http://www.burns-stat.com/. [Burns, 2009] Burns, P. (2009). The R Inferno. Tutorial, Burns Statistics, http://www.burns-stat.com/. [Burns et al., 1998] Burns, P., Engle, R., and Mezrich, J. (1998). Correlations and volatilities of asynchronous data. The Journal of Derivatives, 5(4). [diBartolomeo, 2003] diBartolomeo, D. (2003). Portfolio management under taxes. In Satchell, S. and Scowcroft, A., editors, Advances in Portfolio Construction and Implementation. ButterworthHeinemann. [Grinold and Kahn, 2000] Grinold, R. C. and Kahn, R. N. (2000). Active Portfolio Management. McGrawHill. [Gupton et al., 1997] Gupton, G. M., Finger, C. C., and Bhatia, M. (1997). CreditMetrics TM Technical Document. http://www.riskmetrics.com. [Kallberg and Ziemba, 1983] Kallberg, J. G. and Ziemba, W. T. (1983). Comparison of alternative utility functions in portfolio selection problems. Management Science, 29:12571276. [Michaud, 1998] Michaud, R. O. (1998). Ecient Asset Management. Harvard Business School Press. [Ulrich, 2009] Ulrich, J. (2009). TTR: Technical Trading Rules. http://CRAN.R-project.org/package=TTR. R package version 0.20-1.
167
Index
120/20 portfolio, 24, 48 abbreviation of function arguments, 34 abline function, 90 allowance argument, 47, 48 alpha table, 144145 alpha.constraint argument, 70 alphas, see expected returns amortize cost, 119 annualization information ratio, 94 risk aversion, 140, 141 argument abbreviation, 34 array three-dimensional, 153, 158, 161 as.matrix function, 109, 110 asset allocation, 99 assets used in optimization, 91, 101 asynchronous data, 111, 123 atable, 149 atable argument, 144 backtest, 29 bench.constraint argument, 7172, 94, 95, 153 benchmark add to variance, 112 alpha table, 144145 constraint, 9495, 122, 153 dual, 95, 150152 variance table, 144145 benchmark for long-short, 131132 benchmark-relative utility, 152 bug report, 164 build.constraints function, 58 BurStFin package, 110 buy-hold-sell list, 9697, 103 c function, 74, 76, 94 C or C++ code, 163164 168 call component, 136 cash ow, 9899, 105106 CAUTION, 39, 49, 51, 111, 113, 116 118, 161, 162 cbind function, 61, 129 Cfrag.list function, 164 cheatsheet, 126127 class function, 41 close.number argument, 75 comma-separated le, 40, 109, 114 compact variance representation, 161 162 compare to another program, 134 constraint absolute linear, 6364 all, 45 benchmark, 9495, 122, 153 building linear, 5863 close number, 75 cost, 75 count, 6769 country, 34 distance, 7274 expected return, 69, 147148 linear, 5767 liquidity, 5051, 55 long-side linear, 64 market capitalization, 62 monetary, 4548 numerical linear, 61 penalty for breaking, 7879, 113, 163 portfolio size, 93, 102 portfolio value, 106 quadratic, 7678, 120 sector, 34 short-side linear, 64 soft, 7879 sum of largest weights, 7475 threshold, 5253, 56
INDEX tracking error, 34, 7172, 94 95, 153 trade, 6364 trade number, 93 turnover, 93, 95, 102 unsatisable, 4142 utility, 42 variance, 70, 103, 145147, 154 violation, 6566 weight, 4975 constraint bounds, testing, 2122, 8486 constraints.realized function, 65 continuously compounded return, 111 conventions, typography, 17 Convergence, 125 costs, 115 amortization, 119 asymmetric, 115 constraint, 75 linear, 116 nonlinear, 116117 piecewise linear, 120 polynomial, 117118 power law, 118119 relative to expected returns, 119 120 relative to noise, 136 square root, 118119 taxes, 118, 120 count constraint, 6769 country constraint, 34 cov.wt function, 110 credit risk, 155156 csv le, 40, 109, 114 currency, 121 data frame, 109 debugger function, 130 debugging S, 129130 decay, time, 120 deport function, 40 deport functionexport function, 114 dest.wt argument, 148, 149 destination of utility, 149 weight, 148, 149 distance constraint, 7274 optimization, 9798
169 do.call function, 39 do.warn argument, 117, 124126 dollar neutral portfolio, 24, 48 doubleconst argument, 118 drop function, 109, 110 dual benchmarks, 95, 150152 enforce.max.weight argument, 49 error produced by S, 164 examples, 9299, 102106, 156160 existing argument, 123 expected returns, 93 multiple, 148, 150160 relative to costs, 119120 factor model, 153 full, 161 fundamental, 153 macroeconomic, 153 simple, 161, 162 statistical, 110112, 161 factor vs factor, 111 factor.model.stat function, 110, 162 fail.iter argument, 42, 136 feasible solution, 65 le read, 109 write, 40, 114 tted function, 112 force.risk.aver argument, 150 forced trade automatic, 49 positions argument, 56 forced.trade argument, 54, 104 funeval.max argument, 134 garbage, 121 GARCH, 154 gen.fail argument, 42 generating random portfolios, 3343 global data, 111, 123 gross value denition, 48 gross.value argument, 47 head function, 38 illiquid assets, 123 import data into S, 109110 information ratio annualizing, 94
170 denition, 93 maximize, 9395, 102, 139 init.fail argument, 41 intermediate argument, 164 intersect function, 61 inventory model, 118 iterations.max argument, 42, 134 leverage, 106 library function, 14 limit.cost argument, 75 lin.abs argument, 63 lin.bounds argument, 58 lin.constraint argument, 58 lin.direction argument, 64 lin.style argument, 5960, 68 lin.trade argument, 63 linear constraint, 5767 building, 5863 liquidity, 5051, 55, 115 log return, 111 long and short lists, 103 long value denition, 47 long-only portfolio, 9199 long-only with shorts, 131 long-short portfolio, 101108, 118 long-side linear constraint, 64 lot round, 45 size, 114 lower.trade argument, 50, 97, 104 machine memory, 161 market impact, 115 market risk, 155 max.weight argument, 49, 102, 128 maximize information ratio, 9395, 102, 139 maximum expected return utility, 141 maximum weight, 49 mean-variance optimization, 96, 104 mean-variance utility, 140 mean-volatility utility, 141 median optimization, 148 memory of the machine, 161 min-max optimization, 148, 150, 159 160 minimize tracking error, 92
INDEX minimize variance, 92 Minimum variance utility, 141 miniter argument, 42 missing values variance estimation, 112 monetary constraints, 4548, 106 multiple expected returns, 148, 150 160 multiple time periods, 154 multiple variances, 147, 150161 net asset value, 37 net value denition, 48 nonlinear costs, 116117 not optimizing starting solution, 133 134 ntrade argument, 51, 93, 97, 104 objective function, 142, 163 operating system, 164 optimization algorithm, 135 distance, 9798 mean-variance, 96, 104 median, 148 min-max, 148, 150, 159160 not doing, 133134 objective, 142, 163 Pareto, see min-max optimization real-time, 106108 utility-free, 9798 wrong, 121124 out.trade argument, 33, 34 Pareto optimization, see min-max optimization passive portfolio, 9293, 150152 penalty for broken constraint, 113, 163 performance measurement, 1921, 27, 8384, 8990 polynomial costs, 117118 port.size argument, 52, 93, 95, 102 portfolio long-only, 9199 long-short, 101108, 118 passive, 9293, 150152 position
INDEX too small, 128 positions argument, 54127 postions argument, 57 ppoints function, 90 pprobe.verify function, 163 prices argument, 91, 101 quadratic constraint, 7678, 120 quantile argument, 148, 153, 160 quotes in S, 74, 154 R language instance of S language, 14 R website, 13 random portfolio, 3343, 134 random portfolio summary, 39 random weights, 37 random.portfolio function, 3343, 45 random.portfolio.utility function, 42 randport.eval function, 3839, 66 rbind function, 129 read a le, 109 read.table function, 109 real-time optimization, 106108 rep function, 97, 103 repeat (S construct), 106 return continuously compounded, 111 log, 111 simple, 111 rights issue, 122 risk credit, 155156 market, 155 risk aversion parameter, 140, 150 risk model, testing, 24, 8689 risk tolerance, 140 risk.aversion argument, 150 rival forecasts, 153154 round lot, 45 S code note, 17, 33, 34, 39, 40, 61, 74, 76, 97, 103, 106, 109, 111, 136, 153, 154, 157, 158 S function abline, 90 as.matrix, 109, 110 build.constraints, 58 c, 74, 76, 94
171 cbind, 61, 129 Cfrag.list, 164 class, 41 constraints.realized, 65 cov.wt, 110 debugger, 130 deport, 40, 114 do.call, 39 drop, 109, 110 factor.model.stat, 110, 162 tted, 112 head, 38 intersect, 61 library, 14 ppoints, 90 pprobe.verify, 163 random.portfolio, 3343, 45 random.portfolio.utility, 42 randport.eval, 3839, 66 rbind, 129 read.table, 109 rep, 97, 103 summary, 39, 113 tail, 38 traceback, 130, 164 trade.optimizer, 9199, 101108 update, 136 valuation, 114 var.add.benchmark, 112 S language debugging, 129130 includes R, 14 multi-line command, 17 S-PLUS website, 13 scale.cost argument, 120 scenario analysis, 156160 sector constraint, 34 sense, making, 121 short value denition, 47 short-side linear constraint, 64 shrinkage, see Bayesian simple return, 111 small portfolio, 128 soft constraint, 7879 split, stock, 122 stale prices, 123 start not optimizing, 133134 start.sol argument, 123, 134
172 statistical factor model, 110112, 161, 162 stock split, 122 stringency argument, 133, 136 sum of largest weights constraint, 7475 summary function, 39, 113 tab-separated le, 109, 114 tail function, 38 taxes, 118, 120 three-dimensional array, 153, 158, 161 threshold argument, 5253, 127 threshold constraint positions argument, 56 time decay, 120 tol.positions argument, 57 traceback function, 130, 164 tracking error, 151 constraint, 34, 7172, 9495 minimize, 92 trade too trivial, 128 trade.optimizer function, 9199, 101 108 trading costs, see costs trading strategy, testing, 29, 90 transaction costs, see costs troubleshooting, 127129 turnover, see trade.value argument turnover argument, 47, 93, 95, 102, 105 txt le, 109, 114 typography conventions, 17 unchanging utility, 127 universe.trade argument, 50, 56, 97, 104, 105 update function, 136 upper.trade argument, 50, 97, 104 utable argument, 149, 160 utility benchmark-relative, 152 constraint, 42 denitions, 139 destination, 149 table, 149150, 159160 unchanging, 127 zero, 128
INDEX utility-free optimization, 9798 valuation function, 3537, 114 too small, 128 value constraints, 106 var.add.benchmark function, 112 var.constraint argument, 70 variance, 110112 adding benchmark, 112 argument, 161162 compact representation, 161162 constraint, 70, 103, 145147, 154 factor model, see factor model minimize, 92 missing value treatment, 112 multiple, 147, 150161 table, 144145, 149 vech representation, 161 vech representation, 161 version component, 164 violation of constraints, 6566 volume, average daily, 50, 51, 118 vtable argument, 144 warnings, suppressing, 124126 weight constraint, 4975 destination, 148, 149 sum of largest constraint, 74 75 write a le, 40, 114 wrong optimization, 121124 zero utility, 128