key: cord-0464431-ccxshym7
authors: Rosenzweig, Jan
title: Fat Tails and Optimal Liability Driven Portfolios
date: 2022-01-26
journal: nan
DOI: nan
sha: fb54352bcfc1bf38f0130343cf3e3018079dc5d6
doc_id: 464431
cord_uid: ccxshym7

We look at optimal liability-driven portfolios in a family of fat-tailed and extremal risk measures, especially in the context of pension fund and insurance fixed cashflow liability profiles, but also those arising in derivatives books such as delta one books or options books in the presence of stochastic volatilities. In the extremal limit, we recover a new tail risk measure, Extreme Deviation (XD), an extremal risk measure significantly more sensitive to extremal returns than CVaR. Resulting optimal portfolios optimize the return per unit of XD, with portfolio weights consisting of a liability hedging contribution, and a risk contribution seeking to generate positive risk-adjusted return. The resulting allocations are analyzed qualitatively and quantitatively in a number of different limits.

Portfolio optimization in the presence of fat tails has received a considerable amount of attention recently (see [7, 8] and references therein).

Briefly, it is known that, as the risk penalty moves from variance towards fat-tailed risk measures, the allocation becomes less dependent on the return of the component, and the resulting portfolio becomes more diversified [7]. In the limit of extremal risk measures, the portfolio allocation becomes perfectly diversified, with the return dependence reducing to a simple in-out step function [8].

In the context of LDI, and especially LDI for Solvency 2, there is also increased awareness of the importance of tail and extremal risk scenarios. While classical LDI considers portfolios that are optimal with respect to a variance-like risk measure such as the Value at Risk (VaR), there has been a considerable shift towards adopting tail-based risk measures such as the Expected Shortfall or the Conditional Value at Risk (ES, CVaR) [2].

In this paper, we look to join the two approaches by considering liability-driven portfolios with the algebraic tail and extremal risk measures of [7, 8]. We do this in the classical LDI scenarios of pension funds and insurers, where the liabilities consist of a series of fixed or almost fixed cash flows, but also in the context of derivatives books, where the liabilities consist of derivatives written to clients, in the form of either delta one products or options.

The paper is organised as follows: Section 2 goes through the general analysis of the optimization problem and its solutions. Section 3 briefly justifies the extremal risk limit. Section 4 focuses on the classical pension fund and insurance LDI scenarios. Section 5 deals with liabilities arising in client-facing derivative books, and Section 6 summarizes the conclusions.

The liability driven portfolio optimization problem is to find the optimal weights vector w = (w i ) of the asset vector A = (A i ), against a fixed liability portfolio L. Mathematically, assuming that the relevant moments exist and that they are finite, we are solving

for w, where E is the expectation operator, dA and dL denote the respective returns of the asset vector and the liability, λ > 0 is a fixed risk appetite, r A and r L are the respective asset and liability funding rate vector and scalar, and 2k is a positive even integer. The expectation is taken ex-ante at time t = 0, so that the current values of A and L are known, their future returns dA and dL are random, and we wish to influence the statistics of the returns distribution of the asset-liability portfolio. In abstract terms, the task is to maximise the returns of the joint portfolio, while minimizing its variability, for a range of measures of variability.

The choice of exponent as an even integer 2k restricts us to variance-like symmetric penalties, where large positive returns are penalised as heavily as large negative returns, as opposed to a skew-like penalty with a power of 2k − 1. The reason for this choice is twofold. First, any skew penalty term could become negative by switching from a long to a short holding. A negative skew term could not serve as an effective penalty, and the optimal portfolio would have weights that grow without bound. Second, the standard negative skew in financial time series corresponds to frequent small positive returns and infrequent large negative returns. While this is undesirable, its converse, frequent small negative returns and infrequent large positive returns, is generally not particularly desirable either. We therefore restrict ourselves to even exponents, while noting that it is nonetheless possible to extend the results to other exponents, as per [8].

By letting k > 1, we move away from variance as a measure of portfolio risk, and this allows us to capture fat-tailed [7] and extremal risk measures [8]. Our goal is to analyse the structure of the optimal portfolio as the risk measure changes from variance-like risk measures to fat-tailed and extremal risk measures.

Following [7] and [8], we solve (1) using an appropriate orthogonal decomposition of the asset vector, such as the ICA, kernel PCA or a neural network-based decomposition.

Denoting the resulting orthogonal components C 1 , C 2 , ... and their funding rates and expected rates of return r 1 , r 2 , ... and µ 1 , µ 2 , ..., we follow the analysis from [8] to write the approximate formal solutions for their respective weights w 1 , w 2 , ....

Briefly, in the direction of each C i , (1) is a polynomial in w i of order 2k, with up to k local maxima. We are looking for its global maximum.

The problem simplifies considerably in two limits. The first limit, when the return term is small compared to the risk appetite, (µ i − r i )/2kλ ≪ 1, at leading order gives a simple polynomial with a single maximum with multiplicity 2k, reached at the weight w i that minimizes the 2k-distance between w i dC i and dL. This solution is known analytically [4], and the global maximum is then found by its perturbation expansion in powers of (µ i − r i )/2kλ. Its first two terms are

where σ i denotes the standard deviation of C i , ρ i its correlation with the liability, σ L is the standard deviation of the liability, and L i is the part of the liability that is orthogonal to C i ,

The other limit of interest is when the return term is large compared to the risk appetite, (µ i − r i )/2kλ ≫ 1. The analysis is then analogous to that described in [8]; the global maximum is reached approximately where the leading term w_i^{2k} balances the linear term w_i, and the result is

where dL̃ = dL − r L dt is the driftless part of dL, so that E(dL̃) = 0. Note that (2) and (3) only correspond to the same local maximum when k = 1. Otherwise, they are different local maxima, whose status as a global maximum changes discontinuously as the risk tolerance parameter changes.

The interpretation of the two maxima is straightforward; (2) minimizes the risk generated by the liability, while (3) maximizes the return of the combined asset-liability portfolio. We therefore refer to (2) as the risk-avoiding allocation, and to (3) as the return-seeking allocation.

Both solutions (2) and (3) indicate two sources of allocation to a risky asset: a return term, allocated in accordance with its return and the risk appetite λ, and a hedge, a risk-tolerance-independent, return-independent term depending on its cross moments with the liability.

No simplifying assumptions have been made on the nature of any of the processes, other than that the relevant expectations exist and are finite.

For the classical case k = 1, the O(1) error term in (3) vanishes, and (3) simplifies to the usual formula

comprising a risky allocation in accordance with the volatility-normalised Sharpe ratio, and the liability hedging allocation according to the hedge ratio. (4) is, of course, the same as (2) for k = 1, and the risk-avoiding and the return-seeking allocations are one and the same. Qualitatively, the risk-avoiding allocation (2) is independent of k. The allocation consists of the standard linear hedge ρ i σ i /σ L and a risk-adjusted term proportional to the volatility-normalised Sharpe ratio. The only dependence on k comes in, at the leading two orders in the asymptotics, through the rescaling of the return-seeking term by a constant depending on k, which can be absorbed into rescaling the risk appetite at the level of a single component. We therefore omit further analysis of (2) in this paper.

The goal of this paper is to analyze return-seeking allocations (3) for k ≥ 1, and in the limit of k → ∞.

In this section, we elaborate on the choice of the penalty function, which is taken as the 2k'th central moment of the returns distribution. As discussed above, this is a straightforward generalization of the variance, and the value of k = 1 corresponds exactly to variance. As the value of k increases, the penalty function becomes more sensitive to the tails of the distribution.

An additional interpretation of the penalty function is found in the limit of k → ∞. Say

Then

where all |dx i /dx ∞ | ≤ 1. See Figure 1 for a graph of the power-law function applied to each dx i /dx ∞ . In essence, taking a small number to a high power works as a high-pass amplitude filter. Numbers close to 0 vanish, and only numbers in a band around ±dx ∞ remain. The band becomes narrower as k increases.

In the k → ∞ limit,

where n 0 is the number of points on which the maximum norm is reached. We therefore get a straightforward interpretation of our high order moments as first order estimates of extreme de-meaned returns. The limit k → ∞ is in itself a risk measure, belonging to the family of deviation risk measures [5], which we call Extreme Deviation (XD). It is the expected maximum absolute return of the de-meaned returns distribution, i.e. the maximum absolute deviation around the mean.
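The convergence of the high order moments to the maximum absolute deviation can be checked numerically. The following is a minimal sketch, not the paper's estimator: it assumes that the sample analogue of the 2k'th moment measure is the 2k'th root of the sample central moment, and that the sample analogue of XD is the largest absolute de-meaned return. The function names and the Student-t test sample are hypothetical.

```python
import numpy as np

def moment_deviation(returns, k):
    """2k'th root of the 2k'th sample central moment (assumed sample analogue)."""
    d = np.asarray(returns, dtype=float)
    d = d - d.mean()
    scale = np.abs(d).max()
    if scale == 0.0:
        return 0.0
    # Dividing by the maximum first keeps (.)**(2k) from overflowing or vanishing entirely.
    return scale * np.mean(np.abs(d / scale) ** (2 * k)) ** (1.0 / (2 * k))

def extreme_deviation(returns):
    """Largest absolute de-meaned return (assumed sample analogue of XD)."""
    d = np.asarray(returns, dtype=float)
    return np.abs(d - d.mean()).max()

# Hypothetical fat-tailed sample: 1000 Student-t returns scaled to roughly 1% moves.
rng = np.random.default_rng(0)
sample = 0.01 * rng.standard_t(df=3, size=1000)

for k in (1, 5, 10, 50, 200):
    print(f"k = {k:3d}: {moment_deviation(sample, k):.5f}")
print(f"XD     : {extreme_deviation(sample):.5f}")
```

For k = 1 the function returns the sample standard deviation. As k grows, the value increases monotonically towards the sample XD, and the gap is governed by the factor (n 0 /n)^(1/2k), where n is the sample size.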

A comparison between VaR, CVaR and XD on the daily returns of the SPX from 2017-2021 is shown in Figure 2 .

Briefly, XD is an unconditional extremal risk measure such that, for any random variable Z,

for any probability p. Its importance in our context is that (i) it can be estimated robustly from the high order moments, and (ii) it provides the interpretation of the optimal portfolio in the limit of k → ∞ as the portfolio with maximum return per unit of XD.

There is a qualitative change of behaviour in the character of the optimization problem (1) as the value of the exponent k increases from k = 1 towards k = ∞. Specifically, for k = 1, the optimization is a least squares problem; for k > 1 it is 2k-norm optimization; and, for k = ∞, it is the ∞-norm, or minimax optimization.
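The change from least squares to minimax can be illustrated on a single hedging component. The sketch below is an illustration under stated assumptions rather than the paper's solution method: it minimizes the sample p-norm of the residual w·x − y over a scalar weight w, with p = 2k, using a ternary search that relies only on the convexity of the norm. The names `x`, `y`, `pnorm_residual` and `hedge_weight`, the search bracket and the simulated data are all hypothetical.

```python
import numpy as np

def pnorm_residual(w, x, y, p):
    """Sample p-norm (normalised by sample size) of the hedged residual w*x - y."""
    r = np.abs(w * x - y)
    m = r.max()
    if m == 0.0 or np.isinf(p):
        return m
    return m * np.mean((r / m) ** p) ** (1.0 / p)

def hedge_weight(x, y, p, lo=-10.0, hi=10.0, iters=200):
    """Ternary search for the minimiser; valid because the p-norm is convex in w."""
    for _ in range(iters):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if pnorm_residual(a, x, y, p) < pnorm_residual(b, x, y, p):
            hi = b
        else:
            lo = a
    return 0.5 * (lo + hi)

# Hypothetical data: a fat-tailed component x and a liability y partly explained by it.
rng = np.random.default_rng(1)
x = 0.01 * rng.standard_t(df=4, size=2000)
y = 0.8 * x + 0.005 * rng.standard_t(df=4, size=2000)

for p in (2, 4, 10, 100, np.inf):
    print(p, hedge_weight(x, y, p))
```

For p = 2 the result agrees with the least squares slope x·y / x·x. For p = ∞ the weight minimizes the single worst residual, so it is driven by the extreme observations only.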

To get some sense of how the optimal allocation depends on the risk appetite, we can look at (3) a bit more closely as a function of µ. The graph of (3) for varying µ and k is shown in Figure 3. By inspection, we can see that the cross moment term effectively generates a correction to the hurdle rate r, given as

and (3) can then be rewritten as

In other words, the weight profile in terms of the return is still, as shown in Figure 3, the familiar pattern from [8] of a hockey stick collapsing to a step function as k progresses from 1 to ∞, but, this time, with a hurdle rate that decreases exponentially as k increases.

For delta one products, the calculation of the ratio of the moments E[(dC_j − µ_j dt) dL^{2k−1}] and E[(dC_j − µ_j dt)^{2k}] is straightforward if the underlying processes are Gaussian [1]; using

and orthogonalizing

We can substitute (10) back into (3) to get the full allocation weight as

For large k and fixed λ, the hedge ratio simplifies to

in other words, this limit has an "effective correlation" used for hedging, equal to the actual correlation raised to the power of 1/(2k − 1). The profile of this effective correlation is shown in Figure 4. The effective correlation term in the hedging ratio (12) has a simple limiting value for the weight w i as k → ∞ with λ fixed of

so all positive correlations are effectively treated as constant and equal to 1, all negative correlations likewise as constant and equal to −1, and correlation 0 remains 0. This is somewhat in line with long-standing trading practice [3]. In practice, one would select a finite-width band around correlation 0 where the effective correlation would be deemed too small to count, and it would be set to ±1 outside this band. We note that the limits (12) and (13) break the conditions under which (3) is the global maximum of (1), and that this simple effective correlation therefore never actually appears as a hedging correlation. It is therefore useful as an intuitive shorthand, but it should not be used as an actual formula to construct portfolios.
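The collapse of the effective correlation towards ±1 is easy to tabulate. The sketch below evaluates the correlation raised to the power 1/(2k − 1), taking the sign-preserving root for negative correlations; that choice is an assumption consistent with the limiting values described above, and the function name is hypothetical. In line with the caveat above, it is meant only as an intuition aid, not as a portfolio construction formula.

```python
import numpy as np

def effective_correlation(rho, k):
    """Sign-preserving (2k-1)'th root of the correlation (assumed convention)."""
    rho = np.asarray(rho, dtype=float)
    return np.sign(rho) * np.abs(rho) ** (1.0 / (2 * k - 1))

rhos = np.array([-0.9, -0.1, 0.0, 0.1, 0.5, 0.9])
for k in (1, 2, 5, 50, 500):
    print(k, np.round(effective_correlation(rhos, k), 4))
```

At k = 1 the effective correlation equals the actual correlation. By k = 500 even a correlation of 0.1 maps to a value above 0.99, while a correlation of exactly 0 stays at 0.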

The correct limit should ensure that (3) remains the global maximum; it is taken by using (6) with the effective hurdle rate

and it results in a flat positive allocation with no hurdle rate.

We now turn to the case where the liability is an option on the hedging underlyings S 1 , S 2 , ..., with its value denoted Ω(S 1 , S 2 , ...). Itô's lemma gives the dynamics of Ω as

where we now introduce additional terms for α i , the vol-of-vol of S i , and the stochastic vol factor dZ i , and the summation convention applies. The smiles come from the vol-of-vol parameters α i , and the skews come from the spot-vol correlations,

The randomness of the option process comes from the randomness of the spot processes, mediated by the respective deltas, and the randomness of the vol processes, mediated by the respective vegas.

Orthogonalizing the variance factor as

we go through the same motions as in (9), and the moment ratio term comes out as

In other words, the moment ratio has a component equal to the skew-adjusted delta. The skew adjustment is the usual product of spot-vol correlation, vol-of-vol to vol ratio, and the option vega. Plugging this back into (3), we get the full weight as

The classical case of k = 1 reduces to

and the (incorrect) limit of large k, fixed λ is analogous to (12), as

The effective hurdle rate for (6) is

7 Examples

We use an example of an actual liability profile of a Middle-Eastern pension fund, depicted in Figure 5 . The hedging universe consists of ten iShares bond ETFs, shown in Figure 6 . The ETF universe was chosen to cover the USD treasury curve, USD corporate bonds and USD high yield bonds. The time period under observation was 3 years, from 20th February 2019 until 19th February 2022. This time period includes the time of significant shock in the bond markets in March 2020 due to the Covid-19 pandemic, which serves as an example of a fat-tailed event.

We applied ICA decomposition to the return vectors of these ten bond ETFs to generate the independent components. The ICA implementation used was FastICA from the Python library scikit-learn.

Even though there are ten ETFs in the universe, the rank of their return vectors is only 9, indicating that one is redundant. Consequently, ICA returned nine independent factors. Their weights and trajectories are shown in Figure 7 .
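The rank check and the decomposition step can be sketched as follows. This is an illustrative reconstruction on simulated data, not the code used for the results above: the ETF returns are replaced by a hypothetical matrix of ten series spanned by nine fat-tailed sources, and the FastICA settings (`random_state`, `max_iter`) are assumptions.

```python
import numpy as np
from sklearn.decomposition import FastICA

rng = np.random.default_rng(0)
n_days, n_sources, n_assets = 750, 9, 10

# Hypothetical stand-in for the ETF returns: ten series mixed from nine
# independent Laplace-distributed sources, so one series is redundant.
sources = rng.laplace(size=(n_days, n_sources))
mixing = rng.normal(size=(n_sources, n_assets))
returns = 0.01 * sources @ mixing

# The numerical rank of the de-meaned returns gives the number of usable factors.
rank = int(np.linalg.matrix_rank(returns - returns.mean(axis=0)))

# Extract as many independent components as the rank allows.
ica = FastICA(n_components=rank, random_state=0, max_iter=1000)
components = ica.fit_transform(returns)   # shape (n_days, rank): component trajectories
weights = ica.components_                 # shape (rank, n_assets): weights on the assets

print(rank, components.shape, weights.shape)
```

The rows of `weights` play the role of the component weights, and the columns of `components` play the role of the component trajectories. The ordering and sign of independent components are not identified by ICA, so any labelling of the factors has to be done by inspection.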

As seen from Figure 7 , the ICs all have a straightforward interpretation, namely: IC6 describes the short end of the treasury curve; IC3 describes the parallel shift of the 1y7y treasury curve; IC7 describes the 1y10y steepener of the treasury curve; IC1 describes the 10y part of the treasury curve; IC4 describes the 20y part of the treasury curve; IC2 describes the short end of the corporate curve; IC9 describes the long end of the corporate curve; IC8 describes the short end of the high yield curve; and IC5 describes the long end of the high yield curve.

The liability profile was discounted using treasury yields to obtain its NPV. We composed optimal portfolios according to (3) for values of k = 1, 5, 10, 50 and the risk appetite λ = 1%. The trajectories of the resulting portfolios are shown in Figure 8. As seen from Figure 8, all choices of k significantly lower the volatility of the liability portfolio, and this effect improves with increasing k. This is almost entirely due to high k absorbing the Covid-19 related shocks in March 2020. The trajectories for increasing k converge rapidly, and the curves for k = 10 and k = 50 are nearly indistinguishable from each other.

A closer look at the weights distribution in Figure 8 reveals that the most significant effect of increasing k is adding allocations to corporate and high yield factors contained in IC2, IC6, IC7 and IC8. The liability is discounted at treasury yields, so corporate and high yield factors contribute diversification, rather than hedging.

There is a small curiosity in the fact that one high yield factor, IC5, does find its way into the variance minimizing k = 1 portfolio, instead of, say, the comparable treasury factor in IC7. The explanation for this is somewhat technical; namely, the 1y7y steepener in IC7 does not contribute to hedging due to the straight line shape of the liability profile over the entire 1y20y bucket, and it is only added to high k portfolios as a diversifier. The appearance of the 5y high yield factor in IC5 is an unrelated artefact of the selected time period of observation.

We compared the PnL of daily delta hedging to that of a hedging strategy with weights given by (19).

For delta hedging, option deltas were calculated based on the daily Close, and delta hedging was also performed at the same daily Close. A fixed trading cost of 0.06% was applied to all transactions. While this assumes some forward knowledge on the part of the book runner, who trades at the same price that the delta was derived from, it is a reasonable assumption in practice: the deltas would be calculated from market prices prior to the Close and, for liquid stocks, they would not significantly differ from the deltas obtained from the Close. The delta used for the delta hedging was the simple Black-Scholes delta at the option strike, with the implied volatility taken as the average of the call and the put volatility derived from the option close prices.
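The delta hedging benchmark described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the code behind the reported PnLs: it assumes zero interest rate and no dividends unless a rate is passed in, it applies the 0.06% cost to the traded notional, and all function names and the numerical inputs in the example are hypothetical.

```python
from math import erf, log, sqrt

def norm_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + erf(x / sqrt(2.0)))

def bs_d1(spot, strike, t, vol, rate=0.0):
    """Black-Scholes d1 for time to expiry t in years (no dividends assumed)."""
    return (log(spot / strike) + (rate + 0.5 * vol ** 2) * t) / (vol * sqrt(t))

def call_delta(spot, strike, t, vol, rate=0.0):
    return norm_cdf(bs_d1(spot, strike, t, vol, rate))

def put_delta(spot, strike, t, vol, rate=0.0):
    return call_delta(spot, strike, t, vol, rate) - 1.0

def short_straddle_hedge(spot, strike, t, call_vol, put_vol, rate=0.0):
    """Shares to hold against a short call plus short put at the same strike."""
    vol = 0.5 * (call_vol + put_vol)   # average of call and put implied vols
    option_delta = -(call_delta(spot, strike, t, vol, rate)
                     + put_delta(spot, strike, t, vol, rate))
    return -option_delta               # offsetting position in the underlying

def rebalance_cost(old_shares, new_shares, price, cost_rate=0.0006):
    """Fixed proportional cost of 0.06% on the traded notional."""
    return cost_rate * abs(new_shares - old_shares) * price

# Hypothetical inputs: at-the-money, 40 calendar days to expiry, 30% and 32% vols.
hedge = short_straddle_hedge(spot=170.0, strike=170.0, t=40 / 365,
                             call_vol=0.30, put_vol=0.32)
print(hedge, rebalance_cost(0.0, hedge, 170.0))
```

For an at-the-money straddle the hedge is small but positive, because the Black-Scholes call delta at the strike sits slightly above one half.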

For the risk-seeking component in (19), we used the mean return and moments of the daily returns of the underlying from the two months prior to the option being written, namely 3rd January 2022 until 25th February 2022. The risk appetite parameter was taken as λ = 1%. The exponents were k = 1, 5, 10, 50, 100. The expected return based on the 3rd January to 25th February bucket was negative for AAPL, MSFT and GM, and positive for GE and XOM. Realised return in the 28th February to 8th April bucket was negative for GE and GM, and positive for AAPL, MSFT and XOM.

The resulting hedge weights for zero option delta are shown in Figure 9 , and the PnL trajectories are shown in Figure 10 . The modified hedge (19) generally produced lower PnL volatility than pure delta hedging. The typical picture is that the modified hedge performs very similarly to the delta hedge when k = 1, and the PnL volatility then decreases with increasing k. For increasing values of the penalty exponent k, the modified hedging PnLs showed marked convergence. Higher exponents k = 50 and k = 100 were remarkably close in all examples, indicating that the effect of increasing k levels off reasonably quickly. This is true both on the level of weights, as seen in Figure 9 , and PnL trajectories, as seen in Figure 10 .

The resulting hedging PnL is noticeably higher where the guess of the drift term was correct, namely XOM. It was similarly higher for GM, where the guess was incorrect. For the remaining symbols, the final PnL was markedly close for all trajectories.

The methods presented here are intended to shed new light on optimal portfolio construction in the presence of fat tails, especially in the context of liability-driven portfolios. The portfolios considered include classical LDI pension fund and insurance portfolios, with a stream of essentially fixed liabilities, but also the sort of portfolios that arise in client-facing derivatives books, where the liabilities consist of delta one products or options written to clients.

The risk measures employed are a continuous family parametrized by a single parameter k, and they range from variance, when k = 1, to maximum-drawdown-like measures as k → ∞. These risk measures were studied extensively in [8], but not in the context of liability-driven portfolios.

The limit of k → ∞ results in a new risk measure, XD. XD is significantly more sensitive to extreme returns than CVaR, and yet it can be estimated robustly as the limit of high order moments. Our optimal portfolios in the k → ∞ limit optimize the return per unit of XD.

The general pattern, which holds across all the examples we show, is summarized in equation (3) . Briefly, the weight of an orthogonal component is comprised of a risk term, proportional to the ratio of the return to the relevant moment of the component returns and the risk appetite parameter, and a hedging term neutralizing as much of the liability as possible, all raised to the power of 1/(2k − 1).

The limit of k → ∞ reveals some non-trivial results, namely that the liability term reduces the hurdle rate for the component return, and that the resulting effective hurdle rate tends exponentially to −∞ for k → ∞.

The actual choice of k in a practical situation is subject to multiple trade-offs. The large k limit is of obvious interest, due to its link to the maximum drawdown. The flip side is that statistics in this limit are difficult to estimate. Leaving aside whether it is even theoretically possible to predict future extremals of an unknown distribution from its history, we saw in practical examples that the convergence is reasonably rapid. In all examples, there is a rapid change in the optimal portfolio as k increases from k = 1 to k ∼ 10, followed by convergence by k ∼ 100. We can therefore reasonably conclude that the limit, insofar as it can be estimated from the historical distribution, converges rapidly.

On the other hand, the theory behind the k → ∞ limit can be summarised in five words: "to manage drawdowns, diversify everything", even if we cannot accurately estimate moments and extremals.

[1] A general Kronecker formula for the moments of the multivariate normal distribution, Cahiers de recherches economiques 9002

[2] Liability-driven Investors

[3] Frictionless asset allocation with elliptically symmetric distributions of returns

[4] An Analytical Solution to the Minimum L p -Norm of a Hyperplane

[5] Deviation Measures in Risk Analysis and Optimization

[6] The Estimation of Probability of Extreme Events for Small Samples

[7] Fat-tailed factors

[8] Power-law Portfolios

We looked at the examples of exchange traded vanilla options on five NYSE underlyings: AAPL, MSFT, GE, GM and XOM. All options were written on the 28th February 2022, and expired on the 8th April 2022. The option position considered was a short ATM straddle based on Close prices on the 25th February, which produced the strikes 170 for AAPL, 300 for MSFT, 95 for GE, 47 for GM, and 78 for XOM. The tickers for the options used are AAPL220408C00170000, AAPL220408P00170000, MSFT220408C00300000, MSFT220408P00300000, GE220408C00095000, GE220408P00095000, GM220408C00047000, GM220408P00047000, XOM220408C00078000, and XOM220408P00078000.