key: cord-0208366-vjbwkhx2
authors: Bellini, Fabio; Fadina, Tolulope; Wang, Ruodu; Wei, Yunran
title: Parametric measures of variability induced by risk measures
date: 2020-12-09
journal: nan
DOI: nan
sha: 8e093948043c60465739e6413aed422f5907bf59
doc_id: 208366
cord_uid: vjbwkhx2

We present a general framework for a comparative theory of variability measures, with a particular focus on the recently introduced one-parameter families of inter-Expected Shortfall differences and inter-expectile differences, that are explored in detail and compared with the widely known and applied inter-quantile differences. From the mathematical point of view, our main result is a characterization of symmetric and comonotonic variability measures as mixtures of inter-Expected Shortfall differences, under a few additional technical conditions. Further, we study the stochastic orders induced by the pointwise comparison of inter-Expected Shortfall and inter-expectile differences, and discuss their relationship with the dilation order. From the statistical point of view, we establish asymptotic consistency and normality of the natural estimators and provide a rule of the thumb for cross-comparisons. Finally, we study the empirical behaviour of the considered classes of variability measures on the S&P 500 Index under various economic regimes, and explore the comparability of different time series according to the introduced stochastic orders.

Several measures of distributional variability are widely used in statistics, probability, economics, finance, physical sciences, and other disciplines. In this paper, we study a general theory of variability measures with an emphasis on three symmetric one-parameter families generated by popular parametric risk measures: Value-at-Risk (VaR), Expected Shortfall (ES), and expectiles. The corresponding induced variability measures are the inter-quantile difference, the inter-ES difference, and the inter-expectile difference. While the first one is a classical measure of statistical dispersion widely used e.g. in box plots, the other two are, to the best of our knowledge, relatively new: the inter-ES difference appears in Example 4 of Wang et al. (2020b) as a signed Choquet integral, and the inter-expectile difference has been studied in Bellini et al. (2020) via a connection to option prices. The present paper is a first unifying study, focused on their comparative qualitative and quantitative properties.

The mathematical theory of risk measures is extensive, and a standard reference is Föllmer and Schied (2016) . As it is well-known, VaR is simply a quantile and ES is a coherent risk measure in the sense of Artzner et al. (1999) . Both VaR and ES are implemented in current banking and insurance regulation frameworks; we refer to McNeil et al. (2015) for a comprehensive background and Wang and Zitikis (2021) for a more recent account. Expectiles, originally introduced in the statistical literature by Newey and Powell (1987) , have received an increasing attention in risk management, as it has been shown that they are the only elicitable coherent risk measures (Ziegel (2016) ). We refer e.g. to Bellini et al. (2014) and Bellini and Di Bernardino (2015) for more on the theory and financial applications of expectiles. For a comparison of the above risk measures in the context of regulatory capital calculation, see Embrechts et al. (2014) and Emmer et al. (2015) .

The theory of variability measures has been studied from different angles; see David (1998) for a review in the context of the measurement of statistical dispersion. A mathematical formulation closer to our setting is the notion of deviation measure introduced in Rockafellar et al. (2006) , and further developed by Grechuk et al. (2009 Grechuk et al. ( , 2010 . A similar notion of variability measure was proposed by Furman et al. (2017) with an emphasis on the Gini deviation. We will explain in Section 2 the differences between our general definition and the ones given in the literature; in particular, the inter-quantile difference does not satisfy the definition of deviation measure of Rockafellar et al. (2006) due to its lack of convexity.

Our main contribution is a collection of results towards a general theory of variability measures, with particular emphasis on the three parametric classes mentioned above. Various novel properties are studied to underline the special role these measures play among other variability measures. Since statistical inference for VaR, ES, and expectiles is well developed (see e.g. Shorack and Wellner (2009) for VaR and Krätschmer and Zähle (2017) for the expectiles), the estimation of the corresponding variability measures is quite straightforward.

The rest of the paper is organized as follows. In the remainder of this section, we introduce some notation. The definitions of the three classes of variability measures induced by VaR, ES, and the expectiles is presented in Section 2, with some basic properties. In Section 3, we summarize many properties of some common variability measures which are arguably desirable in practice. A characterization result of these measures is established. The stochastic ordering of the three classes of variability measures based on pointwise comparison is discussed in 4. In Section 5, we discuss non-parametric estimation of the three classes of variability measures.

We obtain the asymptotic normality and the asymptotic variances explicitly for the empirical estimators. It may be undesirable and financial unjustifiable to choose the same probability level for the three classes of variability measures induced by VaR, ES, and the expectiles; see Li and Wang (2019) for a detailed analysis on plausible equivalent probability levels when ES is to replace VaR. A simple analysis of a cross-comparison of an equivalent probability level for the variability measures using different distributions is carried out in Section 6. A small empirical analysis using the variability measures on the S&P 500 index is conducted in Section 7, where we observe the differences between these variability measures during different economic regimes. Further, we explore the symmetric variability orders between log-returns of Facebook and Berkshire Hathaway in 2020. In Section 8, we conclude the paper with some discussions on the suitability of the three classes in different situations. Appendix A contains a list of classic variability measures, and proofs of all results are put in Appendix B.

Notation. Throughout the paper, L q is the set of all random variables in an atomless probability space (Ω, A, P) with finite q-th moment, q ∈ (0, ∞), and L ∞ is the set of essentially bounded random variables. X = L 0 is the set of all random variables, and M is the set of all distributions on R. For any X ∈ L 0 , F X represents the distribution function of X, F −1 X its left-quantile function, and U X is a uniform random variable such that F −1 X (U X ) = X almost surely. The existence of such a U X for any X is given, for example, in Lemma A.32 of Föllmer and Schied (2016) . Two random variables X and Y are said to be comonotonic if there exist two increasing functions f, g : R → R such that X = f (X + Y ) and Y = g(X + Y ). We write X d = Y if X and Y have the same distribution. In this paper, the terms "increasing" and "decreasing" are meant in the non-strict sense.

Generally speaking, a variability measure is a functional ν : X → [0, ∞] that quantifies the magnitude of variability of random variables. In order for our definition to be as general as possible, we only require three natural properties.

Definition 1. A variability measure is a functional ν : X → [0, ∞] satisfying the following properties.

(A1) Law invariance: if X, Y ∈ X and X d = Y , then ν(X) = ν(Y ).

(A2) Standardization: ν(m) = 0 for all m ∈ R.

(A3) Positive homogeneity: there exists α ∈ [0, ∞) such that ν(λX) = λ α ν(X) for any λ > 0 and X ∈ X . The number α is called the homogeneity index of ν. random variables, such as the Gini coefficient or the relative deviation (see Appendix A). In this paper, we do not deal with these cases, although our definition can be easily amended to include them by replacing X with a positive convex cone. We call the set X ν = {X ∈ X : ν(X) < ∞} the effective domain of ν.

Remark 1. A deviation measure in the sense of Rockafellar et al. (2006) satisfies, in addition to (A2) and (A3) with homogeneity index 1, also subadditivity and strict positivity for nonconstant random variables. As we will see in Section 3, the latter two properties are not satisfied by the inter-quantile difference. For this reason, our more general definition is more suitable here than the one of Rockafellar et al. (2006) . Alternatively, Furman et al. (2017) required location-invariance instead of positive homogeneity, but this property is not satisfied by relative variability measures. Thus, we identify (A1), (A2), (A3) as the defining properties of a variability measure, and all other properties, such as location invariance and subadditivity, will be additional properties that may or may not be satisfied, as we will discuss in see Section 3.

Remark 2. In applications, we may choose the domain X of a variability measure as a convex cone contained in L 0 . For risk measures, the domain plays an essential role, which is often chosen as a general convex cone containing L ∞ , because many risk measures cannot be naturally extended to L 0 ; see e.g., Filipović and Svindland (2012) . For variability measures defined on a convex cone X ⊂ L 0 , since it takes non-negative values (thus, no issues with ∞ − ∞ which occur for some risk measures), we could always extend the domain by mapping L 0 \ X to {∞} without affecting the properties studied in this paper.

Value at Risk (VaR), Expected Shortfall (ES) and expectiles are very popular financial risk measures (see e.g. Embrechts et al. (2014) and Emmer et al. (2015) ). We recall the basic definitions below.

(i) The right-VaR (right-quantile): for p ∈ (0, 1),

The left-VaR (left-quantile): for p ∈ (0, 1),

(ii) The ES: for p ∈ (0, 1),

The left-ES: for p ∈ (0, 1),

(iii) The expectile: for p ∈ (0, 1),

In the above, Q p and Q − p are finite on L 0 , while ES p , ES − p and ex p are finite on L 1 . We only define expectiles on L 1 since generalizing them beyond L 1 is not natural; on the other hand, ES can be naturally defined on a set larger than L 1 by taking possibly infinite values.

We now introduce the variability measures induced by the aforementioned risk measures, that are the main object of the paper.

(i) The inter-quantile difference: for p ∈ [1/2, 1),

It is obvious that ∆ Q p is finite on X = L 0 .

(ii) The inter-ES difference: for p ∈ (0, 1),

Here, ES p takes values in (−∞, ∞], and ES − 1−p takes values in [−∞, ∞), and hence the above ∆ ES p is well defined on X .

(iii) The inter-expectile difference: for p ∈ (1/2, 1),

and we set by definition ∆ ex p (X) = ∞ for X ∈ X \ L 1 .

We consider also the limiting cases

which is the range functional, and it is simply denoted by ∆ 1 . Both ∆ Q p and ∆ ES p belong to the class of distortion riskmetrics (Wang et al. (2020a,b) ), with many convenient theoretical properties. On the other hand, ∆ ex p does not belong to this class, but it also has several nice properties, inherited from those of expectiles.

In Theorems 1-2 and Table 1 below, the range of p is p ∈ [1/2, 1) for ∆ Q p , p ∈ (1/2, 1) for ∆ ex p , and p ∈ (0, 1) for ∆ ES p .

Theorem 1. For each p, the following statements hold.

(i) ∆ Q p , ∆ ES p , ∆ ex p and ∆ 1 are variability measures.

(ii) The effective domains of ∆ Q p , ∆ ES p , ∆ ex p and ∆ 1 are L 0 , L 1 , L 1 , and L ∞ , respectively.

(iii) Each of ∆ Q p , ∆ ES p and ∆ ex p is increasing in p.

(iv) For each X ∈ X , the following alternative formulations hold:

It is straightforward to check that for p = 1/2, ∆ ES p is equal to two times the mean median deviation (see Appendix A, item (v) ). The next proposition shows that it suffices to consider p ∈ [1/2, 1), as we will tacitly assume in most results of the next sections.

In this section, we study comparative advantages of ∆ Q p , ∆ ES p and ∆ ex p , among with several other measures of variability, namely the standard deviation (STD), the variance, the mean absolute deviation (MAD), the Gini deviation (Gini-D), and the range; see Appendix A for the definition of these classic variability measures.

We consider the following additional properties of a variability measure ν, which are all arguably desirable in some situations. In what follows, cx is the convex order, defined by

for all convex φ : R → R such that the above two expectations exist.

(B1) Relevance: ν(X) > 0 if X is not a constant, and there exists β ∈ (0, ∞) such that ν(X) β for all X ∈ X with |X| 1. In Table 1 below, α represents the homogeneity index. Table 1 shows properties of different variability measures including the inter-quantile, inter-ES, and inter-expectile differences, as well as the aforementioned classic variability measures.

Theorem 2. The statements in Table 1 hold true.

The proof of Theorem 2, thus checking the properties in Table 1 , relies on several existing results on properties of risk measures and distortion riskmetrics from Newey and Powell (1987) , Bellini et al. (2014 , Liu et al. (2020) and Wang et al. (2020a) .

Notably, the inter-ES difference satisfies all properties (B1)-(B8), along with the Gini deviation and the range. Next, we establish that any variability measure satisfying (B1)-(B8)

admits a representation as a mixture of ∆ ES p for p ∈ (0, 1].

Theorem 3. The following statements are equivalent for a variability measure ν : X → [0, ∞]: (i) ν satisfies (B1)-(B8).

(ii) ν satisfies (B1)-(B4) and one of (B5)-(B6).

The measure µ in (1) for a given ν is generally not unique. Using Proposition 1, we can require µ in (1) to be supported on [1/2, 1] instead of (0, 1].

Example 1. There are three variability measures in Table 1 that satisfy all of (B1)-(B8), and each admit a representation as in Theorem 3. We give below a corresponding measure µ for each of them.

3. The range ∆ 1 : µ = δ 1 .

As we have seen from Theorem 2, all of ∆ Q p , ∆ ES p , ∆ ex p are invariant under location shifts. In the next result, we show that each of the one-parameter families ∆ Q p , ∆ ES p , ∆ ex p characterize a symmetric distribution up to location shifts.

Proposition 2. Suppose that X has a symmetric distribution, i.e., X d = −X. Each of the curves p → ∆ Q p (X), p → ∆ ES p (X) and p → ∆ ex p (X) for p ∈ (1/2, 1), if it is finite, uniquely determines the distribution of X.

Remark 3. If the distribution of X is not symmetric, none of p → ∆ Q p (X), p → ∆ ES p (X) and p → ∆ ex p (X) for p ∈ (1/2, 1) determines its distribution up to location shifts. This is because the inter-quantile difference curve p → Q p − Q − 1−p does not determine the quantile curve p → Q p . For instance, given a quantile curve p → Q p (X), we can define another quantile

The inter-quantile difference curves of X and Y are the same, but the distributions of X and Y are not the same up to a location shift unless f is a constant.

Remark 4. From Kusuoka (2001) it is well known that any coherent risk measure admits a representation as a supremum of mixtures of ES; see Bellini et al. (2014) for the case of expectiles. One naturally wonders whether an inter-expectile difference can be represented as the supremum of mixtures of inter-ES differences, i.e., the supremum over functions of form (1). Rather surprisingly, it turns out that such a relationship does not hold in general, as illustrated by Example 3 in Section 4.

Since the variability measures can be easily estimated from real data (see Section 5 below), one may conclude some ordering relations between two data sets with ordered measures of variability. For this purpose, we consider stochastic orders induced by pointwise comparison of inter-quantile, inter-ES, and inter-expectile differences. The first case has been studied in Townsend and Colonius (2005) under the name of quantile spread order, defined as follows:

Note that the order QS is weaker than the well-known dispersive order, defined by

for which we refer e.g. to Müller and Stoyan (2002) and Shaked and Shantikumar (2007) . We define two stochastic orders based on inter-ES and inter-expectile differences as follows:

It turns out that for symmetric random variables, these orders are equivalent to the dilation

as shown in (v) and (vi) below; the other properties are summarized in the following.

Proposition 3. Let X, Y ∈ L 1 . The following statements hold:

(ii) If |a| 1, then X ∆-ES aX and X ∆-ex aX;

(vi) If X and Y are symmetric with respect to their means, then

In case X or Y is not symmetric, then the equivalence relations in (vi) may fail, as the following simple example shows. Therefore, the two new orders ∆-ES and ∆-ex are generally weaker than the dilation order. This provides more flexibility for these new orders in real-data applications, as we will illustrate in Section 7.

and ∆ ex p (Y ) = 4p − 2 for 1/2 p 1.

It follows that ∆ ex p (X) ∆ ex p (Y ) for each p ∈ [1/2, 1]. However, X dil Y because X and Y have the same mean, and the support of X is not contained in that of Y . This shows that ∆-ES and ∆-ex do not imply dil .

Finally, in the asymmetric case the ∆-ES and ∆-ex orders are not related. In the next example we have that X ∆-ES Y but X ∆-ex Y , and a (real-data) example in which the opposite situation occurs can be found in Section 7. 1/2 p 3/4.

The properties of non-parametric estimators of ∆ Q p (X), ∆ ES p (X) and ∆ ex p (X) can be derived from those of VaR, ES and expectiles, as we will explain in this section.

Suppose X 1 , X 2 , . . . , X n ∈ L 1 is an iid sample from a random variable X. Recall that the empirical distribution F n of X 1 , . . . , X n is given by

Let ∆ Q p (n) be the empirical estimator of ∆ Q p (X), obtained by applying ∆ Q p to the empirical distribution of X 1 , . . . , X n . Similarly, let ∆ ES p (n) and ∆ ex p (n) be the empirical estimators of ∆ ES p (X) and ∆ ex p (X). We will establish consistency and asymptotic normality of the empirical estimators, based on corresponding results on empirical estimators of VaR, ES and expectiles in the literature, e.g., Chen and Tang (2005) , Chen (2008) , and Krätschmer and Zähle (2017) .

We make the following standard regularity assumption on the distribution of the random variable X.

(R) The distribution F of X ∈ L 1 is supported on a convex set and has a positive density function f on the support.

Denote by g = f • F −1 and let p ∈ (1/2, 1). We will show in the next theorem that the asymptotic variances of the empirical estimators for ∆ Q p and ∆ ES p are given by, respectively,

and that for ∆ ex p is given by

where for r ∈ {p, 1 − p},

Theorem 4. Suppose that p ∈ (1/2, 1) and Assumption (R) holds.

(ii) If X ∈ L 2+δ for some δ > 0, then

where σ 2 Q , σ 2 ES and σ 2 ex are given in (2), (3) and (4), respectively.

Simulation results are presented in Figure 1 for p = 0.9 in the case of standard normal and Pareto risks with tail index 4, that confirm the asymptotic normality of the empirical estimators in Theorem 4. More general asymptotic results for α-mixing processes could be similarly established using results in Chen (2008) and Krätschmer and Zähle (2017) . For the sake of space we do not discuss here the case of dependent observations.

Remark 5. For part (i) of Theorem 4, the assumption (R) is used to guarantee that the empirical quantiles converge to the true quantile (more precisely, we only need the quantile function to be continuous at p and 1 − p); this is not needed for the consistency statements on ∆ ES p and ∆ ex p in part (i). a stationary and ergodic data sequence (X n ) n∈N ; that is, ρ(n) → ρ(X 1 ) almost surely. For a general variability measure ν, a similar result holds if we further assume that ν is normcontinuous on H Ψ , following the same arguments as in the proof of Theorem 2.6 of Krätschmer et al. (2014) , where the only used property of ρ is norm-continuity. This statement includes consistency of ∆ ES p (n) and ∆ ex p (n) on L 1 in part (i) of Theorem 4 as special cases. For real-valued convex risk measures, norm-continuity is implied by monotonicity and convexity. To establish such continuity, monotonicity is essential and it plays the role of positivity in the Namioka-Klee theorem for linear functions; see Biagini and Frittelli (2009) . For convex variability measures, since monotonicity is not satisfied, we have to assume norm-continuity for the analog of Theorem 2.6 of Krätschmer et al. (2014) .

As mentioned in the introduction, we are interested in comparing the inter-quantile, the inter-ES and the inter-expectile differences. Due to the different meanings of the parameter p in VaR p , ES p and ex p , there is no reason to directly compare ∆ Q p , ∆ ES p and ∆ ex p using the same probability level p. For a fair cross comparison, we may calibrate p, q, r such that the variability measures have the same value, that is,

for some common choices of distributions. In particular, we will consider normal (N), t-and exponential distributions as benchmarks, and the curves of q and r in terms of p for these distributions are plotted in Figure 2 . We observe that the values of r is typically much closer to 1 than the corresponding p or q. The matching value of q is smaller than the corresponding p but the relationship between q and p is close to linear; a corresponding observation on comparing VaR and ES is noted by Li and Wang (2019) , where they obtained the ratios (1 − q)/(1 − p) ≈ 2.5 for normal risks and (1 − q)/(1 − p) = e ≈ 2.72 for exponential risks (this corresponds to the straight line in Figure 2b ).

In empirical studies, it has been costumary in the literature to use the matching values for normal distribution as a rule of thumb for general comparisons; note that the location and scale parameters are irrelevant for such a comparison due to location-invariance and positive homogeneity. Roughly, we obtain 

In this section, we illustrate the three classes of variability measures studied in this paper by means of a few empirical studies on financial data.

We first analyze the difference between the performances of these variability measures during different periods of time (different economic regimes). Our data are the historical price movements spanned from 01/04/1999 to 06/30/2020 of the S&P 500 index. 1 We use its daily log-loss data 2 over the observation period with moving window of 253 days for daily estimation of the variability measures. To compare the relative performance of the three measures, we report the ratios ∆ ES q /∆ ex r and ∆ ES q /∆ Q p for the S&P 500 daily log-losses in Figures 3 and 4 using the rule of thumb for (p, q, r) obtained in Section 6 induced by the normal distribution.

In Figure 3 , spikes in the ratio of ∆ ES q /∆ Q p are located around the 2008 subprime crisis and the COVID-19 period. On the other hand, the ratio ∆ ES q /∆ ex r in Figure 4 experiences a downslide around the subprime crisis and the COVID-19 period. These results suggest that ∆ ES q is more sensitive to extremely large losses than ∆ Q p , but ∆ ex r is even more sensitive than ∆ ES q . Recall that these ratios should be 1 if the underlying losses are normally distributed, whereas we observe ∆ ES q /∆ Q p > 1 and ∆ ES q /∆ ex r < 1 for most dates during the period of 2000 -2020 (∆ ES q /∆ ex r is almost always smaller than 1). Hence, Figures 3 and 4 confirm that the log-losses of S&P 500 are not normally distributed, and in fact, they typically show paretian tails, as is well studied in the literature (see, e.g., McNeil et al. (2015) ). are comparable in one of the symmetric variability orders considered in Section 4, we recall that an equivalent condition for the dilation order is given by

, for each p ∈ (0, 1).

We see in the left panel of Fig. 6 that there is an intersection point in the ES p − E curves, so Facebook's log-returns do not dominate Berkshire Hathaway's according to the dilation order (and vice versa). In this specific example, this is due to the presence of two large values in the distribution of Berkshire Hathaway's daily log-returns. On the contrary, looking at the center and left panels of Fig. 6 we see that there are no intersection points, so Facebook's log-returns dominate Berkshire Hathaway's according to both the ∆-ES and ∆-ex orders.

Hence, both ∆-ES and ∆-ex are able to model an ordering relation in the variability between two distributions, when the classic dilation order fails to hold, and this shows the additional flexibility of the new orders over the classic notion.

As a third example, we compare the distributions of log-returns of the S&P500 Index in 2008 and in 2020, displayed in Fig. 7 . As in the previous example, we plot the relevant curves in Fig. 8 . Here there is an intersection point both in the left and in the center panel, and no intersections in the right panel, so only the ∆-ex order applies.

In order to give a first exploratory assessment of how often the various symmetric variability orders do apply, we checked the comparability of daily log-returns of the S&P500 Index for each pair of years ranging from 2008 to 2020, for a total of 78 = 13 × 12/2 pairs. The results are reported in Table 2 . It turns out that in 66 cases the dil order applies, and so as a consequence also the other two weaker orders apply. In the remaining 12 cases, one or both of the ∆-ES and ∆-ex orders apply in 8 cases, so when the dil order does not apply, we have a fraction of 8/12 67% of cases in which the data can still be compared. Notice also that the ∆-ES order without the ∆-ex order never occurred for this dataset; however, Example 3 in Section 4 shows that also this situation is theoretically possible. Table 2 : Number of occurencies of the symmetric variability orders dil , ∆-ES and ∆-ex in the 78 = 13 × 12/2 pairs of years of daily log-returns of the S&P500 Index, ranging from 2008 to 2020, and corresponding pairs. For brevity we report only years' last two digits. 

In this paper, we introduce variability measures induced by three very popular parametric families of risk measures, that is, the inter-quantile, the inter-ES, and the inter-expectile differences. The three classes of variability measures enjoy many nice theoretical properties (Theorem 1); in particular, each of them characterizes symmetric distributions up to a location shift (Proposition 2). We study several desirable functional properties of general variability measures including the above three classes and many other classic ones; a grand summary is obtained in Theorem 2 and Table 1 . The family of variability measures that satisfy a set of desirable properties is characterized as mixtures of inter-ES differences (Theorem 3). It is important to note that the three classes of variability measures introduced in this paper are well defined on L 1 and that each depends on a single parameter which allows for flexible applications. This distinguishes them from other deviation measures (e.g., Rockafellar et al. (2006) ) where no parametric family is given. The empirical estimators of the inter-quantile, the inter-ES, and the inter-expectile differences can be formulated based on those of VaR, ES and the expectile, and the asymptotic normality of the estimators is established (Theorem 4). In the financial application, we observe that the behaviour of these variability measures is similar to the corresponding parametric families of risk measures. However, a comparison of different ratio of the variability measures reveals that ∆ ex is the most sensitive to extreme losses, and ∆ Q is the least sensitive.

For the end-user, if tail risk is of particular concern, then ∆ ex may be a better variability measure to use, as it captures tail-heaviness quite effectively. However, ∆ ex is usually cum- bersome in computation and optimization because of the lack of explicit formulas in terms of quantile or distribution functions; another technical disadvantage is that ∆ ex is not concave with respect to mixtures. On the other hand, if robustness is more important and tail risk is not relevant, then ∆ Q is a good choice, because quantiles are easy to compute and they are generally more robust than coherent risk measures including ES and expectiles (see Cont et al. (2010) ). Moreover, ∆ Q is well defined on risks without a finite mean; nevertheless we should keep in mind that ∆ Q ignores tail risk just like a quantile. Finally, ∆ ES lies somewhere in between ∆ Q and ∆ ex regarding the above considerations, which giving rise to a good compromise; further, it is the only one among the three classes that is concave with respect to mixtures (see Table 1 ), and it is the building block for many other measures of variability (see as entropic risk measures (e.g., Föllmer and Schied (2016) ) and RVaR (e.g., Embrechts et al. (2018) ), can also be used to design flexible variability measures. 

The uniqueness of solution x to (5) implies −ex 1−p (X) = ex p (−X). Hence,

thus the desired formula.

Proof of Proposition 1. By definition, for X ∈ L 1 ,

By Theorem 1,

Hence, the desired statements hold.

Proof of Theorem 2. We first explain some general observations on all variability measures in Table 1 . The effective domains and the homogeneity indices follow directly from definition. Continuity (B2) is implied by L q continuity since all variability measures are finite and thus continuous on their effective domains. Symmetry (B3) and location invariance (B8) are straightforward to check, and they hold for all variability measures in Table 1 . The conditions (B5)-(B7) are connected. In particular, Theorem 3 of Wang et al. (2020a) states that (B5)-(B7) are equivalent for distortion riskmetrics, which are functionals satisfying (A1), (B4) and some continuity assumptions. It is well known that the inter-quantile differences and the inter-ES differences are distortion riskmetrics.

Next, we explain that convexity (B6) implies Cx-consistency (B5) for all variability measures we consider. By Theorem 2.2 of Liu et al. (2020) , all law-invariant convex risk functionals, i.e., functionals satisfying (A1), (B6) and (B8), can be written as the supremum of a family of convex distortion riskmetrics. Each distortion riskmetric is Cx-consistent as stated in Theorem (i) The following example shows that ∆ Q p does not satisfy (B1). Take ε > 0 such that p + ε < 1 and X ∼ Bernoulli(1 − p − ε). Notice that X is not a constant but ∆ Q p (X) = Q p (X) − Q − 1−p (X) = 0 − 0 = 0. C-additivity (B4) is satisfied since ∆ Q p is a distortion riskmetric. (B6)-(B7) are explained above.

(ii) ∆ ES p , Gini-D and range are all convex distortion riskmetrics; see Table 1 of Wang et al. (2020a) . Hence, they all satisfy (B4)-(B7). Relevance (B1) can be easily verified.

(iii) If X is not a constant, by Newey and Powell (1987, Theorem 1) , ex p is strictly increasing in p ∈ (0, 1), which means that ∆ ex p (X) = ex p (X) − ex 1−p (X) > 0 for p ∈ (1/2, 1). By Proposition 7 of Bellini et al. (2014) , ex p is increasing in X, so for |X| 1, −1 ex p (X) 1 for p ∈ (0, 1). Thus ∆ ex p (X) 2 and Relevance (B1) is satisfied. Convexity (B6) is satisfied by Theorem 1 (iv) and convexity of expectiles.

We show that M-concavity (B7) is not satisfied by ∆ ex p (X) via the following example from . Take p = 1/10. Define X by P(X = −1) = 1/2, and P(X = 1) = 1/2; Y by P(Y = 0) = 2/3, P(Y = 5) = 1/3. Then ∆ ex 1/10 (X) = − 8 5 and ∆ ex 1/10 (Y ) = − 800 209 . Let F = 9 10 F X + 1 10 F Y and Z ∼ F . Then ∆ ex 1/10 (Z) = − 2531 1311 < 9 10 ∆ ex 1/10 (X) +

and hence ∆ ex p is not mixture concave. C-additivity (B4) is not satisfied since by Theorem 3, a variability measure satisfying (B1)-(B5) must satisfy (B7).

(iv) For the variance, Relevance (B1) can be easily verified. Variance does not satisfy (B4) since (B4) requires the homogeneity index to be 1. For (B6), the variance is well known to be convex (Deprez and Gerber (1985) ); see also Example 2.2 of Liu et al. (2020) . The variance satisfies M-concavity (B7) because of the well known equality

Since σ 2 is the minimum of mixture-linear functionals, we know that it is mixture concave.

(v) For STD, Relevance (B1) can be easily verified. C-additivity (B4) is not satisfied by STD since STD is not additive for comonotonic random variables X and Y with correlation less than 1. STD is convex (B6); see Example 2.1 of Liu et al. (2020) . To show that STD satisfies M-concavity (B7), take X, Y ∈ L 1 and let Z ∼ λF X + (1 − λ)F Y for λ ∈ [0, 1]. By definition,

which is equivalent to σ(Z) λσ(X) + (1 − λ)σ(Y ).

(vi) For the mean absolute deviation (MAD), Relevance (B1) can be easily verified. MAD satisfies convexity (B6), since, for λ ∈ [0, 1] and X, Y ∈ L 1 ,

We give an example showing that MAD does not satisfy M-concavity (B7). Take X ∼ Therefore,

and hence MAD is not mixture concave.

C-additivity (B4) is not satisfied by MAD since by Theorem 3, a variability measure satisfies (B1)-(B5) must satisfy (B7).

Proof of Theorem 3. Write the functional ν µ = 1 0 ∆ ES p dµ(p), which is the right-hand side of (1). First, obviously (i) implies (ii). It is also straightforward to check that (iii) implies (i), since ∆ ES p for p ∈ (0, 1] satisfies (B1)-(B8) by Theorem 2, and so is ν µ ; the only non-trivial statement is (B2) of ν µ which is guaranteed by Theorem 5 of Wang et al. (2020a) , which shows that the representation ν µ belongs to a class of convex distortion riskmetrics with continuity (B2). Below, we show (ii)⇒(iii).

Let X ν be the effective domain of ν. Take X ∈ X ν such that ν(X) > 0. By (B4), ν(2X) = ν(X) + ν(X) = 2ν(X). Hence, the homogeneity index of ν is 1.

Suppose that Cx-consistency (B5) holds. Take any X, Y ∈ X ν and let X d = X and Y d = X such that X and Y are comonotonic. It is well known that X + Y cx X + Y ; see e.g., Theorem 3.5 of Rüschendorf (2013) . Using (B4) and (B5), we have ν(X + Y ) ν(X + Y ) = ν(X ) + ν(Y ) = ν(X) + ν(Y ). Therefore, ν is subadditive, that is,

Note that convexity (B6) and homogeneity (A3) with α = 1 together also imply subadditivity. Hence, either assuming (B5) or (B6), we get (6). It follows from (6) and (B1) that there exists β > 0 such that ν(Y ) − ν(X) ν(Y − X) β Y − X ∞ where Y − X ∞ is the essential supremum of |Y − X|. Hence, ν is uniformly continuous with respect to the supremum norm. Moreover, as a consequence of (B1), (A3) and (6), X ν is a convex cone that contains L ∞ . Theorem 1 of Wang et al. (2020a) states that a real functional on a convex cone that is uniformly continuous with respect to the supremum norm, law-invariant, and satisfying (B2) and (B4) is a distortion riskmetric in the sense of that paper; see (7) below. Further, Theorem 3 of Wang et al. (2020a) says that each of (B5)-(B7) is equivalent to the convexity of a distortion riskmetric. Hence, ν is a convex distortion riskmetric on X ν ∩ L 1 . Theorem 5 of Wang et al. (2020a) gives a representation of convex distortion riskmetrics; that is, ν has a representation, for some finite measures µ 1 and µ 2 , ν(X) = 1 0 ES p (X)dµ 1 (p) + 1 0 ES p (−X)dµ 2 (p), X ∈ X ν ∩ L 1 .

By symmetry (B3), we know ν(X) = ν(−X) = 1 0 ES p (X)dµ 2 (p) + 1 0 ES p (−X)dµ 1 (p), X ∈ X ν ∩ L 1 .

Hence, we can take µ = (µ 1 + µ 2 )/2, and get ν(X) = 1 0 ∆ ES p (X)dµ(p), X ∈ X ν ∩ L 1 .

Relevance (B1) implies µ = 0, which in turn implies X ν ⊂ L 1 , as the effective domain of ∆ ES p is L 1 for p ∈ (0, 1). Hence, the two functionals ν and ν µ coincide on X ν which contains L ∞ . Also note that both ν and ν µ satisfy continuity (B2), and hence one can approximate any random variable outside X ν with truncated random variables, and obtain that ν and ν µ also coincide on X .

be applied to any p ∈ (0, 1 Hence,

Coherent measures of risk

A note on quantiles in large samples

Conditional expectiles, time consistency and mixture convexity properties

Risk management with expectiles

Generalized quantiles as risk measures

Expectiles, Omega Ratios and Stochastic orderings

On the dependence structure between S&P500, VIX and implicit Interexpectile Differences. Quantitative Finance

On the extension of the Namioka-Klee theorem and on the Fatou property for risk measures

Nonparametric estimation of Expected Shortfall

Nonparametric inference of Value-at-Risk for dependent financial returns

Robustness and sensitivity analysis of risk measurement procedures

Early sample measures of variability

On convex principles of premium calculation

An academic response to Basel 3.5. Risks

Quantile-based risk sharing

What is the best risk measure in practice? A comparison of standard measures

The canonical model space for law-invariant convex risk measures is L 1

Stochastic Finance. An Introduction in Discrete Time

Gini-type measures of risk and variability: Gini shortfall, capital allocation and heavy-tailed risks

Maximum entropy principle with general deviation measures

Chebyshev inequalities with law invariant deviation measures

Comparative and quantitiative robustness for law-invariant risk measures

Statistical inference for expectile-based risk measures

On law-invariant coherent risk measures

PELVE: Probability equivalent level of VaR and ES

Convex risk functionals: representation and applications

Quantitative Risk Management: Concepts, Techniques and Tools. Revised Edition

Comparison Methods for Stochastic Models and Risks

Asymmetric least squares estimation and testing. Econometrica

Jackknife empirical likelihood method for some risk measures and related quantities

Generalized deviation in risk analysis

Mathematical Risk Analysis. Dependence, Risk Bounds, Optimal Allocations and Portfolios

Stochastic Orders

Empirical Processes with Applications to Statistics

Variability of the max and min statistic: A theory of the quantile spread as a function of the sample size

Distortion riskmetrics on general spaces

Characterization, robustness and aggregation of signed

An axiomatic foundation for the Expected Shortfall

Coherence and elicitability

We thank an Editor, an Associate Editor, and two anonymous referees for helpful comments. T. Fadina and R. Wang are supported by the Natural Sciences and Engineering Research Council of Canada (RGPIN-2018-03823, RGPAS-2018.

(vi) The Gini deviation (Gini-D):1 2 E[|X 1 − X 2 |], X ∈ L 1 , X 1 , X 2 , X are iid.(vii) The relative deviation: SD(X)

, X ∈ L 2 + .(viii) The Gini coefficient:, X ∈ L 1 + , X 1 , X 2 , X are iid.Here, L q + , q ∈ [0, ∞] is the set of all non-negative random variables X in L q with P(X > 0) > 0.

Proof of Theorem 1. (i) Law invariance (A1) is obvious. For standardization (A2), note that the risk measures ρ ∈ {Q p , Q − p , ES p , ES − p , ex p } are all monetary (Föllmer and Schied (2016) ) and satisfies ρ(m) = m for any constant m. Hence, for a constant m, ∆ Q p (m) = ∆ ES p (m) = ∆ ex p (m) = 0. Positive homogeneity follows from that of Q p , Q − p , ES p , ES − p and ex p .(ii) The effective domains of these variability measures can be easily checked from the effective domain of the corresponding risk measures.(iii) Since Q p is increasing in p and Q − 1−p is decreasing in p, ∆ Q p is increasing in p. The same applies to ∆ ES p and ∆ ex p .(iv) It is well known that, for X ∈ L 0 , Q p (−X) = −Q − 1−p (X); see e.g., Föllmer and Schied (2016, (4.44) ). Hence,, follows directly from definition.Next we show the formula for ∆ ex p . From Newey and Powell (1987) , the expectile ex p (X), for p ∈ (1/2, 1) is the unique solution x toHence, the expectile of −X satisfiesProof of Proposition 2. (i) If X has a symmetric distribution, then by Theorem 1 (iv), we haveAssume X 1 and X 2 are symmetric distributions with finite ∆ Q p (X 1 ) = ∆ Q p (X 2 ) for p ∈ ( 1 2 , 1). It follows that Q − p (X 1 ) = Q − p (X 2 ) for p ∈ (0, 1 2 ). By the left-continuity of the left-quantile, Q − 1/2 (X 1 ) = Q − 1/2 (X 2 ). By symmetry of the distribution of X, we have Q − p (X 1 ) = Q − p (X 2 ) almost every p, and thus X 1 and X 2 have the same distribution.(ii) If X has a symmetric distribution, then similarly to (i), we have ∆ ESBy taking a derivative of both sides of (8) with respect to p, we getat all common continuity points p of p → Q p (X 1 ) and p → Q p (X 1 ). Since both functions are right-continuous, we know that the two functions are identical. This argument can 2 ). Similarly to part (i), we conclude that X 1 and X 2 have the same distribution.(iii) If X has a symmetric distribution, then similarly to (i), we haveThe expectile has alternative definitions from Newey and Powell (1987) ,which leads toSince ex p (X) is continuous in p and takes all values in the range of X, we knowfor all x ∈ R, implying that the distributions of X 1 and X 2 are identical.Proof of Proposition 3. (i), (ii), (iii) follow immediately, respectively from location invariance, positive homogeneity of order 1 and symmetry of ∆ ES p and ∆ ex p , while (iv) follows immediately from the second part of the thesis of Proposition 1.(v) By passing if necessary to the random variablesX =from (i) we can assume without loss of generality that E[X] = E[Y ] = 0. Then X dil Y ⇒ X cx Y , and the thesis follows from Cx-consistency of ∆ ES p and ∆ ex p , for each p ∈ (1/2, 1).(vi) As in (v), we can assume without loss of generality that Edr, for each p ∈ (1/2, 1). From symmetry and the assumption E[X] = E[Y ] = 0 it follows that the same inequality holds also for each p ∈ (0, 1/2), that implies X cx Y by Theorem 3.A.5 in Shaked and Shantikumar (2007) . Similarly, under symmetry ∆ ex p (X) = 2ex p (X), so X ∆-ex Y ⇒ ex p (X) ex p (Y ) for each p ∈ (1/2, 1), and since ex p (X) = ex p (−X) = −ex 1−p (X), the opposite inequality holds for p ∈ (0, 1/2). By reasoning as in the proof of Theorem 12 of , it follows that π X (x) π Y (x) for each x ∈ R, where π X (x) := E[(X − x) + ] and π Y (x) := E[(Y − x) + ] are the usual stop-loss transforms of X and Y ; the thesis then follows from Theorem 3.A.1 of Shaked and Shantikumar (2007) .Proof of Theorem 4. (i) Let Q p (n), ES p (n), and ex p (n) be the empirical estimators of Q p (X), ES p (X), and ex p (X) based on n sample data points. It is well known (e.g., Bahadur p → Q r (X) at each r of continuous point of Q r (X), which implies ∆ Q p (n) p → ∆ Q p (X) under assumption (R). Since ES p and ex p are law-invariant convex risk measures, by Theorem 2.6 of Krätschmer et al. (2014) , ES r (n) p → ES r (X) and ex r (n) p → ex r (X) for each r. Hence we have ∆ ES p (n) p → ∆ ES p (X) and ∆ ex p (n) p → ∆ ex p (X).(ii) By Proposition 1 of Shorack and Wellner (2009, p.640) , if assumption (R) is satisfied, then we havewhere B p is a standard Brownian bridge. With assumption (R), Q p (X) = Q − p (X). Hence,which has a Gaussian distribution. Using the covariance property of the Brownian bridge, that is, CovNext, we address the inter-ES difference. Applying the convergence in (9) to ES p , we obtainds, and thusds.Note thatdtds, and 1 (1 − p) 2 Cov, with σ 2 ES given in (3), namely,dtds.For the inter-expectile difference, we use Theorem 3.2 of Krätschmer and Zähle (2017) . The conditions for this theorem are satisfied in our setting noting that X ∈ L 2+δ ; see Remark 3.4 of Krätschmer and Zähle (2017) . We obtain, for p ∈ (1/2, 1), √ n( ex p (n) − ex p (X)) → N(0, s ex p )where for r ∈ {1 − p, p},and f ex r,F (t) =(1 − r)1 {t exr(X)} + r1 {t>exr(X)} (1 − 2r)F (ex r (X)) + r .

This completes the proof.