Abstract

We introduce a new model for the dynamics of fat-tailed (realized) covariance-matrix-valued time-series using the F-Riesz distribution. The model allows for heterogeneous tail behavior across the coordinates of the covariance matrix via two vector-valued degrees of freedom parameters, thus generalizing the familiar Wishart and matrix-F distributions. We show that the filter implied by the new model is invertible and that a two-step targeted maximum likelihood estimator is consistent. Applying the new F-Riesz model to U.S. stocks, both tail heterogeneity and tail fatness turn out to be empirically relevant: they produce significant in-sample and out-of-sample likelihood increases, ex-post portfolio standard deviations that are in the global minimum variance model confidence set, and economic differences that are either in favor of the new model or competitive with a range of benchmark models.

Covariance matrix modeling and estimation play an important role in many areas of economics and statistics, such as financial risk assessment and decision making under uncertainty (Markowitz 1991; Engle, Ledoit, and Wolf 2019). Today’s data-rich environment has led to a shift in ambition from estimating static covariance matrices to estimating covariance matrices on a frequent basis over many short spans of data, also known as realized covariance matrix estimation. Examples include Andersen et al. (2003); Barndorff-Nielsen and Shephard (2004); Chiriac and Voev (2011); Lunde, Shephard, and Sheppard (2016); Callot, Kock, and Medeiros (2017); Bollerslev, Patton, and Quaedvlieg (2018); Bollerslev et al. (2020); and the references cited therein.

An important challenge is to design parsimonious yet flexible time-series models for such series of realized covariance matrices that can be used for forecasting and decision purposes. A complication is that the time-series observations are matrix-valued (rather than vector-valued), have positive (semi)-definite outcomes only, and may be subject to fat-tailed behavior and outliers. Most of the models currently available cannot cope with all of these challenges simultaneously or are highly restrictive. Recent work on tensor-valued time-series such as Wang, Liu, and Chen (2019) and Chen, Yang, and Zhang (2022) can deal with matrix-valued time-series, but not with restrictions on positive definiteness of the observations or with the fat-tailed nature of these data in many applications. Other approaches that can deal with positive definite random matrices are typically highly restrictive. For instance, the often-used Wishart or inverse Wishart distributions for matrix-valued time-series only feature two parameters: a matrix-valued mean, and a single scalar-valued degrees of freedom parameter to describe the tail behavior across all coordinates (Golosnoy, Gribisch, and Liesenfeld 2012; Jin and Maheu 2013, 2016). Similarly restrictive, the matrix-F distribution only features two tail parameters for any k×k realized covariance matrix (Konno 1991; Opschoor et al. 2018). While such distributions might be suitable for low-dimensional cases, in moderate to high dimensions the implied constraints on tail behavior in the cross-section are typically empirically too restrictive.

The typical approach found in the literature to flexibilize multivariate distributions by splitting them into the marginal distributions and a copula (see for instance Patton 2009; Oh and Patton 2017, 2018; Opschoor et al. 2021) cannot easily be applied here. Most copula methods relate to vector-valued observations and cannot deal with the positive definiteness of covariance-matrix-valued observations. Second, many copula structures available in the literature are also tightly parameterized, such as the Gaussian, (skewed) Student’s t, and Archimedean copulas, with very little heterogeneity in the tail-dependence structure.

In this article, we, therefore, introduce the dynamic F-Riesz distribution to model sequences of realized covariance matrices. We do so by building on the beta type II Riesz distribution of Díaz-García (2016).1 We introduce dynamics for the key scale parameter of the F-Riesz distribution and derive the invertibility of the filter and the consistency properties of the maximum likelihood estimator (MLE) for the static parameters of the model. A key property of the F-Riesz distribution is that it allows for tail heterogeneity in each of its coordinates. It does so by replacing the two scalar degrees of freedom parameters of the matrix-F distribution of Konno (1991) by two vectors of degrees of freedom parameters. If each of these vectors is scalar (i.e., has the same elements), then the F-Riesz model reduces to the matrix-F model (see Konno 1991; Opschoor et al. 2018).

The F-Riesz distribution is constructed by mixing a Riesz distribution (Hassairi and Lajmi 2001; Díaz-García 2013) with an Inverse Riesz distribution (Tounsi and Zine 2012; Louati and Masmoudi 2015), both of which are generalizations of the Wishart and Inverse Wishart distributions. The Riesz distribution has thus far mainly been used in the physics literature (Andersson and Klein 2010). In economic statistics, Gribisch and Hartkopf (2023) also recently apply the Riesz distribution to financial data. They introduce a state-space version of the dynamic Riesz distribution and estimate the model using Bayesian techniques. We differ from their approach in two important ways. First, we use an observation-driven rather than a parameter-driven approach to model the dynamics of realized covariance matrices. As a result, we can obtain the likelihood function in closed form and can stick to standard maximum likelihood rather than simulation-based or Bayesian techniques for the estimation of model parameters. Second, we use the more flexible and fat-tailed F-Riesz-distribution rather than the Riesz. This allows for additional distributional flexibility, which appears relevant in our application within our class of observation-driven models when comparing the behavior of the F-Riesz model with the (Inverse) Riesz.2

We apply the dynamic F-Riesz distribution to a sample of daily realized covariance matrices of dimensions 5 and 15 using U.S. stock data over the period 2001–2019. The results show that both tail heterogeneity and fat tailedness as captured by the F-Riesz distribution are empirically relevant compared to tail heterogeneity only (Riesz) or fat-tailedness only (matrix-F). The data strongly reject both the dynamic Riesz and the dynamic matrix-F specifications for the dynamics of realized covariance matrices compared to the F-Riesz alternative, despite both the Riesz and matrix-F already performing significantly better in terms of likelihood fit than the dynamic Wishart and inverse Wishart models. In addition, weekly and biweekly point forecasts of the predicted covariance matrices from the F-Riesz distribution are at par or better than those of the alternative model specifications, while its density forecasts measured by the log-scoring rule are clearly better. The one-step-ahead predicted covariance matrices from the F-Riesz distribution also perform well in a global minimum variance portfolio strategy: they yield statistically lower ex-post portfolio standard deviations and higher economic gains after taking into account transaction costs compared to competing models and several standard benchmarks such as the HAR-DRD model of Oh and Patton (2016). We conclude that the dynamic F-Riesz distribution can prove useful for the statistical analysis of covariance-matrix-valued time-series.

The rest of this article is set up as follows. In Section 1, we introduce the model. Section 2 considers filter invertibility and the consistency properties of the two-step targeted MLE and also studies the new model’s performance in a simulated setting. Section 3 presents the empirical results. Section 4 concludes. The Supplementary Appendices gather further technical and empirical results. As a notational convention in this article, we write scalars in normal type face, vectors in bold, and matrices in capitalized bold font.

1 The Conditional Autoregressive F-Riesz Model

1.1 Dynamic Model Specification

Let XtRk×k for t=1,,T denote a time-series of realized covariance matrices, and define the filtration Ft1={X1,,Xt1} containing all lagged observations of Xt. We describe the dynamics of Xt by a conditional, matrix-valued distribution with an autoregressive, time-varying mean,
(1)
where LVt is the lower-triangular Cholesky decomposition of Vt, such that Vt=E[Xt|Ft1]=LVtLVt, and where Et is a matrix-valued innovation with conditional expectation equal to the identity matrix, E[Et|Ft1]=Ik. We can view Equation (1) as the matrix version of the scalar MEM model of Engle and Gallo (2006). We explicate the conditional pdf of Xt in Equation (4) further below. The parameter matrix Ω in Equation (1) is symmetric and positive definite. For simplicity, we take A and B as scalar parameters like in the original DCC model of Engle (2002), but generalizations of this can easily be accommodated. Note that for positive definite Ω and V1 and for A>0,B>0, and A+B<1,Vt is automatically positive definite for all t. We initialize V1 by the unconditional mean of Xt. Also note that more complex dynamic structures can easily be allowed for in Equation (1). For instance, in the empirical application in Section 3 we incorporate HAR-type dynamics as in Corsi (2009) to better capture the possible long-memory behavior of realized covariance matrices.

If we assume the conditional distribution of Xt|Ft1 to be Wishart, the dynamics in Equation (1) resemble the Conditional Autoregressive Wishart (CAW) model (see Golosnoy, Gribisch, and Liesenfeld 2012). For the Wishart case, the model also collapses to one of the two core equations of the Multivariate HEAVY model of Noureldin, Shephard, and Sheppard (2012). As argued in the introduction, however, assuming an (Inverse) Wishart or even a matrix-F distribution for Et can be too restrictive in terms of the tail behavior it implies. This is why we introduce the F-Riesz distribution in Section 1.2 to allow for tail heterogeneity.

Note that model (1) is observation-driven. Therefore, it allows for easy parameter estimation via standard maximum likelihood methods employing a prediction error decomposition using the expression for the F-Riesz pdf in Equation (4). To reduce the dimensionality of the optimization, we can use a targeting approach to pre-estimate Ω. This can be done in the following way. Assuming stationarity and the existence of an unconditional first moment of Xt, one can take unconditional expectations on both sides of the equations in Equation (1) to obtain V¯=E[Vt]=E[Xt]=Ω. This result can be used to estimate Ω by the sample average of the realized covariance matrices, Ω^=T1t=1TXt. Plugging this expression into the log-likelihood function, the resulting function then only depends on the remaining parameters A and B, plus any parameters describing the conditional distribution of Et.

1.2 The F-Riesz Distribution

The family tree of the F-Riesz distribution considered in this article is provided in Figure 1. The Wishart and to a lesser extent the matrix-F distributions are assumed to be sufficiently well known. The Riesz distribution, however, may be less familiar. Therefore, we first briefly recapitulate the basics of the Riesz distribution before presenting the F-Riesz distribution. A more extensive introduction to the different distributions and some more technical results can be found in the Supplementary Appendices A and B.

Graphical representation of the relationship betweens various matrix distributions. The F-Riesz distributions is a combination of the Riesz and the inverse-Riesz distribution
Figure 1

Family of matrix distributions.

Notes: This figure shows a family tree of the F-Riesz distributions. Connected lines mean that distributions are related by generalization.

The Riesz distribution is characterized by two parameters: a positive definite scaling matrix Σ=LL with lower triangular Cholesky decomposition L, and a vector of degrees of freedom parameters ν=(ν1,,νk), with νi>i1 for i=1,,k. Arguably the easiest way to introduce the Riesz distribution is via its so-called Bartlett decomposition, which is a familiar simulation device for the standard Wishart distribution; see Anderson (1962). Consider a random matrix GRk×k, defined as
(2)
where all elements of G are independent random variables. Then Y=LGGL has a so-called Riesz distribution of type I, which we write as YRI(Σ,ν).3

Given the use of the Cholesky decomposition, it is clear that the ordering of the variables matters. This is a well-known and accepted feature in the Riesz literature, as the order of the variables can also be recovered from the data under the assumption of correct specification by maximizing the likelihood also over the order of the variables in the system, rather than over Σ and ν only. In small dimensions, this can be done by full enumeration. For moderate dimensions, we provide a heuristic algorithm in Section 1.3 that works well in our simulated and empirical setting. Empirically, we find that the step of allowing for tail heterogeneity by generalizing the Wishart to the Riesz or the matrix-F to the F-Riesz distribution increases the likelihood much more than subsequent likelihood increases due to determining the precise ordering of the coordinates. The latter, in our setting, are typically of second-order importance.

If ν=(ν,,ν) for some ν>k1, the Riesz distribution collapses to the well-known Wishart distribution with ν degrees of freedom. In that case, the degrees of freedom parameters νi on the diagonal of the Bartlett decomposition G in (2) all have the same value ν. In contrast to the Wishart, the Riesz distribution thus allows for heterogeneous tail behavior in the cross section. Tail fatness, however, is left unaffected and is still exponential (thin) in all directions.

To introduce fatter tails for the Riesz distribution, we draw the analogy between the Wishart and matrix-F distribution (Konno 1991). If Y has a Wishart distribution, YW(Ik,ν), and X given Y also has a Wishart distribution, X|YW(Y1,μ), then the unconditional distribution of X is a matrix-F: XF(Ik,μ,ν). Replacing the Wishart distribution by its generalization, the Riesz, we might expect that a similar result can be obtained which allows for both fat and heterogeneous tails. This is confirmed in Theorem 2.3, which we prove in Supplementary Appendix A.

To formulate the theorem, we first need to define the concepts of the generalized (lower and upper) gamma function and the (lower) power weighted determinant. 

Definition 2.1

(Generalized multivariate gamma functions). The lower generalized multivariate gamma function for a vector-valued argument ν=(ν1,,νk)Rk×1 is defined as Γ¯(ν)=πk(k1)/4i=1kΓ(νi+1i2), with 2νi>i1 for i=1,,k. Similarly, the upper generalized multivariate gamma function is defined as Γ¯U(ν)=πk(k1)/4i=1kΓ(νi+ik2)=Γ¯(ν+γ˜), for 2νi>ki for i=1,,k, and γ˜=12(k+1)(k,k1,,1)=(12(k1),,12(k1)).

The upper and lower generalized multivariate gamma functions enter the integrating constant of the Riesz and F-Riesz distributions. If ν=(ν,,ν), then Γ¯(ν)=Γk(ν), where Γk(ν) is the standard multivariate gamma function.

Next, we introduce the Lower Power Weighted Determinant (LPWD). The power weighted determinant takes a similar role in the expressions for the density of the F-Riesz distribution as standard determinants do for the Wishart and the matrix-F distribution. 

Definition 2.2

(LPWD). Consider the vector νRk×1 and a positive definite matrix Y. Let L be the (unique) lower triangular Cholesky decomposition of Y, that is, Y=LL. Then the LPWD |Y|ν is given by |Y|ν=i=1kLi,i2νi.

In the physics literature, the power weighted determinants are commonly introduced via so-called weight functions; see for instance Gross and Richards (1987). In this article, we instead use the notation of power weighted determinants as it is closer to the econometric literature and stresses the simplification of the Riesz to the Wishart and of the F-Riesz to the matrix-F distribution if ν=νιk. Note that the power weighted determinant is not as simple as a regular determinant. Properties like |A·B|=|A|·|B| for matrices A,BRk×k either do not hold or hold in modified form for power weighted determinants. Supplementary Appendix Lemma B.1 provides manipulation rules for power weighted determinants.

We now obtain the following formulation of the pdf of the F-Riesz distribution. 

Theorem 2.3

(F-Riesz distribution).

  • If  X|YRI(Y,μ)  has a conditional Riesz type-I distribution, and  YiRII(Σ,ν)  has an Inverse Riesz type-II distribution, then  X  is  FRI(Σ,μ,ν)  distributed with density function  
  • Let  Σ=LL  for a lower triangular matrix  L. If  XFRI(Σ,μ,ν), then  L1X(L)1FRI(Ik,μ,ν).

Theorem 2.3 corrects a result from Díaz-García (2016) on the pdf of the generalized Beta II distribution. The result in that article is only valid for Σ=Ik, and incorrect otherwise. The correction is therefore crucial for our application, where we allow for time-varying scale matrices Σt.

Interestingly, except for the use of the non-standard generalized gamma functions and power weighted determinants, the density expression of the F-Riesz one-on-one mirrors and generalizes that of the matrix-F distribution. The following corollary establishes this link and shows that the matrix-F distribution of Konno (1991) as used by Opschoor et al. (2018) is a special case of the F-Riesz distribution. 

Corollary 2.4.
Under the conditions of  Theorem 2.3  part (i) or (ii), if we assume  μ=μ·ιk  and  ν=ν·ιk, then  X  has a matrix-F distribution  F(μ,ν), and  
(3)
where  Γk(·)  is the multivariate gamma function.

The corollary makes clear that it is possible to test whether the F-Riesz collapses to the matrix-F distribution by testing whether all elements in μ are the same, as well as all elements in ν.

Similar to the scalar or matrix-F distributions, the expectation of the F-Riesz does not always exist. The following theorem derives expressions for the expectation and the conditions for its existence. An expression for the first moment of the F-Riesz distribution is important for our dynamic model in Equation (1), as Et is normalized to have the unit matrix as its conditional expectation. 

Theorem 2.5
(Expectation of the F-Riesz distribution). Let  YFRI(I,μ,ν), then  E[Y]=M(μ,ν), where  M(μ,ν)  is a diagonal matrix with ith diagonal element  Mi,i(μ,ν)  equal to  
provided  μi>0  and  νi>k+2i  for  i=1,,k.

Combining this result with Theorem 2.3, we immediately obtain that the disturbance term Et in our dynamic F-Riesz model in Equation (1) should have scale matrix Mi,i(μ,ν)1 in order to have conditional expectation Ik. As a result, the conditional pdf of Xt=LVtEtLVt is given by
(4)
with pFRI(·) as defined in Equation (1), and where Vt follows the recursion in Equation (1) with lower triangular Cholesky decomposition LVt. The static parameters to be estimated by maximum likelihood now comprise Ω,A,B,μ, and ν, where Ω can be targeted as explained in Section 1.1. For a given set of static parameters, Vt can be obtained from Equation (1) for every t. From these, we can compute the Cholesky decompositions LVt for every t, which can finally be inserted into the expression for the pdf in Equation (4) to obtain the value of the likelihood.

1.3 Ordering of Variables

The order of the coordinates in Xt matters for the specification of Riesz and F-Riesz distributions. As mentioned before, this is well accepted in the Riesz literature due to the use of the Cholesky decomposition in the construction of the Riesz distribution. The order of the coordinates can be regarded as another variable that can be optimized over. For sufficiently small dimensions, enumeration of all possible orders is possible. However, such an approach quickly becomes unwieldy: for k =10, we already would have to estimate and compare more than 3.6M models. To approximate the optimal ordering, we therefore propose the following heuristic algorithm.

Algorithm 2.6 (Approximating the optimal ordering of variables in the system).

Let  o=(o1,,ok)  be a permutation of the first  k  integers, indicating the order of the variables in the system that make up the covariance matrix observations  Xt. Let  θ  denote the static parameters that characterize the model and that need to be estimated by maximum likelihood.

Step 0:Set j = 0.

Step 1:Select a random order  o(j)=(o1(j),,ok(j)).

Step 2:Given the ordering  o(j), estimate  θ  and obtain  θ^(j).

Step 3:Loop over asset i,i=1,,k:

  Step 3a: Find  i  such that  i=oi(j), that is, find the position of asset i in the current ordering  o(j).

  Step 3 b: Put asset i in each of the possible positions  1,,k, while keeping the order of the other variables as in  o(j), that is, consider the permutations

  (oi(j),o1(j),,oi1(j),oi+1(j),,ok(j)),(o1(j),oi(j),o2(j),,oi1(j),oi+1(j),,ok(j)), up to  (o1(j),,oi1(j),oi+1(j),,ok(j),oi(j)). For each re-ordering, re-estimate  θ  as  θ^(j,i)  and retain the ordering plus estimated  θ^(j,i)  that yields the highest log-likelihood value and store it as  o(j+1)  and  θ^(j+1).

  Step 3c: Increase j to j + 1.

  Step 3d: Continue the loop by proceeding to the next asset i + 1.

Step 4 (optional):Repeat steps 1–3  p1  times (or until convergence), possibly repeating the entire process for  p2  different random initial orderings. Retain the final order and estimate that yields the highest log-likelihood. Call this final order  o(opt)  with corresponding parameter estimate  θ^(opt).

The algorithm above ensures that the maximized likelihood never decreases when searching over different orderings. Moreover, the algorithm is relatively efficient for moderate dimensions since it limits the number of times we re-estimate θ. The latter is costly due to the required non-linear optimization. The algorithm re-estimates θ about p1p2k2 times, which is substantially smaller than the full k! enumerated possibilities, thus providing a substantial computational gain. Though no guarantee is given that we arrive at the true optimum using this heuristic algorithm, the simulation evidence in the next section shows that even without using Step 4 non-negligible likelihood increases can be obtained. Also, the algorithm typically lands close to the correct ordering of the variables in terms of rank correlations.

2 Theory and Simulation Evidence

In Section 2.1, we establish the invertibility of the conditional autoregressive F-Riesz (CAFr) filter as defined in Equations (1) and (4) for the dynamic parameters Vt. In addition, we prove the consistency of the MLE of the static parameters Ω,A,B,μ, and ν. In Section 2.2, we then study the performance of the MLE and of the heuristic algorithm introduced in Section 1 in a simulated setting.

2.1 Filter Invertibility and MLE Consistency

To establish the consistency of the MLE for the unknown static parameter θ of the CAFr model,4 we follow the usual two-step targeting approach that is typically found in empirical work and described in Section 1.1. We first estimate Ω using a simple sample mean of Xt. Next, fixing this estimate of Ω, we estimate the remaining parameters by non-linear maximum likelihood optimization.

We make the following assumptions for consistent estimation of Ω. 

Assumption 3.1.

The sequence  {Xt}t=1,,T  is generated by (4) and (1) for some  (Ω0,A0,B0,μ0,ν0)  for every  t=1,,T.

 
Assumption 3.2.

Ω0  is positive definite, μ0,j>K+1j, ν0,i>i+1i, A0>0, B00, and  |A0+B0|<1.

Assumption 3.1 is a standard assumption on correct specification. Assumption 3.2 then allows us to establish the stationarity and ergodicity of the model as a data generating process. The strong consistency of the sample average Ω^=T1t=1TXt to Ω0 then follows by an application of the ergodic theorem. 

Proposition 3.3.

Let  Assumptions 3.13.2  hold. Then  Ω^=T1t=1TXta.s.Ω0  as  T.

The consistent estimate of Ω can be used as a plug-in for a targeted estimation approach for the remaining static parameters of the model.

We now turn to the invertibility of the filtering Equation (1). To do so, we first introduce some new notation. Let V^t(θ) denote the filtered sequence from (1), initialized at some point V^1, and evaluated at some parameter vector θΘ. Following the literature (e.g., Straumann and Mikosch 2006; Wintenberger 2013), invertibility ensures that the filter “forgets” the possibly incorrect initialization; that is, the filtered sequence {V^t(θ)}tN converges path-wise and exponentially fast to a unique stationary and ergodic limit sequence {Vt(θ)}tN. This means that for every θ in the parameter space Θ there is a c >1 such that ct||V^t(θ)Vt(θ)||a.s.0 as t, regardless of the initialization V^1. We also write V^t=V^t(θ0) and Vt=Vt(θ0), such that the filter asymptotically recovers the true Vt series from the data generating process if the filter is evaluated at the true static parameter θ0.

In the current setting, filter invertibility can be obtained by ensuring that the following conditions hold: (i) stationarity of the data {Xt}tZ; (ii) a logarithmic bounded moment for Xtt; (iii) a contraction condition for the filtering equation. The stationarity of the data in (i), and the logarithmic moment in (ii) follow directly from Assumptions 3.1 and 3.2. The contraction condition for the filtering equation, however, requires additional restrictions on the parameter space Θ. Assumption 3.4 ensures that the filtered V^t(θ) matrices are positive definite and that the stochastic filtering equation is contracting in the appropriate sense. It also ensures identification of the ordering of the variables in the system by requiring the degrees of freedom parameters to be different across coordinates i and j. 

Assumption 3.4.

The parameter space  Θ  is compact and satisfies  A0, B0, supB|B|<1, and  minjinfμj(μjK1)>0, miniinfνi(νii1)>0, μiμj  for  ij, and  (μi+νi)(μ+j+νj)  for  ij.

Proposition 3.5 now establishes the invertibility of the initialized filter V^t(θ) for its stationary and ergodic limit Vt(θ) and opens the door to the consistency of the MLE. 

Proposition 3.5.

Let  Assumptions 3.1-3.4  hold. Then the filter  {V^t(θ)}tN  is invertible.

We are now ready to formulate our consistency result of the MLE (A^T,B^T,μ^T,ν^T). This MLE takes the form of a targeted two-step estimator as it depends on the first-step estimator for Ω0. As is common for filtering models, the log-likelihood depends directly on the properties of the filter V^t(θ), which is itself a function of the estimated Ω^T, the parameters A and B, and the initialization V^1 as noted above. To make this clearer in the notation, we write explicitly V^t(Ω^T,A,B) rather than V^t(θ). In addition, we write o^T as the ordering of coordinates that maximizes the log-likelihood. Putting all elements together, we define the MLE as the maximizer of the plug-in log-likelihood logp˜FRI(Xt;V^t(Ω^T,A,B),μ,ν) for a specific ordering o,
(5)
 
Theorem 3.6.
Let  Assumptions 3.13.4  hold. Then for  T  the targeted MLE  (A^T,B^T,μ^T,ν^T,o^T)  defined in (5) satisfies  

Theorem 3.6 provides the consistency of the MLE. Though the simulation results in Table 1 suggest that the ML estimator for the static parameters can be well approximated by a normal distribution in finite samples, we instead choose to focus on the predictive performance of the model using Diebold–Mariano (DM) tests (Diebold and Mariano 1995) rather than on the behavior of the static parameters. The latter is typically deemed of less interest in dynamic parameter models such as the CAFr, where the focus is mostly on the filtered paths of V^t and the model’s predictive performance. Note that the consistency of the filtered paths V^t follows directly from the consistency of the MLE and the filter invertibility established earlier.

Table 1

Parameter estimations of CAFr model

Coef.TrueMeanStdMean (S.E.)
A0.16000.15960.00480.0044
B0.83000.82960.00490.0049
μ116.6416.691.1251.128
μ227.1527.071.5671.544
μ341.6141.522.2842.229
μ458.1858.033.0733.030
μ584.6784.124.4024.381
ν120.0520.451.5841.510
ν218.7218.940.9570.880
ν319.3619.570.7800.722
ν420.5920.850.7820.734
ν514.6114.770.5120.442
Coef.TrueMeanStdMean (S.E.)
A0.16000.15960.00480.0044
B0.83000.82960.00490.0049
μ116.6416.691.1251.128
μ227.1527.071.5671.544
μ341.6141.522.2842.229
μ458.1858.033.0733.030
μ584.6784.124.4024.381
ν120.0520.451.5841.510
ν218.7218.940.9570.880
ν319.3619.570.7800.722
ν420.5920.850.7820.734
ν514.6114.770.5120.442

Notes: This table shows Monte Carlo averages and standard deviations of parameter estimates of simulated covariance matrices from the five-dimensional CAFr model of (1). Guided by empirical results, we set μ=(16.64,27.15,41.61,58.18,86.67) and ν=(20.05,18.72,19.36,20.59,14.61). We estimate Ω by targeting in a first step, while A, B, and the DoF parameters are estimated in a second step by maximum likelihood. The table reports the true values, the mean and standard deviation of the estimated coefficients, as well as the mean of the computed standard error using the inverse of the Hessian. Results are based on 1000 Monte Carlo replications.

Table 1

Parameter estimations of CAFr model

Coef.TrueMeanStdMean (S.E.)
A0.16000.15960.00480.0044
B0.83000.82960.00490.0049
μ116.6416.691.1251.128
μ227.1527.071.5671.544
μ341.6141.522.2842.229
μ458.1858.033.0733.030
μ584.6784.124.4024.381
ν120.0520.451.5841.510
ν218.7218.940.9570.880
ν319.3619.570.7800.722
ν420.5920.850.7820.734
ν514.6114.770.5120.442
Coef.TrueMeanStdMean (S.E.)
A0.16000.15960.00480.0044
B0.83000.82960.00490.0049
μ116.6416.691.1251.128
μ227.1527.071.5671.544
μ341.6141.522.2842.229
μ458.1858.033.0733.030
μ584.6784.124.4024.381
ν120.0520.451.5841.510
ν218.7218.940.9570.880
ν319.3619.570.7800.722
ν420.5920.850.7820.734
ν514.6114.770.5120.442

Notes: This table shows Monte Carlo averages and standard deviations of parameter estimates of simulated covariance matrices from the five-dimensional CAFr model of (1). Guided by empirical results, we set μ=(16.64,27.15,41.61,58.18,86.67) and ν=(20.05,18.72,19.36,20.59,14.61). We estimate Ω by targeting in a first step, while A, B, and the DoF parameters are estimated in a second step by maximum likelihood. The table reports the true values, the mean and standard deviation of the estimated coefficients, as well as the mean of the computed standard error using the inverse of the Hessian. Results are based on 1000 Monte Carlo replications.

We use the DM test based on two loss functions. First, we use the log-scoring rule dt=t1t2, where t1 and t2 are the log-likelihood contributions of two different model specifications. The test requires dt to be a finite variance martingale difference series under the null of equal model performance. The existence of a second moment of the log-likelihood for the F-Riesz distribution is easily obtained using similar arguments as for the consistency proof, where a bounded first moment of the log-likelihood was established. Second, we use a more economic perspective to compare the different models by constructing global minimum variance portfolios and comparing their ex-post portfolio variance performance using the DM test. See Section 3.4 for further details.

2.2 Simulation Experiment

This section presents the results of a Monte Carlo study for the statistical properties of the MLE of the conditional autoregressive F-Riesz (CAFr) model. We simulate from a k=5 dimensional version of the CAFr model with empirically relevant values for the static parameters. We do so 1000 times and for each simulated series estimate the static parameters of the model by MLE, as well as their standard errors.

Table 1 presents the results. We see that all parameters are estimated near their true values. This holds both for the dynamic parameters A and B, as well as for the degrees of freedom parameters μ and ν, underlining the consistency result from Section 2. We also note that the Monte Carlo standard deviation of the MLE across simulations (in the std column) is close to the average of the estimated standard errors using the inverse Hessian (in the mean(s.e.) column).

The second simulation experiment investigates the statistical gain of the F-Riesz distribution over the matrix F distribution. Guided by the empirical application, we focus on a five-dimensional F-Riesz I distribution with degrees of freedom vectors μ=(18.7,35.8,58.2,89.4,143.9) and ν=(22.8,24.3,28.6,22.3,18.2). We define μ¯=69.2 and ν¯=23.3 as the average values of the vectors μ and ν, respectively, and μrange=μμ¯ιk and νrange=νν¯ιk. The simulation experiment now consists of the following steps. First, we simulate 1000 matrices Xt from a FRI(Σ,μ˜,ν˜) with μ˜=μ¯ιk+λμrange and ν˜=ν¯ιk+λνrange for λ=(0,0.02,,0.08,0.10). Note that if λ = 0, the FRI distribution collapses to a matrix-F distribution with μ¯ and ν¯ degrees of freedom. Second, we estimate Σ (using the targeting approach) and the degrees of freedom parameters assuming a matrix F or FRI distribution. For each λ, we test the null-hypotheses μ=μ¯ιk and ν=ν¯ιk. This boils down to the Likelihood-Ratio test with 2×k2 degrees of freedom. We repeat this exercise 1000 times.

Table 2 shows the results. In Panel A, we see that if we simulate from a matrix-F distribution (ie λ = 0), the likelihood ratio test has been rejected in 8.4% of all cases. Further, when we deviate slightly from the matrix-F setting, we immediately reject the null hypothesis of a scalar μ and ν in all cases. Panel B lists that the correct matrix-F parameters are indeed estimated back on average. Also the average parameter estimates of the F-Riesz I correspond to the simulated values of 69.2 and 23.25.

Table 2

The matrix F versus the F-Riesz distributions

Panel A: Matrix F vs F-Riesz I
Λ00.020.040.060.080.10
Rejection rate0.0840.1260.3110.5940.8390.980
Panel A: Matrix F vs F-Riesz I
Λ00.020.040.060.080.10
Rejection rate0.0840.1260.3110.5940.8390.980
Panel B: DoF parameters when λ = 0
matrix-Fμ¯ν¯

True69.2023.25
Mean69.2523.33
Std5.720.63

F-Rieszμ1μ2μ3μ4μ5

True69.2069.2069.2069.2069.20
Mean69.6069.4769.5469.4469.42
Std7.266.526.296.055.83

ν1ν2ν3ν4ν5

True23.2523.2523.2523.2523.25
Mean23.3623.3223.3423.4023.42
Std0.990.910.991.171.52
Panel B: DoF parameters when λ = 0
matrix-Fμ¯ν¯

True69.2023.25
Mean69.2523.33
Std5.720.63

F-Rieszμ1μ2μ3μ4μ5

True69.2069.2069.2069.2069.20
Mean69.6069.4769.5469.4469.42
Std7.266.526.296.055.83

ν1ν2ν3ν4ν5

True23.2523.2523.2523.2523.25
Mean23.3623.3223.3423.4023.42
Std0.990.910.991.171.52

Notes: This table shows Monte Carlo results on the difference between the F-Riesz and the matrix-F distribution. Panel A lists results on simulating 1000 matrices from a FRI(Σ,μ˜,ν˜) distribution with μ˜=μ¯ιk+λμrange and ν˜=ν¯ιk+λνrange for λ=(0,0.02,,0.08,0.10) with μ¯=69.2,ν¯=23.3,μrange=μμ¯ιk and νrange=νν¯ιk and μ=(18.7,35.8,58.2,89.4,143.9) and ν=(22.8,24.3,28.6,22.3,18.2). We estimate the parameters assuming a matrix-F or FRI distribution. For each value of λ we perform a Likelihood-Ratio test on the null-hypothesis μ=μ¯ιk and ν=ν¯ιk. Panel A lists the percentage rejections of this hypothesis for different values of λ. Further, Panel B reports results on the estimated degrees-of-freedom parameters of the matrix-F and/or F-Riesz I distribution for the case λ = 0. The panel reports the true values, the mean, and the standard deviation of the estimated coefficients. All results are based on 1000 Monte Carlo replications.

Table 2

The matrix F versus the F-Riesz distributions

Panel A: Matrix F vs F-Riesz I
Λ00.020.040.060.080.10
Rejection rate0.0840.1260.3110.5940.8390.980
Panel A: Matrix F vs F-Riesz I
Λ00.020.040.060.080.10
Rejection rate0.0840.1260.3110.5940.8390.980
Panel B: DoF parameters when λ = 0
matrix-Fμ¯ν¯

True69.2023.25
Mean69.2523.33
Std5.720.63

F-Rieszμ1μ2μ3μ4μ5

True69.2069.2069.2069.2069.20
Mean69.6069.4769.5469.4469.42
Std7.266.526.296.055.83

ν1ν2ν3ν4ν5

True23.2523.2523.2523.2523.25
Mean23.3623.3223.3423.4023.42
Std0.990.910.991.171.52
Panel B: DoF parameters when λ = 0
matrix-Fμ¯ν¯

True69.2023.25
Mean69.2523.33
Std5.720.63

F-Rieszμ1μ2μ3μ4μ5

True69.2069.2069.2069.2069.20
Mean69.6069.4769.5469.4469.42
Std7.266.526.296.055.83

ν1ν2ν3ν4ν5

True23.2523.2523.2523.2523.25
Mean23.3623.3223.3423.4023.42
Std0.990.910.991.171.52

Notes: This table shows Monte Carlo results on the difference between the F-Riesz and the matrix-F distribution. Panel A lists results on simulating 1000 matrices from a FRI(Σ,μ˜,ν˜) distribution with μ˜=μ¯ιk+λμrange and ν˜=ν¯ιk+λνrange for λ=(0,0.02,,0.08,0.10) with μ¯=69.2,ν¯=23.3,μrange=μμ¯ιk and νrange=νν¯ιk and μ=(18.7,35.8,58.2,89.4,143.9) and ν=(22.8,24.3,28.6,22.3,18.2). We estimate the parameters assuming a matrix-F or FRI distribution. For each value of λ we perform a Likelihood-Ratio test on the null-hypothesis μ=μ¯ιk and ν=ν¯ιk. Panel A lists the percentage rejections of this hypothesis for different values of λ. Further, Panel B reports results on the estimated degrees-of-freedom parameters of the matrix-F and/or F-Riesz I distribution for the case λ = 0. The panel reports the true values, the mean, and the standard deviation of the estimated coefficients. All results are based on 1000 Monte Carlo replications.

In Supplementary Appendix E, we present further simulation results showing that (i) the targeting approach and full estimation approach for Ω perform similarly well; (ii) all distributions (from Wishart to F-Riesz) exhibit a similar quality of the MLE in finite samples if that distribution is correctly specified; (iii) the heuristic approach for ordering the variables results in further likelihood increases; (iv) even running the heuristic once already closes much of the gap with the optimal ordering in small dimensional settings; (v) the rank correlations between the true ordering and the ordering found by the heuristic algorithm is 99% on average, and 90% of the top 5 ranks are correctly identified in the k=15 dimensional case. We thus feel confident to proceed with our empirical application.

3 Empirical Application

3.1 Data and Setup

In this section, we apply the F-Riesz distribution to an empirical data set of 45 U.S. equities from the S&P 500 index over the period January 2, 2001, until December 6, 2019, a total of 4,696 trading days. We extract transaction prices from the Trade and Quote database and clean the high-frequency data in line with Brownlees and Gallo (2006) and Barndorff-Nielsen et al. (2009). After this cleaning procedure, we construct realized covariance matrices Xt using 5-minute returns. We refer to Supplementary Appendix F for more information about the data (Tickers) and the cleaning procedure.

We consider six different matrix distributions with a time-varying mean Vt for the realized covariance matrices: the Wishart, the Riesz, the inverse Wishart, the Inverse Riesz, the Matrix-F, and the F-Riesz distribution. We also allow our dynamic F-Riesz model from Equations (1) and (4) to include HAR-type dynamics by considering an extension of Equation (1) to
(6)
with Xtw=(1/5)i=04Xti and Xtm=(1/22)i=021Xti, respectively. We use the two-step targeting approach from Section 1.1 to estimate Ω, and the algorithm from 2.3 with p1=p2=1 to determine the ordering of the variables in the CAFr model. We use the same ordering for the other models.

3.2 Full Sample Results

Table 3 and Figure 2 report the results for the full sample. For each dimension (k=5,15), we randomly choose stocks (without replacement) from our pool of 45 assets. We only present parameter estimates for models with dynamics as in Equation (1) and compare their fit graphically afterward to models with the HAR specification from Equation (6).

Graphical impression of the improvement in fit of several matrix distributions compared to the Wishart distribution. The F-Riesz distribution improves the most upon the Wishart distribution, regardless the model specification or the dimension.
Figure 2

AIC improvements.

Notes: The figure shows the difference between the AIC of the Wishart and that of the other distributions for the models in Table 3 (black bars) and their HAR extension of (6) (white bars). The left panel of the graph depicts results of a random initial ordering of the constructed realized covariance matrices. The right panel is based on the optimized ordering using the algorithm from Section 1.3.

Table 3

Parameter estimates, likelihoods, and information criteria

DistributionABμminμmaxνminνmaxLAICNo. para
Panel A: XOM/PG/WMT/PFE/MCD

Wishart0.2810.69116.09−517810,3623
(0.004)(0.004)(0.075)
Riesz0.2600.7138.5319.57−394278987
(0.004)(0.004)(0.170)(0.157)
i-Wishart0.1950.79118.631162−23193
(0.003)(0.004)(0.067)
i-Riesz0.1900.79613.0920.451493−29717
(0.003)(0.004)(0.223)(0.155)
F0.2150.77056.9026.312229−44504
(0.003)(0.004)(1.350)(0.277)
F-Riesz0.1660.82315.4982.7413.6322.857220−14,41612
(0.003)(0.003)(0.460)(1.681)(0.174)(0.485)

Panel B: JPM/GE/HON/BA/IBM/XOM/CAT/HD/PG/KO/AA/WMT/AXP/MCD/PFE

Wishart0.1760.81027.8396,115−192,2243
(0.001)(0.001)(0.041)
Riesz0.1550.8306.2736.34113,988−227,94217
(0.001)(0.001)(0.123)(0.149)
i-Wishart0.1000.89532.10143,001−285,9953
(0.001)(0.001)0.03
iRiesz0.0950.89910.5735.34151,316−302,59917
(0.001)(0.001)(0.170)(0.140)
F0.1160.87978.2546.26159,642−319,2764
(0.001)(0.001)(0.473)0.14
F-Riesz0.0870.90812.19105.214.0946.60185,170−370,27532
(0.001)(0.001)(0.260)(0.702)(0.271)(0.539)
DistributionABμminμmaxνminνmaxLAICNo. para
Panel A: XOM/PG/WMT/PFE/MCD

Wishart0.2810.69116.09−517810,3623
(0.004)(0.004)(0.075)
Riesz0.2600.7138.5319.57−394278987
(0.004)(0.004)(0.170)(0.157)
i-Wishart0.1950.79118.631162−23193
(0.003)(0.004)(0.067)
i-Riesz0.1900.79613.0920.451493−29717
(0.003)(0.004)(0.223)(0.155)
F0.2150.77056.9026.312229−44504
(0.003)(0.004)(1.350)(0.277)
F-Riesz0.1660.82315.4982.7413.6322.857220−14,41612
(0.003)(0.003)(0.460)(1.681)(0.174)(0.485)

Panel B: JPM/GE/HON/BA/IBM/XOM/CAT/HD/PG/KO/AA/WMT/AXP/MCD/PFE

Wishart0.1760.81027.8396,115−192,2243
(0.001)(0.001)(0.041)
Riesz0.1550.8306.2736.34113,988−227,94217
(0.001)(0.001)(0.123)(0.149)
i-Wishart0.1000.89532.10143,001−285,9953
(0.001)(0.001)0.03
iRiesz0.0950.89910.5735.34151,316−302,59917
(0.001)(0.001)(0.170)(0.140)
F0.1160.87978.2546.26159,642−319,2764
(0.001)(0.001)(0.473)0.14
F-Riesz0.0870.90812.19105.214.0946.60185,170−370,27532
(0.001)(0.001)(0.260)(0.702)(0.271)(0.539)

Notes: This table reports maximum likelihood parameter estimates of the conditional autoregressive models (4)–(1), assuming a Wishart, Riesz, Inverse Wishart, Inverse Riesz, matrix-F, or F-Riesz distribution in (4). Data consist of realized covariance matrices with the optimal ordering based on the algorithm from 2.3 with p=1 on the CAFr model. Panels A and B list results for a randomly chosen subset of 5 and 15 different assets, respectively. Standard errors are provided in parentheses and based on the (sandwich) robust covariance matrix estimator. We report the likelihood L, the AIC, and the number of estimated parameters. The sample goes from January 2, 2001, until December 12, 2019 (T =4696 trading days).

Table 3

Parameter estimates, likelihoods, and information criteria

DistributionABμminμmaxνminνmaxLAICNo. para
Panel A: XOM/PG/WMT/PFE/MCD

Wishart0.2810.69116.09−517810,3623
(0.004)(0.004)(0.075)
Riesz0.2600.7138.5319.57−394278987
(0.004)(0.004)(0.170)(0.157)
i-Wishart0.1950.79118.631162−23193
(0.003)(0.004)(0.067)
i-Riesz0.1900.79613.0920.451493−29717
(0.003)(0.004)(0.223)(0.155)
F0.2150.77056.9026.312229−44504
(0.003)(0.004)(1.350)(0.277)
F-Riesz0.1660.82315.4982.7413.6322.857220−14,41612
(0.003)(0.003)(0.460)(1.681)(0.174)(0.485)

Panel B: JPM/GE/HON/BA/IBM/XOM/CAT/HD/PG/KO/AA/WMT/AXP/MCD/PFE

Wishart0.1760.81027.8396,115−192,2243
(0.001)(0.001)(0.041)
Riesz0.1550.8306.2736.34113,988−227,94217
(0.001)(0.001)(0.123)(0.149)
i-Wishart0.1000.89532.10143,001−285,9953
(0.001)(0.001)0.03
iRiesz0.0950.89910.5735.34151,316−302,59917
(0.001)(0.001)(0.170)(0.140)
F0.1160.87978.2546.26159,642−319,2764
(0.001)(0.001)(0.473)0.14
F-Riesz0.0870.90812.19105.214.0946.60185,170−370,27532
(0.001)(0.001)(0.260)(0.702)(0.271)(0.539)
DistributionABμminμmaxνminνmaxLAICNo. para
Panel A: XOM/PG/WMT/PFE/MCD

Wishart0.2810.69116.09−517810,3623
(0.004)(0.004)(0.075)
Riesz0.2600.7138.5319.57−394278987
(0.004)(0.004)(0.170)(0.157)
i-Wishart0.1950.79118.631162−23193
(0.003)(0.004)(0.067)
i-Riesz0.1900.79613.0920.451493−29717
(0.003)(0.004)(0.223)(0.155)
F0.2150.77056.9026.312229−44504
(0.003)(0.004)(1.350)(0.277)
F-Riesz0.1660.82315.4982.7413.6322.857220−14,41612
(0.003)(0.003)(0.460)(1.681)(0.174)(0.485)

Panel B: JPM/GE/HON/BA/IBM/XOM/CAT/HD/PG/KO/AA/WMT/AXP/MCD/PFE

Wishart0.1760.81027.8396,115−192,2243
(0.001)(0.001)(0.041)
Riesz0.1550.8306.2736.34113,988−227,94217
(0.001)(0.001)(0.123)(0.149)
i-Wishart0.1000.89532.10143,001−285,9953
(0.001)(0.001)0.03
iRiesz0.0950.89910.5735.34151,316−302,59917
(0.001)(0.001)(0.170)(0.140)
F0.1160.87978.2546.26159,642−319,2764
(0.001)(0.001)(0.473)0.14
F-Riesz0.0870.90812.19105.214.0946.60185,170−370,27532
(0.001)(0.001)(0.260)(0.702)(0.271)(0.539)

Notes: This table reports maximum likelihood parameter estimates of the conditional autoregressive models (4)–(1), assuming a Wishart, Riesz, Inverse Wishart, Inverse Riesz, matrix-F, or F-Riesz distribution in (4). Data consist of realized covariance matrices with the optimal ordering based on the algorithm from 2.3 with p=1 on the CAFr model. Panels A and B list results for a randomly chosen subset of 5 and 15 different assets, respectively. Standard errors are provided in parentheses and based on the (sandwich) robust covariance matrix estimator. We report the likelihood L, the AIC, and the number of estimated parameters. The sample goes from January 2, 2001, until December 12, 2019 (T =4696 trading days).

The results provide four main takeaways. First, the maximized log-likelihood values show that the model with the F-Riesz distribution performs better than all the other specifications, including the Riesz distribution.5 This is most clearly seen in Figure 2, which shows the AIC improvements of all models compared to the Wishart specification for both the original (1) and the extended HAR specification (6). The gain of the F-Riesz specification increases substantially with the dimension of the system as can be seen from the scales of the vertical axes of the different panels. For example, the difference between the F-Riesz and the matrix-F distribution equals 5000 and 25,000 log-likelihood points for dimensions 5 and 15, respectively. This increase suggests substantial heterogeneity and fatness of the tails. The AIC values underline that the large likelihood differences outweigh the increased number of parameters.

Second, tail heterogeneity and tail fatness both play an important role at all levels of the analysis. When relaxing the Wishart to the Riesz specification, the AIC improves substantially for all dimensions considered, irrespective of the ordering of the assets; see Figure 2. This underlines the importance of tail heterogeneity. The same holds when relaxing the inverse Wishart to the Inverse Riesz. Tail fatness is also clearly important: the AIC improvement for the matrix-F is large compared to the Wishart. With only two parameters, the matrix-F succeeds in having a similar or higher AIC as the Inverse Riesz, which needs k2 additional parameters compared to the matrix-F. This is the more interesting result given that the matrix-F already heavily outperforms the Wishart, inverse Wishart, Riesz, and to a lesser extent also the Inverse Riesz distributions. Including tail heterogeneity in the matrix-F by using the F-Riesz distribution provides a further substantial gain in likelihood and AIC. Tail heterogeneity thus appears important for both the thin and fat-tailed distributional specifications.

Third, the importance of allowing for tail heterogeneity is confirmed by looking at the estimates of the degrees of freedom parameters. To save space, the table only reports the minima and maxima of the elements of μ and ν. Still, the picture is clear. For example, the estimate of μ in Panel A for the matrix-F is around 55, while the elements of μ of the F-Riesz distribution vary from around 16–83. The pattern persists for the other panels in the table, as well as for the ν parameters. The Riesz and F-Riesz distributions also solve an empirical puzzle for the (Inverse) Wishart and matrix-F distributions. As we can see in Table 3, rising the dimension of the system from 5 to 15 increases the estimated degrees of freedom for the (Inverse) Wishart and matrix-F. We can understand this by looking at the spreads of μ and ν for the F-Riesz distribution. These reveal that the tail fatness (low μ and ν values) persists across dimensions, as μmin and νmin remain relatively constant across panels A and B. By contrast, μmax and νmax increase if we consider more stocks, indicating that some of the realized volatilities exhibit thinner tail behavior. As the (Inverse) Wishart/matrix-F can only accommodate this by using some sort of average degrees of freedom across all assets due to their one or two parameter set-up, the degrees of freedom for these two distributions increases empirically when increasing the number of assets. By contrast, the F-Riesz (and also the (Inverse) Riesz) distributions do not show this behavior.

Fourth, we see that the heterogeneity biases discussed above spill over into biases in the estimated persistence of Xt. The B of the F-Riesz distribution is higher across all dimensions than that of the other models, while its A parameter is lower. This results in a much smoother pattern of Vt for the F-Riesz distribution. Again, this stems from the accumulation of two effects: fat tails of Xt, and tail heterogeneity. Fatter tails for Xt in the model imply the dynamics of Vt react less violently to incidental outliers in Xt, similar to the effect of using a Student’s t distribution in a GARCH model. This explains why the F-Riesz results in more persistence than the Riesz or Inverse Riesz. The second effect is that of tail heterogeneity. Because the (Inverse) Wishart and matrix-F only have one or two degrees of freedom parameters, they fail to describe the heavy-tailed behavior in some of the realized volatilities. Empirically, this typically leads to a lower estimated persistence due to the more frequent unexpected occurrence of incidental large observations. As a result, the F-Riesz and Riesz have a higher persistence B compared to the matrix-F and Wishart, respectively.

3.3 Out-of-sample Setup and Metrics

We also apply our new model in an out-of-sample exercise. First, we calculate daily, weekly, and biweekly point forecasts for Vt. Second, we forecast the (joint) density of the realized covariance matrices. Third, we conduct an economic application by considering Global Minimum Variance Portfolios (GMVP) as in for example Engle, Ledoit, and Wolf (2019). All these exercises directly depend on one-step and multi-step-ahead forecasts of Vt.

We use a moving-window approach in the forecasting exercise with an in-sample period of 1000 observations. This corresponds roughly to four calendar years. To avoid that the results are driven by a particular selection of stocks, we choose three disjoint sets of stocks for each of the settings k=5,15.

The out-of-sample period contains P =3, 696 observations including the Great Financial Crisis and the European Sovereign Debt crisis. The period, therefore, provides an important test for the robustness of the model. We re-estimate the models after every 250 observations, which roughly corresponds to updating the parameters annually. As we show later on, though optimizing the ordering of the variables empirically leads to some gains in likelihood, there are no substantial gains in point prediction quality compared to a single random ordering. We therefore consider only one random initial ordering for the main results.

Based on Table 3, we consider a subset of five distributions: the classical Wishart distribution as a benchmark, the Riesz, the Inverse Riesz, the matrix-F, and the F-Riesz distribution. Moreover, we include the EWMA filter with λ=0.96, the DCC-GARCH model (Engle 2002), and the HAR-DRD model of Oh and Patton (2016) as further benchmark models.

Given a predicted Vt+1 and the true realized covariance matrix Xt+1, we evaluate the point forecasts by the Frobenius norm and the QLIK loss function:
(7)
 
(8)

We use the model confidence set (MCS) of Hansen, Lunde, and Nason (2011) at a significance level of 5% to select the best models. The MCS automatically accounts for the dependence between model outcomes, given that all models are based on the same data.

Turning toward the density forecasts, we use the log scoring rule (see Mitchell and Hall 2005; Amisano and Giacomini 2007) to differentiate between the density forecasts of the different models. Define the difference in log score between the two density forecasts M1 and M2 corresponding to the realized covariance matrix Xt+1 as
(9)
for t=R+1,T, with R =1000 the length of the rolling estimation window and Sls,t+1(Xt+1,Mj) (j =1, 2) the log score of the density forecast corresponding to model Mj at time t +1,
(10)
where pt+1(·) is one of the densities discussed. For multi-step ahead density forecasts, we consider the joint density h=1Hpt+h(Xt+h|Vt+h,Ft+h1,Mj) such that the log score boils down to Sls,t+1:t+H(Xt+1:Xt+H,Mj)=h=1Hlogpt+h(Xt+h|Vt+h,Ft+h1,Mj). The null hypothesis of equal predictive ability is given by H0:E[dls]=0 for all T—R out-of-sample forecasts. Similar to the evaluation of the point forecasts, we again use the MCS with a 5% significance level to test equal predictive ability.
The GMVP application is motivated by the mean-variance optimization setting of Markowitz (1952). Assuming that the investor aims at minimizing the 1-step-ahead portfolio variance at time t subject to a fully invested portfolio, we have the quadratic optimization problem
(11)

We assess the predictive ability of the different models by comparing the results to the ex-post portfolio volatility σp,t+1=(wt+1|tXt+1wt+1|t)1/2 using the MCS.

Alongside the GMVP’s volatility, we also calculate several other relevant quantities, such as portfolio turnover (TOt), concentration (COt), and the total short position (SPt) for each competing model at time t. A model that produces more stable covariance matrix forecasts implies in general less turnover and hence less transaction costs. This effect would lead to a gain in trading strategies. We follow Bollerslev, Patton, and Quaedvlieg (2018) and assume that there is a fixed transaction cost c. The total turnover at time t is defined as
(12)
where wi,t|t1 is the i-th element of the weight vector wt|t1 and ri,t the return of asset i at time t. It measures the value of the change in portfolio holdings when rebalancing the portfolio to its new optimal position from time t to t +1. For given proportional transaction costs c, the portfolio return net of transaction costs then equals
(13)
with c equal to 0, 1, or 2 percent, respectively.
Portfolio concentration and total portfolio short position both measure the amount of extreme portfolio allocations. Again, more stable forecasts of Vt+1 should result in less extreme portfolio weights. The portfolio concentration (COt) and short position (SPt) are defined as
(14)
with I[·] an indicator function that takes the value 1 if the i-the element of the weight vector is negative.
Finally, we follow Bollerslev, Patton, and Quaedvlieg (2018) and also evaluate the economic significance of different forecasting models by considering the utility-based framework of Fleming, Kirby, and Ostdiek (2001, 2003). This framework is based on the assumption that an investor has a quadratic utility with a risk aversion parameter γ. Then the realized utility of the portfolio return based on the forecasted covariances from model j reads
(15)
Given two different models j and l, Δγ denotes the return an investor with risk aversion parameter γ is willing to forfeit for model l to make her indifferent between models j to l. It can be obtained by solving
(16)

We test the null hypothesis Δγ=0 using the Reality Check of White (2000), based on the stationary bootstrap of Politis and Romano (1994) for 999 bootstrap samples with an average block length of 22 days.

Before presenting the main out-of-sample results, we first briefly investigate the effect of the ordering of the variables on point forecast performance. We do so for two sets of dimension 5. Table 4 shows the results for three settings: an arbitrary ordering, the optimal ordering over all 120 possibilities, and a time-varying ordering scheme based on Algorithm 2.6 implemented every 250 observations in line with the re-estimation of the model’s static parameters. The short summary of the table is that all approaches behave similarly in terms of the Frobenius norm: none of the models falls outside the MCS based on the simulated p-values. The same can be said in terms of the QLIK if we consider the left-panel of the table. Only in case of the assets AA, AXP, BA, CAT, and GE, the original and time-varying ordering scheme do not belong to the confidence set. We conclude that the optimal ordering does not automatically lead to better point predictions. In addition, varying the ordering over time also does not seem to lead to significant improvements in point forecasts. In the remainder of the analysis, we, therefore, stick to one specific order for the model comparisons (as indicated in the tables), keeping in mind that the results might be improved somewhat further in favor of the F-Riesz and Riesz models by re-ordering the variables.

Table 4

Ordering and out-of-sample point forecasts

MCD/PFE/PG/WMT/XOM
AA/AXP/BA/CAT/GE
OriginalOptimalTV ordOriginalOptimalTV ord
Frob5.6565.6485.6582.2802.2782.283
mcs p-value(0.36)(1.00)(0.31)(0.37)(1.00)(0.12)
QLIK7.4777.4807.4784.3644.3614.366
mcs p-value(1.00)(0.06)(0.44)(0.00)(1.00)(0.00)
MCD/PFE/PG/WMT/XOM
AA/AXP/BA/CAT/GE
OriginalOptimalTV ordOriginalOptimalTV ord
Frob5.6565.6485.6582.2802.2782.283
mcs p-value(0.36)(1.00)(0.31)(0.37)(1.00)(0.12)
QLIK7.4777.4807.4784.3644.3614.366
mcs p-value(1.00)(0.06)(0.44)(0.00)(1.00)(0.00)

Notes: This table shows the mean of the Frobenius norm and QLIK values based on daily predictions of the (5×5) covariance matrix of two sets of five stocks according to the Conditional Autoregressive model assuming a F-Riesz (FR) distribution using three different approaches. The first approach takes simply the original ordering created by the construction of the realized covariance matrix. The second ordering corresponds with the lowest Frobenius Norm. Finally, the third approach uses a time-varying ordering scheme (TV ord), by applying Heuristic 2.6 iteratively after 250 observations. Parameters are estimated with a moving window of 1,000 observations and re-estimated after 250 observations. The lowest value of the Frobenius norm and QLIK across the models are marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Table 4

Ordering and out-of-sample point forecasts

MCD/PFE/PG/WMT/XOM
AA/AXP/BA/CAT/GE
OriginalOptimalTV ordOriginalOptimalTV ord
Frob5.6565.6485.6582.2802.2782.283
mcs p-value(0.36)(1.00)(0.31)(0.37)(1.00)(0.12)
QLIK7.4777.4807.4784.3644.3614.366
mcs p-value(1.00)(0.06)(0.44)(0.00)(1.00)(0.00)
MCD/PFE/PG/WMT/XOM
AA/AXP/BA/CAT/GE
OriginalOptimalTV ordOriginalOptimalTV ord
Frob5.6565.6485.6582.2802.2782.283
mcs p-value(0.36)(1.00)(0.31)(0.37)(1.00)(0.12)
QLIK7.4777.4807.4784.3644.3614.366
mcs p-value(1.00)(0.06)(0.44)(0.00)(1.00)(0.00)

Notes: This table shows the mean of the Frobenius norm and QLIK values based on daily predictions of the (5×5) covariance matrix of two sets of five stocks according to the Conditional Autoregressive model assuming a F-Riesz (FR) distribution using three different approaches. The first approach takes simply the original ordering created by the construction of the realized covariance matrix. The second ordering corresponds with the lowest Frobenius Norm. Finally, the third approach uses a time-varying ordering scheme (TV ord), by applying Heuristic 2.6 iteratively after 250 observations. Parameters are estimated with a moving window of 1,000 observations and re-estimated after 250 observations. The lowest value of the Frobenius norm and QLIK across the models are marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

3.4 Out-of-sample Results

We now turn to the discussion of our point and density forecasts, followed by the results of the GMV forecasts. Given that we have three data sets of dimensions 5 and 15, we have many results. For reasons of space, we only present results for one selection of 5 and 15 assets. The remaining results can be found in Supplementary Appendix G and do not change the main conclusions.

We first consider the point forecasts. For point forecasts, it is not clear a priori why a good model for tail behavior would improve the point forecasts. Tables 5 and 6 show the mean of the Frobenius norm and QLIK values with the associated MCS p-values for daily, weekly, and biweekly covariance forecasts for the first set of 5 and 15 assets. Supplementary Appendix Tables G.1–G.4 show the results for the other two sets of 5 and 15 assets. All tables show a similar pattern. For 1-step-ahead forecasts, the HAR-DRD model is superior. For the Frobenius norm, it is often accompanied in the MCS for k=5 by the conditional autoregressive models with one of the matrix distributions and HAR dynamics, illustrating that the point forecasts for the different models are comparable 1-day-ahead. This is not surprising as the parameters of the HAR-DRD are estimated by minimizing the Frobenius norm. The HAR-DRD model loses its superiority for point forecasts, however, for weekly and biweekly point forecasts. Here, the F-Riesz distribution always belongs to the MCS, either with the classical dynamic specification in Equation (1) or with the HAR specification from Equation (4). This does not hold for other matrix distributions, such as the Riesz distribution. Finally, we see that the forecasts of the EWMA and DCC-GARCH forecast are relatively worse than the other models as they are rarely within the MCS.

Table 5

Out-of-sample point forecasts (dimension 5, set I)

CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob2.2742.2762.2752.2742.2802.6272.6202.2502.2522.2482.2462.2532.231
mcs p-value(0.02)(0.01)(0.01)(0.05)(0.01)(0.01)(0.00)(0.43)(0.37)(0.43)(0.66)(0.34)(1.00)
QLIK4.3544.3534.3584.3564.3644.5254.9114.3414.3414.3454.3434.3514.325
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.07)(0.07)(0.02)(0.06)(0.00)(1.00)

Panel B: weekly forecasts

Frob9.3699.3149.2719.3039.29510.79911.2309.2599.2409.2299.2399.2889.507
mcs p-value(0.25)(0.61)(0.87)(0.61)(0.64)(0.11)(0.02)(0.79)(0.87)(1.00)(0.87)(0.64)(0.61)
QLIK12.57812.57312.57212.57312.57412.67413.15112.56712.56412.56812.56912.57412.575
mcs p-value(0.11)(0.25)(0.33)(0.21)(0.21)(0.01)(0.00)(0.42)(1.00)(0.42)(0.42)(0.21)(0.42)

Panel C: biweekly forecasts

Frob18.93718.72318.54718.67618.55720.81421.90619.27319.25218.73818.83018.78218.953
mcs p-value(0.05)(0.39)(1.00)(0.39)(0.92)(0.01)(0.01)(0.01)(0.04)(0.67)(0.34)(0.39)(0.39)
QLIK16.17516.16516.16016.16516.16216.21816.71816.16216.15816.15016.15316.15516.173
mcs p-value(0.10)(0.30)(0.32)(0.27)(0.32)(0.06)(0.01)(0.32)(0.32)(1.00)(0.32)(0.32)(0.32)
CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob2.2742.2762.2752.2742.2802.6272.6202.2502.2522.2482.2462.2532.231
mcs p-value(0.02)(0.01)(0.01)(0.05)(0.01)(0.01)(0.00)(0.43)(0.37)(0.43)(0.66)(0.34)(1.00)
QLIK4.3544.3534.3584.3564.3644.5254.9114.3414.3414.3454.3434.3514.325
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.07)(0.07)(0.02)(0.06)(0.00)(1.00)

Panel B: weekly forecasts

Frob9.3699.3149.2719.3039.29510.79911.2309.2599.2409.2299.2399.2889.507
mcs p-value(0.25)(0.61)(0.87)(0.61)(0.64)(0.11)(0.02)(0.79)(0.87)(1.00)(0.87)(0.64)(0.61)
QLIK12.57812.57312.57212.57312.57412.67413.15112.56712.56412.56812.56912.57412.575
mcs p-value(0.11)(0.25)(0.33)(0.21)(0.21)(0.01)(0.00)(0.42)(1.00)(0.42)(0.42)(0.21)(0.42)

Panel C: biweekly forecasts

Frob18.93718.72318.54718.67618.55720.81421.90619.27319.25218.73818.83018.78218.953
mcs p-value(0.05)(0.39)(1.00)(0.39)(0.92)(0.01)(0.01)(0.01)(0.04)(0.67)(0.34)(0.39)(0.39)
QLIK16.17516.16516.16016.16516.16216.21816.71816.16216.15816.15016.15316.15516.173
mcs p-value(0.10)(0.30)(0.32)(0.27)(0.32)(0.06)(0.01)(0.32)(0.32)(1.00)(0.32)(0.32)(0.32)

Notes: This table shows the mean of the Frobenius norm and QLIK values based on daily, weekly, and biweekly predictions of the (5×5) covariance matrix of 5 stocks (MCD/PFE/PG/WMT/XOM) according to the Conditional Autoregressive model with and without HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, DCC-GARCH, and the HAR-DRD (HDRD) models as benchmarks. Parameters are estimated with a moving window of 1,000 observations and re-estimated after 250 observations. Panel A shows results of 1-step ahead predictions, and Panels B and C list results of cumulative forecasts for a week (five-step ahead) and two weeks (10 steps ahead). The lowest value of the Frobenius norm and QLIK across the models are marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3,696 observations.

Table 5

Out-of-sample point forecasts (dimension 5, set I)

CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob2.2742.2762.2752.2742.2802.6272.6202.2502.2522.2482.2462.2532.231
mcs p-value(0.02)(0.01)(0.01)(0.05)(0.01)(0.01)(0.00)(0.43)(0.37)(0.43)(0.66)(0.34)(1.00)
QLIK4.3544.3534.3584.3564.3644.5254.9114.3414.3414.3454.3434.3514.325
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.07)(0.07)(0.02)(0.06)(0.00)(1.00)

Panel B: weekly forecasts

Frob9.3699.3149.2719.3039.29510.79911.2309.2599.2409.2299.2399.2889.507
mcs p-value(0.25)(0.61)(0.87)(0.61)(0.64)(0.11)(0.02)(0.79)(0.87)(1.00)(0.87)(0.64)(0.61)
QLIK12.57812.57312.57212.57312.57412.67413.15112.56712.56412.56812.56912.57412.575
mcs p-value(0.11)(0.25)(0.33)(0.21)(0.21)(0.01)(0.00)(0.42)(1.00)(0.42)(0.42)(0.21)(0.42)

Panel C: biweekly forecasts

Frob18.93718.72318.54718.67618.55720.81421.90619.27319.25218.73818.83018.78218.953
mcs p-value(0.05)(0.39)(1.00)(0.39)(0.92)(0.01)(0.01)(0.01)(0.04)(0.67)(0.34)(0.39)(0.39)
QLIK16.17516.16516.16016.16516.16216.21816.71816.16216.15816.15016.15316.15516.173
mcs p-value(0.10)(0.30)(0.32)(0.27)(0.32)(0.06)(0.01)(0.32)(0.32)(1.00)(0.32)(0.32)(0.32)
CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob2.2742.2762.2752.2742.2802.6272.6202.2502.2522.2482.2462.2532.231
mcs p-value(0.02)(0.01)(0.01)(0.05)(0.01)(0.01)(0.00)(0.43)(0.37)(0.43)(0.66)(0.34)(1.00)
QLIK4.3544.3534.3584.3564.3644.5254.9114.3414.3414.3454.3434.3514.325
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.07)(0.07)(0.02)(0.06)(0.00)(1.00)

Panel B: weekly forecasts

Frob9.3699.3149.2719.3039.29510.79911.2309.2599.2409.2299.2399.2889.507
mcs p-value(0.25)(0.61)(0.87)(0.61)(0.64)(0.11)(0.02)(0.79)(0.87)(1.00)(0.87)(0.64)(0.61)
QLIK12.57812.57312.57212.57312.57412.67413.15112.56712.56412.56812.56912.57412.575
mcs p-value(0.11)(0.25)(0.33)(0.21)(0.21)(0.01)(0.00)(0.42)(1.00)(0.42)(0.42)(0.21)(0.42)

Panel C: biweekly forecasts

Frob18.93718.72318.54718.67618.55720.81421.90619.27319.25218.73818.83018.78218.953
mcs p-value(0.05)(0.39)(1.00)(0.39)(0.92)(0.01)(0.01)(0.01)(0.04)(0.67)(0.34)(0.39)(0.39)
QLIK16.17516.16516.16016.16516.16216.21816.71816.16216.15816.15016.15316.15516.173
mcs p-value(0.10)(0.30)(0.32)(0.27)(0.32)(0.06)(0.01)(0.32)(0.32)(1.00)(0.32)(0.32)(0.32)

Notes: This table shows the mean of the Frobenius norm and QLIK values based on daily, weekly, and biweekly predictions of the (5×5) covariance matrix of 5 stocks (MCD/PFE/PG/WMT/XOM) according to the Conditional Autoregressive model with and without HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, DCC-GARCH, and the HAR-DRD (HDRD) models as benchmarks. Parameters are estimated with a moving window of 1,000 observations and re-estimated after 250 observations. Panel A shows results of 1-step ahead predictions, and Panels B and C list results of cumulative forecasts for a week (five-step ahead) and two weeks (10 steps ahead). The lowest value of the Frobenius norm and QLIK across the models are marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3,696 observations.

Table 6

Out-of-sample point forecasts (dimension 15, set I)

CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob10.41110.45610.53610.48610.60311.80112.37310.31110.35910.43110.37210.50410.150
mcs p-value(0.01)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.18)(0.01)(0.00)(0.01)(0.00)(1.00)
QLIK15.27815.29315.31015.28615.32015.56116.79315.23115.24815.26415.23415.28115.206
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.32)(0.01)(0.00)(0.32)(0.00)(1.00)

Panel B: weekly forecasts

Frob42.17042.48842.19741.88542.37247.95553.85942.12442.50342.53642.00842.69642.770
mcs p-value(0.64)(0.19)(0.64)(1.00)(0.43)(0.05)(0.00)(0.64)(0.14)(0.14)(0.67)(0.11)(0.64)
QLIK39.91939.93339.88639.87039.87639.99541.40539.87739.90139.89339.85739.89139.922
mcs p-value(0.01)(0.00)(0.06)(0.26)(0.21)(0.00)(0.00)(0.06)(0.01)(0.03)(1.00)(0.05)(0.03)

Panel C: biweekly forecasts

Frob84.34285.19083.28182.51783.18191.937106.85884.86285.59084.76183.39484.41885.784
mcs p-value(0.11)(0.02)(0.42)(1.00)(0.42)(0.01)(0.00)(0.04)(0.01)(0.03)(0.42)(0.10)(0.11)
QLIK50.67650.70250.61550.59050.58250.60752.05950.60150.64150.62350.57350.60950.646
mcs p-value(0.00)(0.00)(0.11)(0.67)(0.67)(0.67)(0.00)(0.45)(0.01)(0.03)(1.00)(0.18)(0.11)
CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob10.41110.45610.53610.48610.60311.80112.37310.31110.35910.43110.37210.50410.150
mcs p-value(0.01)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.18)(0.01)(0.00)(0.01)(0.00)(1.00)
QLIK15.27815.29315.31015.28615.32015.56116.79315.23115.24815.26415.23415.28115.206
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.32)(0.01)(0.00)(0.32)(0.00)(1.00)

Panel B: weekly forecasts

Frob42.17042.48842.19741.88542.37247.95553.85942.12442.50342.53642.00842.69642.770
mcs p-value(0.64)(0.19)(0.64)(1.00)(0.43)(0.05)(0.00)(0.64)(0.14)(0.14)(0.67)(0.11)(0.64)
QLIK39.91939.93339.88639.87039.87639.99541.40539.87739.90139.89339.85739.89139.922
mcs p-value(0.01)(0.00)(0.06)(0.26)(0.21)(0.00)(0.00)(0.06)(0.01)(0.03)(1.00)(0.05)(0.03)

Panel C: biweekly forecasts

Frob84.34285.19083.28182.51783.18191.937106.85884.86285.59084.76183.39484.41885.784
mcs p-value(0.11)(0.02)(0.42)(1.00)(0.42)(0.01)(0.00)(0.04)(0.01)(0.03)(0.42)(0.10)(0.11)
QLIK50.67650.70250.61550.59050.58250.60752.05950.60150.64150.62350.57350.60950.646
mcs p-value(0.00)(0.00)(0.11)(0.67)(0.67)(0.67)(0.00)(0.45)(0.01)(0.03)(1.00)(0.18)(0.11)

Notes: This table shows the mean of the Frobenius norm and QLIK values based on daily, weekly, and biweekly predictions of the (15×15) covariance matrix of 15 stocks (AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM) according to the Conditional Autoregressive model with and without HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, DCC-GARCH, and the HAR-DRD (HDRD) models as benchmarks. Parameters are estimated with a moving window of 1000 observations and re-estimated after 250 observations. Panel A shows results of 1-step ahead predictions, and Panels B and C list results of cumulative forecasts for a week (five-step ahead) and two weeks (10 steps ahead). The lowest value of the Frobenius norm and QLIK across the models are marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Table 6

Out-of-sample point forecasts (dimension 15, set I)

CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob10.41110.45610.53610.48610.60311.80112.37310.31110.35910.43110.37210.50410.150
mcs p-value(0.01)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.18)(0.01)(0.00)(0.01)(0.00)(1.00)
QLIK15.27815.29315.31015.28615.32015.56116.79315.23115.24815.26415.23415.28115.206
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.32)(0.01)(0.00)(0.32)(0.00)(1.00)

Panel B: weekly forecasts

Frob42.17042.48842.19741.88542.37247.95553.85942.12442.50342.53642.00842.69642.770
mcs p-value(0.64)(0.19)(0.64)(1.00)(0.43)(0.05)(0.00)(0.64)(0.14)(0.14)(0.67)(0.11)(0.64)
QLIK39.91939.93339.88639.87039.87639.99541.40539.87739.90139.89339.85739.89139.922
mcs p-value(0.01)(0.00)(0.06)(0.26)(0.21)(0.00)(0.00)(0.06)(0.01)(0.03)(1.00)(0.05)(0.03)

Panel C: biweekly forecasts

Frob84.34285.19083.28182.51783.18191.937106.85884.86285.59084.76183.39484.41885.784
mcs p-value(0.11)(0.02)(0.42)(1.00)(0.42)(0.01)(0.00)(0.04)(0.01)(0.03)(0.42)(0.10)(0.11)
QLIK50.67650.70250.61550.59050.58250.60752.05950.60150.64150.62350.57350.60950.646
mcs p-value(0.00)(0.00)(0.11)(0.67)(0.67)(0.67)(0.00)(0.45)(0.01)(0.03)(1.00)(0.18)(0.11)
CA specifications
EWMADCCCA-HAR specifications
HDRD
WRiRFFRWRiRFFR
Panel A: daily forecasts

Frob10.41110.45610.53610.48610.60311.80112.37310.31110.35910.43110.37210.50410.150
mcs p-value(0.01)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.18)(0.01)(0.00)(0.01)(0.00)(1.00)
QLIK15.27815.29315.31015.28615.32015.56116.79315.23115.24815.26415.23415.28115.206
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.32)(0.01)(0.00)(0.32)(0.00)(1.00)

Panel B: weekly forecasts

Frob42.17042.48842.19741.88542.37247.95553.85942.12442.50342.53642.00842.69642.770
mcs p-value(0.64)(0.19)(0.64)(1.00)(0.43)(0.05)(0.00)(0.64)(0.14)(0.14)(0.67)(0.11)(0.64)
QLIK39.91939.93339.88639.87039.87639.99541.40539.87739.90139.89339.85739.89139.922
mcs p-value(0.01)(0.00)(0.06)(0.26)(0.21)(0.00)(0.00)(0.06)(0.01)(0.03)(1.00)(0.05)(0.03)

Panel C: biweekly forecasts

Frob84.34285.19083.28182.51783.18191.937106.85884.86285.59084.76183.39484.41885.784
mcs p-value(0.11)(0.02)(0.42)(1.00)(0.42)(0.01)(0.00)(0.04)(0.01)(0.03)(0.42)(0.10)(0.11)
QLIK50.67650.70250.61550.59050.58250.60752.05950.60150.64150.62350.57350.60950.646
mcs p-value(0.00)(0.00)(0.11)(0.67)(0.67)(0.67)(0.00)(0.45)(0.01)(0.03)(1.00)(0.18)(0.11)

Notes: This table shows the mean of the Frobenius norm and QLIK values based on daily, weekly, and biweekly predictions of the (15×15) covariance matrix of 15 stocks (AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM) according to the Conditional Autoregressive model with and without HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, DCC-GARCH, and the HAR-DRD (HDRD) models as benchmarks. Parameters are estimated with a moving window of 1000 observations and re-estimated after 250 observations. Panel A shows results of 1-step ahead predictions, and Panels B and C list results of cumulative forecasts for a week (five-step ahead) and two weeks (10 steps ahead). The lowest value of the Frobenius norm and QLIK across the models are marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Next, we consider the models’ density forecast performances. In contrast to milder differences in the point forecasts, the results for the density forecasts exhibit much stronger differences between the different models. Table 7 (as well as Supplementary Appendix Table G.5) reports results on the daily, weekly, and biweekly density forecasts. The tables list the average log-score values together with the MCS p-values. The message from the table is clear: the F-Riesz distribution is superior in density forecasts against all other matrix distributions, irrespective of the forecast horizon. The gains are large, even with respect to the matrix-F distribution. Second, the HAR specification is most of the time better than the regular conditional autoregressive specification from Equation (1), except for the third data set of dimension 15 (see Supplementary Appendix Table G.5). We conclude that though the differences between the point forecasts of the different models are modest, the differences in density fits are clearly in favor of the F-Riesz model with HAR dynamics for Vt.6

Table 7

Out-of-sample log-scores

CA specifications
CA-HAR specifications
WRiRFFRWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM

Sls(1)0.7251.1922.5102.5153.3480.8271.2762.6932.6703.455
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(5)3.4815.82112.3912.4316.603.9916.24013.3113.2017.13
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)6.96611.6524.8224.8833.227.98412.4926.6526.4334.29
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)

Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM

Sls(1)34.6238.5048.6849.9954.8635.2438.9649.1150.7455.00
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(1)171.9191.3242.3249.0273.2175.0193.6244.4252.6273.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)343.7382.6484.7498.0546.5350.0387.2489.0505.3547.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
CA specifications
CA-HAR specifications
WRiRFFRWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM

Sls(1)0.7251.1922.5102.5153.3480.8271.2762.6932.6703.455
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(5)3.4815.82112.3912.4316.603.9916.24013.3113.2017.13
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)6.96611.6524.8224.8833.227.98412.4926.6526.4334.29
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)

Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM

Sls(1)34.6238.5048.6849.9954.8635.2438.9649.1150.7455.00
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(1)171.9191.3242.3249.0273.2175.0193.6244.4252.6273.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)343.7382.6484.7498.0546.5350.0387.2489.0505.3547.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)

Notes: This table shows the mean of log scores, defined in (10), based on daily, weekly, and biweekly ahead predictions of the covariance matrix, according to the Conditional Autoregressive model with and without HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz I (FR) distribution. More specifically, we show the average value of Stls(H)=h=1Hlogpt+h(Xt+h|Vt+h,Ft+h1) with H =1, 5, and 10, respectively. Parameters are estimated with a moving window of 1,000 observations and re-estimated after 250 observations. Panel A shows results of dimension 5, and Panel B lists results of dimension 15. The highest value of the log score across the models is marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Table 7

Out-of-sample log-scores

CA specifications
CA-HAR specifications
WRiRFFRWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM

Sls(1)0.7251.1922.5102.5153.3480.8271.2762.6932.6703.455
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(5)3.4815.82112.3912.4316.603.9916.24013.3113.2017.13
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)6.96611.6524.8224.8833.227.98412.4926.6526.4334.29
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)

Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM

Sls(1)34.6238.5048.6849.9954.8635.2438.9649.1150.7455.00
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(1)171.9191.3242.3249.0273.2175.0193.6244.4252.6273.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)343.7382.6484.7498.0546.5350.0387.2489.0505.3547.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
CA specifications
CA-HAR specifications
WRiRFFRWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM

Sls(1)0.7251.1922.5102.5153.3480.8271.2762.6932.6703.455
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(5)3.4815.82112.3912.4316.603.9916.24013.3113.2017.13
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)6.96611.6524.8224.8833.227.98412.4926.6526.4334.29
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)

Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM

Sls(1)34.6238.5048.6849.9954.8635.2438.9649.1150.7455.00
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(1)171.9191.3242.3249.0273.2175.0193.6244.4252.6273.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
Sls(10)343.7382.6484.7498.0546.5350.0387.2489.0505.3547.9
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)

Notes: This table shows the mean of log scores, defined in (10), based on daily, weekly, and biweekly ahead predictions of the covariance matrix, according to the Conditional Autoregressive model with and without HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz I (FR) distribution. More specifically, we show the average value of Stls(H)=h=1Hlogpt+h(Xt+h|Vt+h,Ft+h1) with H =1, 5, and 10, respectively. Parameters are estimated with a moving window of 1,000 observations and re-estimated after 250 observations. Panel A shows results of dimension 5, and Panel B lists results of dimension 15. The highest value of the log score across the models is marked in bold. In addition, we report the MCS p-values based on a 5% significance level. The p-values of the models within the model confidence set are marked in bold. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Finally, we evaluate the different models in terms of their GMV portfolio performance. To save space, we only present Tables 8 and 9 in the main text, reporting the results for 1-day and 10-days (2 weeks) ahead forecasts for the first set of 5 and 15 assets. The 5-day-ahead forecasts for these assets, as well as the 1-, 5-, and 10-day-ahead forecasts for the other sets of assets, can be found in Supplementary Appendix Tables G.6–G.12. We only show results for the HAR specifications as these were most often in the MCSs in Table 7. Each table first reports the annualized mean and ex-post (realized) standard deviation of the portfolio returns, with MCS p-values on the lowest ex-post realized portfolio volatility. Next, we report portfolio statistics such as the average turnover, concentration, and total short position. The second part of each table sheds light on the economic significance of using the CAFr HAR model against all other models using the utility-based framework. Bold positive values of Δγ indicate that an investor is willing to sacrifice an annual return of Δγ basis points to switch from a particular model to the CAFr HAR model.

Table 8

The GMV portfolio (daily)

EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.5735.8875.7605.3335.2895.1725.2105.228
σ^p10.20110.59310.16310.16110.15910.15210.15310.149
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
StDev ret11.69912.10411.69311.64411.63711.62811.63411.622
TO0.0250.1560.2400.1940.1830.1560.1660.140
CO0.5000.5500.5120.5100.5100.5070.5080.506
SP−0.005−0.011−0.005−0.007−0.007−0.007−0.007−0.006

c=0%Sharpe0.3910.4860.4930.4580.4540.4450.4480.450
Δ166−60−52−10−662
Δ1074−9−45−8−463
c=1%Sharpe0.3860.4540.4410.4160.4150.4110.4120.419
Δ137−56−2735108
Δ1045−5−20671010
c=2%Sharpe0.3800.4210.3890.3740.3750.3770.3760.389
Δ18−52−217161415
Δ10160519181416
EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.5735.8875.7605.3335.2895.1725.2105.228
σ^p10.20110.59310.16310.16110.15910.15210.15310.149
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
StDev ret11.69912.10411.69311.64411.63711.62811.63411.622
TO0.0250.1560.2400.1940.1830.1560.1660.140
CO0.5000.5500.5120.5100.5100.5070.5080.506
SP−0.005−0.011−0.005−0.007−0.007−0.007−0.007−0.006

c=0%Sharpe0.3910.4860.4930.4580.4540.4450.4480.450
Δ166−60−52−10−662
Δ1074−9−45−8−463
c=1%Sharpe0.3860.4540.4410.4160.4150.4110.4120.419
Δ137−56−2735108
Δ1045−5−20671010
c=2%Sharpe0.3800.4210.3890.3740.3750.3770.3760.389
Δ18−52−217161415
Δ10160519181416
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret7.0549.2538.1247.7057.7347.5847.6167.613
σ^p9.24810.0519.2749.2249.2239.2129.2129.212
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.66)(1.00)(0.66)
StDev ret10.68511.34410.75410.58710.58210.56910.57510.568
TO0.0550.2930.5020.2730.2470.1930.2140.170
CO0.4150.5040.4390.4310.4300.4260.4280.425
SP−0.104−0.195−0.127−0.119−0.117−0.112−0.115−0.110
c=0%Sharpe0.6600.8160.7550.7280.7310.7180.7200.720
Δ157−156−49−9−1230
Δ1068−79−31−7−1130
c=1%Sharpe0.6470.7500.6380.6630.6720.6720.6690.680
Δ128−12435177911
Δ1039−4852199912
c=2%Sharpe0.6340.6850.5200.5980.6130.6260.6180.639
Δ1−1−9311843271422
Δ1010−1613645281423
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret7.0549.2538.1247.7057.7347.5847.6167.613
σ^p9.24810.0519.2749.2249.2239.2129.2129.212
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.66)(1.00)(0.66)
StDev ret10.68511.34410.75410.58710.58210.56910.57510.568
TO0.0550.2930.5020.2730.2470.1930.2140.170
CO0.4150.5040.4390.4310.4300.4260.4280.425
SP−0.104−0.195−0.127−0.119−0.117−0.112−0.115−0.110
c=0%Sharpe0.6600.8160.7550.7280.7310.7180.7200.720
Δ157−156−49−9−1230
Δ1068−79−31−7−1130
c=1%Sharpe0.6470.7500.6380.6630.6720.6720.6690.680
Δ128−12435177911
Δ1039−4852199912
c=2%Sharpe0.6340.6850.5200.5980.6130.6260.6180.639
Δ1−1−9311843271422
Δ1010−1613645281423

Notes: This table shows portfolio statistics of the Global Minimum Variance portfolio, based on one-step-ahead predictions of the covariance matrix, according to the Conditional Autoregressive model with HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, the DCC-GARCH, and HAR-DRD models as benchmarks. We report the average annualized return and standard deviation (both using the true realized covariance matrix and daily returns), as well as the turnover (TO), portfolio concentration (CO), and short positions (SP). Lowest portfolio volatilities are marked in bold. We also list the p-value that belongs to the model confidence set approach (MCS) of lowest ex-post daily volatility using the true realized covariance matrix based on a 5% significance level. Bold p-values correspond to models that belong to the model confidence set. Finally, the table reports the economic gains of switching from each model listed in the column to the CAFr HAR model in annual basis points, Δγ, for various transaction cost levels c and risk aversion coefficients γ. A bold Δγ means significantly different from zero at the 5% level. Panel A shows results of dimension 5, and Panel B lists results of dimension 15. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Table 8

The GMV portfolio (daily)

EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.5735.8875.7605.3335.2895.1725.2105.228
σ^p10.20110.59310.16310.16110.15910.15210.15310.149
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
StDev ret11.69912.10411.69311.64411.63711.62811.63411.622
TO0.0250.1560.2400.1940.1830.1560.1660.140
CO0.5000.5500.5120.5100.5100.5070.5080.506
SP−0.005−0.011−0.005−0.007−0.007−0.007−0.007−0.006

c=0%Sharpe0.3910.4860.4930.4580.4540.4450.4480.450
Δ166−60−52−10−662
Δ1074−9−45−8−463
c=1%Sharpe0.3860.4540.4410.4160.4150.4110.4120.419
Δ137−56−2735108
Δ1045−5−20671010
c=2%Sharpe0.3800.4210.3890.3740.3750.3770.3760.389
Δ18−52−217161415
Δ10160519181416
EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.5735.8875.7605.3335.2895.1725.2105.228
σ^p10.20110.59310.16310.16110.15910.15210.15310.149
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(0.00)(1.00)
StDev ret11.69912.10411.69311.64411.63711.62811.63411.622
TO0.0250.1560.2400.1940.1830.1560.1660.140
CO0.5000.5500.5120.5100.5100.5070.5080.506
SP−0.005−0.011−0.005−0.007−0.007−0.007−0.007−0.006

c=0%Sharpe0.3910.4860.4930.4580.4540.4450.4480.450
Δ166−60−52−10−662
Δ1074−9−45−8−463
c=1%Sharpe0.3860.4540.4410.4160.4150.4110.4120.419
Δ137−56−2735108
Δ1045−5−20671010
c=2%Sharpe0.3800.4210.3890.3740.3750.3770.3760.389
Δ18−52−217161415
Δ10160519181416
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret7.0549.2538.1247.7057.7347.5847.6167.613
σ^p9.24810.0519.2749.2249.2239.2129.2129.212
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.66)(1.00)(0.66)
StDev ret10.68511.34410.75410.58710.58210.56910.57510.568
TO0.0550.2930.5020.2730.2470.1930.2140.170
CO0.4150.5040.4390.4310.4300.4260.4280.425
SP−0.104−0.195−0.127−0.119−0.117−0.112−0.115−0.110
c=0%Sharpe0.6600.8160.7550.7280.7310.7180.7200.720
Δ157−156−49−9−1230
Δ1068−79−31−7−1130
c=1%Sharpe0.6470.7500.6380.6630.6720.6720.6690.680
Δ128−12435177911
Δ1039−4852199912
c=2%Sharpe0.6340.6850.5200.5980.6130.6260.6180.639
Δ1−1−9311843271422
Δ1010−1613645281423
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret7.0549.2538.1247.7057.7347.5847.6167.613
σ^p9.24810.0519.2749.2249.2239.2129.2129.212
mcs p-value(0.00)(0.00)(0.00)(0.00)(0.00)(0.66)(1.00)(0.66)
StDev ret10.68511.34410.75410.58710.58210.56910.57510.568
TO0.0550.2930.5020.2730.2470.1930.2140.170
CO0.4150.5040.4390.4310.4300.4260.4280.425
SP−0.104−0.195−0.127−0.119−0.117−0.112−0.115−0.110
c=0%Sharpe0.6600.8160.7550.7280.7310.7180.7200.720
Δ157−156−49−9−1230
Δ1068−79−31−7−1130
c=1%Sharpe0.6470.7500.6380.6630.6720.6720.6690.680
Δ128−12435177911
Δ1039−4852199912
c=2%Sharpe0.6340.6850.5200.5980.6130.6260.6180.639
Δ1−1−9311843271422
Δ1010−1613645281423

Notes: This table shows portfolio statistics of the Global Minimum Variance portfolio, based on one-step-ahead predictions of the covariance matrix, according to the Conditional Autoregressive model with HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, the DCC-GARCH, and HAR-DRD models as benchmarks. We report the average annualized return and standard deviation (both using the true realized covariance matrix and daily returns), as well as the turnover (TO), portfolio concentration (CO), and short positions (SP). Lowest portfolio volatilities are marked in bold. We also list the p-value that belongs to the model confidence set approach (MCS) of lowest ex-post daily volatility using the true realized covariance matrix based on a 5% significance level. Bold p-values correspond to models that belong to the model confidence set. Finally, the table reports the economic gains of switching from each model listed in the column to the CAFr HAR model in annual basis points, Δγ, for various transaction cost levels c and risk aversion coefficients γ. A bold Δγ means significantly different from zero at the 5% level. Panel A shows results of dimension 5, and Panel B lists results of dimension 15. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Table 9

The GMV portfolio (biweekly)

EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.4145.9615.5395.4325.3855.2595.2975.319
σ^p10.58310.89210.60510.56810.56910.56410.56410.563
mcs p-value(0.01)(0.00)(0.01)(0.01)(0.01)(0.40)(0.40)(1.00)
StDev ret10.38310.60010.30710.34110.34810.32710.32610.317
TO0.0310.1100.1520.1080.1060.0890.0920.083
CO0.5000.5370.5060.5010.5010.5000.5000.500
SP−0.005−0.009−0.003−0.005−0.005−0.005−0.005−0.005

c=0%Sharpe0.4250.5620.5370.5250.5200.5090.5130.516
Δ1182−122−47−22−12125
Δ10195−68−49−17−7146
c=1%Sharpe0.4240.5600.5340.5230.5180.5070.5110.514
Δ1180−121−43−21−11135
Δ10192−66−45−16−5157
c=2%Sharpe0.4240.5570.5300.5200.5150.5050.5090.512
Δ1177−120−40−20−10135
Δ10190−65−42−15−4157
EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.4145.9615.5395.4325.3855.2595.2975.319
σ^p10.58310.89210.60510.56810.56910.56410.56410.563
mcs p-value(0.01)(0.00)(0.01)(0.01)(0.01)(0.40)(0.40)(1.00)
StDev ret10.38310.60010.30710.34110.34810.32710.32610.317
TO0.0310.1100.1520.1080.1060.0890.0920.083
CO0.5000.5370.5060.5010.5010.5000.5000.500
SP−0.005−0.009−0.003−0.005−0.005−0.005−0.005−0.005

c=0%Sharpe0.4250.5620.5370.5250.5200.5090.5130.516
Δ1182−122−47−22−12125
Δ10195−68−49−17−7146
c=1%Sharpe0.4240.5600.5340.5230.5180.5070.5110.514
Δ1180−121−43−21−11135
Δ10192−66−45−16−5157
c=2%Sharpe0.4240.5570.5300.5200.5150.5050.5090.512
Δ1177−120−40−20−10135
Δ10190−65−42−15−4157
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret6.7558.9437.5837.1677.1797.0387.0617.055
σ^p9.65110.3439.7379.6699.6729.6619.6599.660
mcs p-value(1.00)(0.00)(0.00)(0.00)(0.00)(0.17)(0.52)(0.52)
StDev ret10.14910.68110.31710.14410.13710.11310.12810.108
TO0.0630.2130.3310.1660.1540.1320.1450.123
CO0.4160.4920.4270.4260.4250.4240.4240.424
SP−0.104−0.181−0.110−0.109−0.108−0.106−0.107−0.106
c=0%Sharpe0.6660.8370.7350.7070.7080.6960.6970.698
Δ161−366−103−22−243−1
Δ1068−254−63−15−1943
c=1%Sharpe0.6640.8320.7270.7020.7040.6930.6940.695
Δ158−361−93−20−2340
Δ1065−250−53−13−1754
c=2%Sharpe0.6630.8270.7190.6980.7010.6890.6900.692
Δ155−356−82−17−2141
Δ1062−245−42−11−1655
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret6.7558.9437.5837.1677.1797.0387.0617.055
σ^p9.65110.3439.7379.6699.6729.6619.6599.660
mcs p-value(1.00)(0.00)(0.00)(0.00)(0.00)(0.17)(0.52)(0.52)
StDev ret10.14910.68110.31710.14410.13710.11310.12810.108
TO0.0630.2130.3310.1660.1540.1320.1450.123
CO0.4160.4920.4270.4260.4250.4240.4240.424
SP−0.104−0.181−0.110−0.109−0.108−0.106−0.107−0.106
c=0%Sharpe0.6660.8370.7350.7070.7080.6960.6970.698
Δ161−366−103−22−243−1
Δ1068−254−63−15−1943
c=1%Sharpe0.6640.8320.7270.7020.7040.6930.6940.695
Δ158−361−93−20−2340
Δ1065−250−53−13−1754
c=2%Sharpe0.6630.8270.7190.6980.7010.6890.6900.692
Δ155−356−82−17−2141
Δ1062−245−42−11−1655

Note: This table shows portfolio statistics of the Global Minimum Variance portfolio, based on biweekly predictions of the covariance matrix, according to the Conditional Autoregressive model with HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, the DCC-GARCH, and HAR-DRD models as benchmarks. We report the average annualized return and standard deviation (both using the true realized covariance matrix and daily returns), as well as the turnover (TO), portfolio concentration (CO), and short positions (SP). Lowest portfolio volatilities are marked in bold. We also list the p-value that belongs to the model confidence set approach (MCS) of lowest ex-post daily volatility using the true realized covariance matrix based on a 5% significance level. Bold p-values correspond to models that belong to the model confidence set. Finally, the table reports the economic gains of switching from each model listed in the column to the CAFr HAR model in annual basis points, Δγ, for various transaction cost levels c and risk aversion coefficients γ. A bold Δγ means significantly different from zero at the 5% level. Panel A shows results of dimension 5, and Panel B lists results of dimension 15. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Table 9

The GMV portfolio (biweekly)

EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.4145.9615.5395.4325.3855.2595.2975.319
σ^p10.58310.89210.60510.56810.56910.56410.56410.563
mcs p-value(0.01)(0.00)(0.01)(0.01)(0.01)(0.40)(0.40)(1.00)
StDev ret10.38310.60010.30710.34110.34810.32710.32610.317
TO0.0310.1100.1520.1080.1060.0890.0920.083
CO0.5000.5370.5060.5010.5010.5000.5000.500
SP−0.005−0.009−0.003−0.005−0.005−0.005−0.005−0.005

c=0%Sharpe0.4250.5620.5370.5250.5200.5090.5130.516
Δ1182−122−47−22−12125
Δ10195−68−49−17−7146
c=1%Sharpe0.4240.5600.5340.5230.5180.5070.5110.514
Δ1180−121−43−21−11135
Δ10192−66−45−16−5157
c=2%Sharpe0.4240.5570.5300.5200.5150.5050.5090.512
Δ1177−120−40−20−10135
Δ10190−65−42−15−4157
EWMADCCHDRDWRiRFFR
Panel A: MCD/PFE/PG/WMT/XOM
Mean ret4.4145.9615.5395.4325.3855.2595.2975.319
σ^p10.58310.89210.60510.56810.56910.56410.56410.563
mcs p-value(0.01)(0.00)(0.01)(0.01)(0.01)(0.40)(0.40)(1.00)
StDev ret10.38310.60010.30710.34110.34810.32710.32610.317
TO0.0310.1100.1520.1080.1060.0890.0920.083
CO0.5000.5370.5060.5010.5010.5000.5000.500
SP−0.005−0.009−0.003−0.005−0.005−0.005−0.005−0.005

c=0%Sharpe0.4250.5620.5370.5250.5200.5090.5130.516
Δ1182−122−47−22−12125
Δ10195−68−49−17−7146
c=1%Sharpe0.4240.5600.5340.5230.5180.5070.5110.514
Δ1180−121−43−21−11135
Δ10192−66−45−16−5157
c=2%Sharpe0.4240.5570.5300.5200.5150.5050.5090.512
Δ1177−120−40−20−10135
Δ10190−65−42−15−4157
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret6.7558.9437.5837.1677.1797.0387.0617.055
σ^p9.65110.3439.7379.6699.6729.6619.6599.660
mcs p-value(1.00)(0.00)(0.00)(0.00)(0.00)(0.17)(0.52)(0.52)
StDev ret10.14910.68110.31710.14410.13710.11310.12810.108
TO0.0630.2130.3310.1660.1540.1320.1450.123
CO0.4160.4920.4270.4260.4250.4240.4240.424
SP−0.104−0.181−0.110−0.109−0.108−0.106−0.107−0.106
c=0%Sharpe0.6660.8370.7350.7070.7080.6960.6970.698
Δ161−366−103−22−243−1
Δ1068−254−63−15−1943
c=1%Sharpe0.6640.8320.7270.7020.7040.6930.6940.695
Δ158−361−93−20−2340
Δ1065−250−53−13−1754
c=2%Sharpe0.6630.8270.7190.6980.7010.6890.6900.692
Δ155−356−82−17−2141
Δ1062−245−42−11−1655
Panel B: AA/AXP/BA/CAT/GE/HD/HON/IBM/JPM/KO/MCD/PFE/PG/WMT/XOM
Mean ret6.7558.9437.5837.1677.1797.0387.0617.055
σ^p9.65110.3439.7379.6699.6729.6619.6599.660
mcs p-value(1.00)(0.00)(0.00)(0.00)(0.00)(0.17)(0.52)(0.52)
StDev ret10.14910.68110.31710.14410.13710.11310.12810.108
TO0.0630.2130.3310.1660.1540.1320.1450.123
CO0.4160.4920.4270.4260.4250.4240.4240.424
SP−0.104−0.181−0.110−0.109−0.108−0.106−0.107−0.106
c=0%Sharpe0.6660.8370.7350.7070.7080.6960.6970.698
Δ161−366−103−22−243−1
Δ1068−254−63−15−1943
c=1%Sharpe0.6640.8320.7270.7020.7040.6930.6940.695
Δ158−361−93−20−2340
Δ1065−250−53−13−1754
c=2%Sharpe0.6630.8270.7190.6980.7010.6890.6900.692
Δ155−356−82−17−2141
Δ1062−245−42−11−1655

Note: This table shows portfolio statistics of the Global Minimum Variance portfolio, based on biweekly predictions of the covariance matrix, according to the Conditional Autoregressive model with HAR dynamics, assuming a Wishart (W), Riesz (R), Inverse Riesz (iR), matrix-F (F), or a F-Riesz (FR) distribution. In addition, we include the EWMA, the DCC-GARCH, and HAR-DRD models as benchmarks. We report the average annualized return and standard deviation (both using the true realized covariance matrix and daily returns), as well as the turnover (TO), portfolio concentration (CO), and short positions (SP). Lowest portfolio volatilities are marked in bold. We also list the p-value that belongs to the model confidence set approach (MCS) of lowest ex-post daily volatility using the true realized covariance matrix based on a 5% significance level. Bold p-values correspond to models that belong to the model confidence set. Finally, the table reports the economic gains of switching from each model listed in the column to the CAFr HAR model in annual basis points, Δγ, for various transaction cost levels c and risk aversion coefficients γ. A bold Δγ means significantly different from zero at the 5% level. Panel A shows results of dimension 5, and Panel B lists results of dimension 15. The out-of-sample period goes from January 2005 until December 2019 and contains 3696 observations.

Table 8 shows three important results. First, the CAFr HAR model has significantly lower ex-post realized portfolio volatility in the five-dimensional case. For the case of 15 assets, the CAFr model belongs to the MCS and behaves at par with the matrix-F and Inverse Riesz. This is the most important signal in the table, as the GMV criterion function only steers toward minimizing the portfolio variance, not taking into account its return, short positions, concentration, or any other performance measure of the portfolio returns. Second, apart from the EWMA filter, the F-Riesz distribution also has the lowest turnover amongst all models. Third, taking into account transaction costs, there are economic gains in switching from any other matrix distribution and the HAR-DRD model to the CAFr HAR model (FR). These gains are more pronounced for the 15-dimensional case, ranging from 14 to 136 basis points per year. Only the DCC model appears to perform better, mainly due to the higher mean return during the sample period. This performance of the DCC model, however, is not robust: for other sets of stocks, the performance of the DCC is easily found to be much worse among the different models, such as in Supplementary Appendix Tables G.7–G.9. We note again that the GMV criterion only takes the variance of the portfolio returns into account, and not its for instance its (notoriously hard to estimate) expected return. Altering the objective to correct for this might result in other portfolios and possibly different rankings.

Most of the results are robust for the 10-day-ahead (bi-weekly) forecasting horizon in Table 9 (as well as the additional tables and settings in Supplementary Appendix G). In particular, the F-Riesz distribution either has the lowest (realized) ex-post portfolio volatility (k=5) or is in the MCS for models with the lowest variance (k=15). Particularly at this longer horizon, the lower variance typically comes at the cost of a lower average return, resulting in a less favorable performance on the return-based performance criteria. Again, the GMV criterion does not take this dimension explicitly into account. Still, it is comforting to see that the economic gains at the bi-weekly horizon are either insignificant (non-bold) or positive in favor of the F-Riesz model. We also note that the F-Riesz again has the lowest turnover amongst all models, safe the EWMA filter.

To sum up our out-of-sample results, the F-Riesz distribution does very well out-of-sample in terms of density forecasts, indicating that the F-Riesz distribution captures the distributional shape of realized variances and covariances better than any of the other models considered here at different forecasting horizons. In terms of point forecasts, the F-Riesz model is also in the MCS for different forecasting horizons and different numbers of assets. Finally, in terms of GMV performance, the new model is again always in the MCS for yielding the lowest ex-post portfolio variance. It also results in stable allocations, as the model has the lowest turnover across all models considered, safe the EWMA filter. The lower ex-post variance may come at the cost of a lower ex-post average return. Still, the economic gains are for many of the models either in favor of the F-Riesz model or insignificantly different.

4 Conclusions

In this article, we introduced the new conditional autoregressive F-Riesz model for capturing the dynamics of matrix-valued random variables. The F-Riesz distribution was obtained by mixing the Riesz distribution (Hassairi and Lajmi 2001) with an Inverse Riesz distribution (Tounsi and Zine 2012), thus allowing for much more heterogeneity in tail behavior compared to the well-known matrix distributions like the thin-tailed Wishart, the inverse Wishart, or the fat-tailed matrix-F distribution. While the latter distributions depend on one or two degrees of freedom parameters, the new distribution allows vector-valued degrees of freedom parameters. These can easily be estimated by a two-step targeted maximum likelihood approach. In higher dimensions, the elements of the vector-valued degrees-of-freedom parameters might even be clustered to impose further parsimony using the likelihood fit as a guiding mechanism; compare the clustering approach of Oh and Patton (2023) in a copula context.

An empirical application to realized covariance matrices of dimensions 5 and 15 and different samples of U.S. stocks over 19 years of daily data showed a remarkably high increase in the likelihood of the F-Riesz distribution compared to the (Inverse) Wishart, (Inverse) Riesz, and matrix-F distributions. The margin of outperformance in terms of density forecasts was significant, both in-sample and out-of-sample. Also, the degrees of freedom parameters varied significantly over the different coordinates. The model was also always in the MCS for the lowest ex-post portfolio variance for a GMV analysis, as well as for most of the simple point forecast comparisons. Overall these results show that there is strong heterogeneity of tail behavior of realized covariance matrices, as well as fat-tailedness, and that the F-Riesz distribution can be a helpful vehicle to obtain better empirical models.7

Opschoor thanks the Dutch National Science Foundation (NWO) for financial support under grant VI.VIDI.201.079. We thank the associate editor and two anonymous referees for their valuable comments, and Francisco Blasques for contributions on the asymptotic analysis of the model. This paper has previously circulated under the title “Tail Heterogeneity for Dynamic Covariance Matrices: the F-Riesz Distribution.”

Footnotes

1

We correct the expression for the pdf in the original paper of Díaz-García (2016); see Supplementary Appendix A. The correction relates to the part of the pdf involving the scaling matrix and is therefore crucial in our context, where we consider time-varying scaling matrices.

2

As one of the referees pointed out correctly, a state-space model with a Riesz measurement equation as in Gribisch and Hartkopf (2023) can also result in conditional fat-tailedness of the realized covariance matrix measurements via the mixing with the state innovation. The state innovations in Gribisch and Hartkopf (2023) do not give rise to the closed-form Riesz distribution, making the two models less comparable.

3

The type I relates to the fact that we have taken a lower triangular Cholesky decomposition in the Bartlett decomposition. Type II versions of the distribution based on an upper triangular Cholesky decomposition also exist, and we refer to Supplementary Appendix B for details. As the performance of both types of F-Riesz distributions was similar in our context, we only discuss the type I F-Riesz distributions in the main text.

4

For the i.i.d. case, we provide a separate set of conditions and results in Supplementary Appendix C. We also confine ourselves to proving consistency. Though we expect asymptotic normality of the MLE to hold as well, the expressions for first and second-order derivatives of the F-Riesz pdf are complex in θ and Vt such that establishing the asymptotic normality in a tractable way is hard without high-level assumptions. We leave this part for further research. In the empirical section, we use the standard sandwich estimator for the standard errors.

5

Note that our results and those of Gribisch and Hartkopf (2023) cannot be compared directly, as they use a parameter-driven model specification, whereas ours is an observation-driven approach.

6

Note that the other benchmarks like DCC, HAR-DRD, and EWMA are not included in this comparison, because they either (i) do not produce a density forecast for lack of a density assumption, or (ii) take the return vector rather than the realized covariance matrix as the random variable of interest, thus inhibiting a density comparison with the matrix-valued F-Riesz distribution.

7

As one of the referees remarked, the F-Riesz distribution may also be an interesting alternative in modeling covariance matrices in macroeconomics. For instance, Arias, Rubio-Ramírez, and Shin (2023) found that a Wishart-based “approach may be too restrictive, as a single scalar parameter (ν) controls the overall tightness of the distribution.” The F-Riesz distribution would be less susceptible to this drawback.

Funding

Funding support for this article was provided by the Dutch National Science Foundation (NWO) (VI.VIDI.201.079).

References

Amisano
G.
,
Giacomini
R.
 
2007
.
Comparing Density Forecasts via Weighted Likelihood Ratio Tests
.
Journal of Business & Economic Statistics
 
25
:
177
190
.

Andersen
T.
,
Bollerslev
T.
,
Diebold
F.
,
Labys
P.
 
2003
.
Modeling and Forecasting Realized Volatility
.
Econometrica
 
71
:
579
625
.

Anderson
T.
 
1962
. An Introduction to Multivariate Statistical Analysis, Technical report. New York: Wiley.

Andersson
S
,
Klein
T.
 
2010
.
On Riesz and Wishart Distributions Associated with Decomposable Undirected Graphs
.
Journal of Multivariate Analysis
 
101
:
789
810
.

Arias
J. E.
,
Rubio-Ramírez
J. F.
,
Shin
M.
 
2023
.
Macroeconomic Forecasting and Variable Ordering in Multivariate Stochastic Volatility Models
.
Journal of Econometrics
 
235
:
1054
1086
.

Barndorff-Nielsen
O.
,
Hansen
P.
,
Lunde
A.
,
Shephard
N.
 
2009
.
Realized Kernels in Practice: Trades and Quotes
.
Econometrics Journal
 
12
:
1
32
.

Barndorff-Nielsen
O
,
Shephard
N.
 
2004
.
Econometric Analysis of Realized Covariation: High Frequency Based Covariance, Regression, and Correlation in Financial Economics
.
Econometrica
 
72
:
885
925
.

Bollerslev
T.
,
Li
J.
,
Patton
A.
,
Quaedvlieg
R.
 
2020
.
Realized Semicovariances
.
Econometrica
 
88
:
1515
1551
.

Bollerslev
T.
,
Patton
A.
,
Quaedvlieg
R.
 
2018
.
Modeling and Forecasting (un)Reliable Realized Covariances for More Reliable Financial Decisions
.
Journal of Econometrics
 
207
:
71
91
.

Brownlees
C
,
Gallo
G.
 
2006
.
Financial Econometric Analysis at Ultra-High Frequency: Data Handling Concerns
.
Computational Statistics & Data Analysis
 
51
:
2232
2245
.

Callot
L.
,
Kock
A.
,
Medeiros
M.
 
2017
.
Modeling and Forecasting Large Realized Covariance Matrices and Portfolio Choice
.
Journal of Applied Econometrics
 
32
:
140
158
.

Chen
R.
,
Yang
D.
,
Zhang
C.-H.
 
2022
.
Factor Models for High-Dimensional Tensor Time Series
.
Journal of the American Statistical Association
 
117
:
94
116
.

Chiriac
R
,
Voev
V.
 
2011
.
Modelling and Forecasting Multivariate Realized Volatility
.
Journal of Applied Econometrics
 
26
:
922
947
.

Corsi
F.
 
2009
.
A Simple Approximate Long-Memory Model of Realized Volatility
.
Journal of Financial Econometrics
 
7
:
174
196
.

Díaz-García
J.
 
2013
.
A Note on the Moments of the Riesz Distribution
.
Journal of Statistical Planning and Inference
 
143
:
1880
1886
.

Díaz-García
J.
 
2016
.
Riesz and Beta-Riesz Distributions
.
Austrian Journal of Statistics
 
45
:
35
51
.

Diebold
F.
,
Mariano
R.
 
1995
.
Comparing Predictive Accuracy
.
Journal of Business & Economic Statistics
 
13
:
253
263
.

Engle
R.
 
2002
.
Dynamic Conditional Correlation: A Simple Class of Multivariate Generalized Autoregressive Conditional Heteroskedasticity Models
.
Journal of Business & Economic Statistics
 
20
:
339
350
.

Engle
R.
,
Gallo
G.
 
2006
.
A Multiple Indicators Model for Volatility Using Intra-Daily Data
.
Journal of Econometrics
 
131
:
3
27
.

Engle
R.
,
Ledoit
O.
,
Wolf
M.
 
2019
.
Large Dynamic Covariance Matrices
.
Journal of Business & Economic Statistics
 
37
:
363
375
.

Fleming
J.
,
Kirby
C.
,
Ostdiek
B.
 
2001
.
The Economic Value of Volatility Timing
.
The Journal of Finance
 
56
:
329
352
.

Fleming
J.
,
Kirby
C.
,
Ostdiek
B.
 
2003
.
The Economic Value of Volatility Timing Using “Realized” Volatility
.
Journal of Financial Economics
 
67
:
473
509
.

Golosnoy
V.
,
Gribisch
B.
,
Liesenfeld
R.
 
2012
.
The Conditional Autoregressive Wishart Model for Multivariate Stock Market Volatility
.
Journal of Econometrics
 
167
:
211
223
.

Gribisch
B
,
Hartkopf
J.
 
2023
.
Modeling Realized Covariance Measures with Heterogeneous Liquidity: A Generalized Matrix-Variate Wishart State-Space Model
.
Journal of Econometrics
 
235
:
43
64
.

Gross
K
,
Richards
D.
 
1987
.
Special Functions of Matrix Argument. I. Algebraic Induction, Zonal Polynomials, and Hypergeometric Functions
.
Transactions of the American Mathematical Society
 
301
:
781
811
.

Hansen
P.
,
Lunde
A.
,
Nason
J.
 
2011
.
The Model Confidence Set
.
Econometrica
 
79
:
453
497
.

Hassairi
A.
,
Lajmi
S.
 
2001
.
Riesz Exponential Families on Symmetric Cones
.
Journal of Theoretical Probability
 
14
:
927
948
.

Jin
X.
,
Maheu
J.
 
2013
.
Modeling Realized Covariances and Returns
.
Journal of Financial Econometrics
 
11
:
335
369
.

Jin
X.
,
Maheu
J.
 
2016
.
Bayesian Semiparametric Modeling of Realized Covariance Matrices
.
Journal of Econometrics
 
192
:
19
39
.

Konno
Y.
 
1991
.
A Note on Estimating Eigenvalues of Scale Matrix of the Multivariate F-Distribution
.
Annals of the Institute of Statistical Mathematics
 
43
:
157
165
.

Louati
M.
,
Masmoudi
A.
 
2015
.
Moment for the Inverse Riesz Distributions
.
Statistics & Probability Letters
 
102
:
30
37
.

Lunde
A.
,
Shephard
N.
,
Sheppard
K.
 
2016
.
Econometric Analysis of Vast Covariance Matrices Using Composite Realized Kernels and Their Application to Portfolio Choice
.
Journal of Business & Economic Statistics
 
34
:
504
518
.

Markowitz
H.
 
1952
.
Portfolio Selection
.
The Journal of Finance
 
7
:
77
91
.

Markowitz
H.
 
1991
.
Foundations of Portfolio Theory
.
The Journal of Finance
 
46
:
469
477
.

Mitchell
J.
,
Hall
S.
 
2005
.
Evaluating, Comparing and Combining Density Forecasts Using the KLIC with an Application to the Bank of England and NIESR Fan-Charts of Inflation
.
Oxford Bulletin of Economics and Statistics
 
67
:
995
1033
.

Noureldin
D.
,
Shephard
N.
,
Sheppard
K.
 
2012
.
Multivariate High-Frequency-Based Volatility (HEAVY) Models
.
Journal of Applied Econometrics
 
27
:
907
933
.

Oh
D.
,
Patton
A.
 
2016
.
High-Dimensional Copula-Based Distributions with Mixed Frequency Data
.
Journal of Econometrics
 
193
:
349
366
.

Oh
D.
,
Patton
A.
 
2017
.
Modeling Dependence in High Dimensions with Factor Copulas
.
Journal of Business & Economic Statistics
 
35
:
139
154
.

Oh
D.
,
Patton
A.
 
2018
.
Time-Varying Systemic Risk: Evidence from a Dynamic Copula Model of CDS Spreads
.
Journal of Business & Economic Statistics
 
36
:
181
195
.

Oh
D.
,
Patton
A.
 
2023
.
Dynamic Factor Copula Models with Estimated Cluster Assignments
.
Journal of Econometrics
 
237
:
105374
.

Opschoor
A.
,
Janus
P.
,
Lucas
A.
,
van Dijk
D.
 
2018
.
New HEAVY Models for Fat-Tailed Realized Covariances and Returns
.
Journal of Business & Economic Statistics
 
36
:
643
657
.

Opschoor
A.
,
Lucas
A.
,
Barra
I.
,
van Dijk
D.
 
2021
.
Closed-Form Multi-Factor Copula Models with Observation-Driven Dynamic Factor Loadings
.
Journal of Business & Economic Statistics
 
39
:
1066
1079
.

Patton
A.
 
2009
. Copula-Based Models for Financial Time Series. In: Mikosch, T., Kreiß, JP., Davis, R., Andersen, T. (eds),
Handbook of Financial Time Series
. Berlin, Heidelberg:
Springer
, pp.
767
785
.

Politis
D.
,
Romano
J.
 
1994
.
The Stationary Bootstrap
.
Journal of the American Statistical Association
 
89
:
1303
1313
.

Straumann
D.
,
Mikosch
T.
 
2006
.
Quasi-Maximum-Likelihood Estimation in Conditionally Heteroeskedastic Time Series: A Stochastic Recurrence Equations Approach
.
The Annals of Statistics
 
34
:
2449
2495
.

Tounsi
M.
,
Zine
R.
 
2012
.
The Inverse Riesz Probability Distribution on Symmetric Matrices
.
Journal of Multivariate Analysis
 
111
:
174
182
.

Wang
D.
,
Liu
X.
,
Chen
R.
 
2019
.
Factor Models for Matrix-Valued High-Dimensional Time Series
.
Journal of Econometrics
 
208
:
231
248
.

White
H.
 
2000
.
A Reality Check for Data Snooping
.
Econometrica
 
68
:
1097
1126
.

Wintenberger
O.
 
2013
.
Continuous Invertibility and Stable QML Estimation of the EGARCH(1, 1) Model
.
Scandinavian Journal of Statistics
 
40
:
846
867
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data