I congratulate Evans and Didelez warmly on this innovative and inspiring paper. Here, I offer my interpretation of the approach in terms of two factorizations and a density ratio.

Any distribution over Z,X,Y can be factorized in two ways:

(1)

Factorization C1 is the usual one; it is factorization A1 that is the focus of the two authors. More concretely, suppose ϕZYX(z,yx) is a conditional copula density, i.e. for every value of x, (z,y)ϕZYX(z,yx) is a density function over [0,1]2 with uniform margins. To obtain A1(PZXY), take PZX and PYX to be the corresponding component and let ϕZYX=x be the density function of (F(ZX=x),F(YX=x)) for every x in the support of X. Conversely, to compose PXYZ from the three pieces, we have

(2)

where F(zx) and F(yx) are defined by PZX and PYX, respectively. Note that although PZX appear on both sides of equation (1), the relation between PYZ,X and the pair (PYX,ϕZYX) is not a separate bijection because F(zx) is needed to map one to the other. In other words, the map between PYZ,X and (PYX,ϕZYX) itself depends on PZX.

Suppose PZXY* is a related distribution, of which the margin PYX* is our model of interest. We require that PZXY* is related to PZXY through a density ratio r, given by

such that (a) r does not depend on y, (b) r>0P-almost everywhere, and (c) r can be identified from P. Then, by integrating out y on both sides of p*(z,x,y)=r(z,x;p)p(z,x,y), we have

(3)

That p*(z,x,y) being ‘cognate’ with respect to p(z,x,y) amounts to choosing

for some weight w(zx), such as w(zx)=p(z) for estimating the average treatment effect and w(zx)=p(zx=1) for estimating the effect of the treatment on the treated.

With r(x,y;p) chosen and fixed, the frugal parametrization is to represent p (and hence p*) through pZX, pYX*, and ϕZYX*, i.e. the following three pieces in box:

The likelihood can be obtained through

where the second line uses equation (3). When ϕ*(z,yx) is a conditional copula density, using equation (2), it follows that

(4)

Indeed, by uniform margins of the copula, one can check that

Further, in equation (4), the arguments of ϕ* depend on F*(yx) and F*(zx): the former is derived from p*(yx) and the latter is the conditional distribution function pertaining to p*(z,x)=p(z,x)r(z,x;p), given by

(5)

Hence, equation (4) provides an explicit expression for p(z,x,y) in terms of pZX, pYX*, and ϕZYX*, which depends on the pre-specified density ratio r(x,y;p) through equation (5). Multiplying equation (4) by the density ratio simply yields the expression for p*(z,x,y).

Sequentially randomized trial

 
Example 1
For Figure 1, with r(a,l,b;p)=p(b)/p(ba,l), we can parametrize P and P* in terms of the three pieces in box below:
(a) P(A,L,B,Y) and (b) P*(A,L,B,Y).
Figure 1.

(a) P(A,L,B,Y) and (b) P*(A,L,B,Y).

Partially marginal model

 
Example 2
Suppose we have an observational study with baseline covariates Z=(Z1,Z2), treatment X and outcome Y. Imagine that we want to study how Z1 modifies the effect of X on Y. Hence, we want to choose P*(Z,X,Y) such that P*(YX,Z1) aligns with our intended marginal model P(YZ1,do(X)). In the meantime, we need to use both Z1 and Z2 to control for the confounding between X and Y. This is called a ‘partially’ marginal model because the marginal model is conditional on a partial collection of baseline covariates. To facilitate this analysis, we can choose density ratio
and parametrize p (and hence p*) in terms of

Author notes

Conflict of interests: None declared.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/pages/standard-publication-reuse-rights)