-
PDF
- Split View
-
Views
-
Cite
Cite
Hein Putter, Hans C. van Houwelingen, Dynamic frailty models based on compound birth–death processes, Biostatistics, Volume 16, Issue 3, July 2015, Pages 550–564, https://doi.org/10.1093/biostatistics/kxv002
Close -
Share
Abstract
Frailty models are used in survival analysis to model unobserved heterogeneity. They accommodate such heterogeneity by the inclusion of a random term, the frailty, which is assumed to multiply the hazard of a subject (individual frailty) or the hazards of all subjects in a cluster (shared frailty). Typically, the frailty term is assumed to be constant over time. This is a restrictive assumption and extensions to allow for time-varying or dynamic frailties are of interest. In this paper, we extend the auto-correlated frailty models of Henderson and Shimakura and of Fiocco, Putter and van Houwelingen, developed for longitudinal count data and discrete survival data, to continuous survival data. We present a rigorous construction of the frailty processes in continuous time based on compound birth–death processes. When the frailty processes are used as mixtures in models for survival data, we derive the marginal hazards and survival functions and the marginal bivariate survival functions and cross-ratio function. We derive distributional properties of the processes, conditional on observed data, and show how to obtain the maximum likelihood estimators of the parameters of the model using a (stochastic) expectation-maximization algorithm. The methods are applied to a publicly available data set.
1. Introduction
In many fields of application of event history analysis, such as medicine and life sciences, demography, social sciences and economy, models for survival data have been developed that explicitly account for unobserved heterogeneity. The seminal paper by Vaupel and others (1979) coined the term “frailty” for a model where the heterogeneity was introduced by a random multiplication factor.
The original formulation of the frailty model by Vaupel and others (1979) specified the hazard rate of a subject as |$h(t \,|\, Z) = Z h_{\rm c}(t)$|, where |$h_{\rm c}(t)$| is the conditional hazard of a subject with frailty value 1, and |$Z$| is a random multiplicative factor, the frailty, which was assumed to follow a gamma distribution. Since part of the frailty can be absorbed in |$h_{\rm c}$|, for identifiability it is typically assumed that the expectation of |$Z$| equals 1. The gamma distribution is mathematically convenient, because the posterior distribution of |$Z$| given survival data will again follow a gamma distribution. As a biological model, the validity of the sgamma frailty model has been questioned, because it implies that the cross-ratio function (Clayton, 1978; Oakes, 1989) and the heterogeneity of survivors (Hougaard, 2000; Farrington and others, 2012) are constant over time, but as a simplified model it has its merits. There is, however, room for extension to more realistic frailty models, for instance, to more general families of frailty distributions (Hougaard, 1986, 2000), correlated frailties (Yashin and others, 1995; Wienke, 2011), or time-varying or dynamic frailties.
Earlier attempts to develop dynamic frailty models include hazard models based on diffusion processes (Woodbury and Manton, 1977; Yashin and Manton, 1997; Aalen and Gjessing, 2004), and on Lévy-type processes (Gjessing and others, 2003). Paik and others (1994) and Wintrebert and others (2004) developed models based on piecewise constant frailties. McGilchrist and Yau (1996) and Yau and McGilchrist (1998) developed an auto-regressive log-normal frailty model and Henderson and Shimakura (2003) and Fiocco and others (2009) an auto-correlated gamma frailty model. A recent paper of Unkel and others (2014) considers time-varying frailty models, proposed in Farrington and others (2012), in the context of paired serological survey data.
In the present paper, we develop an extension of the auto-correlated gamma frailty model of Fiocco and others (2009) to continuous time survival data. A rigorous construction of the underlying frailty process is provided that at the same time generalizes both the correlation structure of the frailty process over time and the class of underlying frailty distributions. The construction is related to that of Gjessing and others (2003), the essential difference being that the Lévy-type processes are constructed as birth–death processes in two dimensions rather than in one dimension. Section 2 outlines the construction of the processes and gives properties of the frailty process. Section 3 derives properties of survival models using such frailty processes as underlying random effects. Results for univariate and bivariate survival data are derived and it is shown how the distribution of the frailty process and its underlying building blocks are affected by event history. Section 4 discusses estimation methods for the underlying parameters of the frailty process and the conditional survival model. We conclude with a simulation study in Section 5, an application in Section 6, and with a discussion in Section 7.
2. Construction and properties
2.1. Compound birth–death processes
2.1.1. Construction
2.1.2. Independent increments
2.1.3. Finite-dimensional distribution
Heuristically, coming back to the definition of |$Z(t)$| being composed of many independent frailty components |$X(u,v)$| that are born at |$u \leq t$| and die at |$v \geq t$|: for |$t_1 \leq t_2$|, those that are born at |$u \leq t_1$| and die at |$v \geq t_2$| are common to |$Z(t_1)$| and |$Z(t_2)$|. This common part, |$A_{12}$|, induces positive correlation; the correlation will be greater if the relative contribution of the mass |$\mu (A_{12})$| of the common part is bigger.
2.2. Infinitely divisible frailties
3. Using the dynamic frailty process as random effect
3.1. Marginal hazard and survival function
|$\tilde {H}_{\rm c}(u,v;t)$| for different values of |$u \leq v,$| within brackets |$($|one line below|$)$| the derivative with respect to |$t$|
| . | |$v \lt 0$| . | |$0 \leq v \leq t$| . | |$v >t$| . |
|---|---|---|---|
| |$u \lt 0$| | 0 | |$H_{\rm c}(v)$| | |$H_{\rm c}(t)$| |
| (0) | (0) | (|$h_{\rm c}(t)$|) | |
| |$0 \leq u \leq t$| | NA | |$H_{\rm c}(v)-H_{\rm c}(u)$| | |$H_{\rm c}(t) - H_{\rm c}(u)$| |
| (0) | (|$h_{\rm c}(t)$|) | ||
| |$u >t$| | NA | NA | 0 |
| (0) |
| . | |$v \lt 0$| . | |$0 \leq v \leq t$| . | |$v >t$| . |
|---|---|---|---|
| |$u \lt 0$| | 0 | |$H_{\rm c}(v)$| | |$H_{\rm c}(t)$| |
| (0) | (0) | (|$h_{\rm c}(t)$|) | |
| |$0 \leq u \leq t$| | NA | |$H_{\rm c}(v)-H_{\rm c}(u)$| | |$H_{\rm c}(t) - H_{\rm c}(u)$| |
| (0) | (|$h_{\rm c}(t)$|) | ||
| |$u >t$| | NA | NA | 0 |
| (0) |
|$\tilde {H}_{\rm c}(u,v;t)$| for different values of |$u \leq v,$| within brackets |$($|one line below|$)$| the derivative with respect to |$t$|
| . | |$v \lt 0$| . | |$0 \leq v \leq t$| . | |$v >t$| . |
|---|---|---|---|
| |$u \lt 0$| | 0 | |$H_{\rm c}(v)$| | |$H_{\rm c}(t)$| |
| (0) | (0) | (|$h_{\rm c}(t)$|) | |
| |$0 \leq u \leq t$| | NA | |$H_{\rm c}(v)-H_{\rm c}(u)$| | |$H_{\rm c}(t) - H_{\rm c}(u)$| |
| (0) | (|$h_{\rm c}(t)$|) | ||
| |$u >t$| | NA | NA | 0 |
| (0) |
| . | |$v \lt 0$| . | |$0 \leq v \leq t$| . | |$v >t$| . |
|---|---|---|---|
| |$u \lt 0$| | 0 | |$H_{\rm c}(v)$| | |$H_{\rm c}(t)$| |
| (0) | (0) | (|$h_{\rm c}(t)$|) | |
| |$0 \leq u \leq t$| | NA | |$H_{\rm c}(v)-H_{\rm c}(u)$| | |$H_{\rm c}(t) - H_{\rm c}(u)$| |
| (0) | (|$h_{\rm c}(t)$|) | ||
| |$u >t$| | NA | NA | 0 |
| (0) |
3.1.1. Special cases
- The gamma distribution has |$\psi (c;\beta ) = \ln (\beta +c) - \ln (\beta )$|, with derivative |$\psi ^\prime (\tilde {H}_{\rm c}(u,v;t);\beta ) = (\beta + \tilde {H}_{\rm c}(u,v;t))^{-1}$|, which implies that the marginal hazard equals(3.4)\[h(t) = h_{\rm c}(t) \int _{A(t)} \frac {\alpha g(u,v)}{\beta + \tilde {H}_{\rm c}(u,v;t)} \,{\rm d}u \,{\rm d}v.\]
- The power variance function (PVF) distributions (Hougaard, 2000; Aalen and others, 2008) have |$\psi (c;\beta ) = |1 - ({\beta }/({\beta +c}))^m|$|, for |$\beta >0$| and |$m>-1$|, with |$m \neq 0$|. The marginal hazard is given byThe family of PVF distributions has for instance the gamma distribution as a special case as a limit for |$m \to 0$| and |$\alpha \to \infty$| in such a way that |$\alpha m \to \tilde {\alpha }$|. In that case the above formula for the marginal hazard does not hold and (3.4) is to be used instead.\[h(t) = h_{\rm c}(t) \int _{A(t)} \frac {\alpha m}{\beta } g(u,v) \left ( \frac {\beta }{\beta + \tilde {H}_{\rm c}(u,v;t)} \right ) ^{m+1} \,{\rm d}u \,{\rm d}v.\]
- The positive stable distribution (Hougaard, 1986, 2000) has |$\psi (c;\beta ) = c^\beta$|, with |$\beta \in (0,1)$|. The derivative of |$\psi$| is given by |$\psi ^\prime (c;\beta ) = \beta c^{\beta -1}$|, which is not defined at |$c=0$|. Indeed, the positive stable distribution has infinite expectation. The derivative of |$\psi$| is defined, however, at |$c>0$|, so that the conditional expectation |$\mathbf {E}(Z(t) \,|\, T \geq t)$| is defined if |$H_{\rm c}(t)>0$|. The conditional Laplace transform of |${\rm d}B(u,v)$|, given |$T \geq t$| is given byThe marginal hazard is given by\[\mathcal {L}_{{\rm d}B(u,v) \,|\, T \geq t}(c) = \exp \{-\alpha g(u,v) \cdot ((c+ \tilde {H}_{\rm c}(u,v;t))^\beta - \tilde {H}_{\rm c}(u,v;t)^\beta ) \,{\rm d}u \,{\rm d}v \}.\]\[h(t) = h_{\rm c}(t) \int _{A(t)} \alpha \beta g(u,v) \tilde {H}_{\rm c}(u,v;t)^{\beta -1} \,{\rm d}u \,{\rm d}v.\]
Figure 1 shows the marginal hazard |$h(t)$| for the choice |$h_{\rm c}(t) \equiv 1$| and |$g(u,v) = \lambda ^2 \exp (-\lambda (v-u))$|. The frailties in Figure 1(a)–(c) have been chosen such that |$\mathbf {E} Z(t) \equiv 1$|, |${{\rm var}}(Z(t)) \equiv 0.5, 1, 2,$| and |$\rho = \exp (-\lambda ) = 1, 0.6, 0.2$|. For the gamma distribution, shown in Figure 1(a), this is achieved by choosing |$\alpha = \beta = 2, 1, 0.5$|. For the PVF with |$m=3$| (Figure 1(b)), this is achieved by choosing |$\alpha = \beta /3, \beta = 8, 4, 2$|; for the PVF with |$m=-\frac {1}{2}$|, the inverse Gaussian distribution (Figure 1(c)), by choosing |$\alpha = 2\beta , \beta = 1, \frac {1}{2}, \frac {1}{4}$|. Figure 1(d) shows the marginal hazard for the positive stable distribution. A direct comparison of the positive stable distribution with the others is impossible, since the positive stable does not have finite variance. To still make some comparison with the other distributions (here the gamma distribution) possible, the same values of |$\alpha$| were chosen as for the gamma distribution, and for each value of |$\rho$| (|$\lambda$|), values of |$\beta$| for the positive stable distribution were chosen in such a way that |$\mathbf {E}[Z(t) \,|\, T \geq t]$| of (3.3) matches each of the choices of the gamma distribution at |$t=1$| (|$H(t)=1$|). For some combinations of |$\alpha$| and |$\lambda$| no appropriate |$\beta \in (0,1)$| could be found.
Marginal hazards for a selection of frailty distributions, for different values of frailty variance and auto-correlation.
Marginal hazards for a selection of frailty distributions, for different values of frailty variance and auto-correlation.
Figure 1 illustrates a number of things. First, in (a), (b), and (c), higher variances of the frailties induce stronger selection effects; the marginal hazard is increasingly dragged down from the conditional hazard (|$\equiv 1$|) as the frailty variance increases. This phenomenon is well known from shared frailty models. The value |$\rho =1$| (|$\lambda =0$|) corresponds to the case where the frailty is time-constant, i.e., |$Z(t) \equiv Z$|. In this case, it is well known that the marginal hazard is given by |$h(t) = \alpha h_{\rm c}(t) \psi ^\prime (H_{\rm c}(t);\beta )$| (Aalen and others, 2008, see, e.g., ][Section 6.2.4). In all cases illustrated—in general if |$\psi ^\prime (t;\beta ) \to 0$| as |$t \to \infty$|—we have |$h_{\rm c}(t) \to 0$| as |$t \to \infty$| for |$\rho =1$|. Interestingly, for |$\rho \lt 1$| this is no longer the case; in the cases illustrated in Figure 1, we have |$h(t) \to \int _0^\infty \lambda \,{\rm e}^{-\lambda u} ({\beta }/({\beta +u})) \,{\rm d}u$| and |$h(t) \to \int _0^\infty \lambda \,{\rm e}^{-\lambda u} ({\beta }/({\beta +u}))^{m+1} \,{\rm d}u$|, as |$t \to \infty$|, for the gamma and PVF(|$m$|) case, respectively. For the positive stable case, |$h(t) \to \alpha \lambda ^{1-\beta } \Gamma (1+ \beta )$|; values of |$\beta$| closer to 1 correspond to smaller selection effects. In each of these cases, it can be shown that |$\lim _{t \to \infty } h(t)$|, as a function of |$\lambda$|, increases, and equals 0 only at |$\lambda =0$|.
The selection by the frailties also has an effect on marginal hazard ratios. Consider two conditional hazards |$h_{\rm c}(t)$| and |$r h_{\rm c}(t)$|, for which the hazard ratio equals |$r$|. Figure 2 illustrates the marginal hazard ratio obtained by dividing the marginal hazard corresponding to |$r h_{\rm c}(t)$| (for |$r=2$|) by the marginal hazard corresponding to |$h_{\rm c}(t)$|, again for |$h_{\rm c}(t) \equiv 1$| and |$g(u,v) = \lambda ^2 \exp (-\lambda (v-u))$|. Two distributions are illustrated, with the same choices as Figure 1, the gamma distribution and the inverse Gaussian distribution. In general the pattern will depend on the exact choice of |$h_{\rm c}$| and |$g(u,v)$|. It is straightforward to see that for the positive stable distribution the marginal hazard ratio equals |$r^\beta$|, irrespective of the choice of |$h_{\rm c}$| and |$g(u,v)$|.
Marginal hazard ratios when the conditional hazard ratio equals 2, for the gamma distribution (a) and the PVF distribution with |$m=\frac {1}{2}$| (b); different values of frailty variance and auto-correlation are shown.
Marginal hazard ratios when the conditional hazard ratio equals 2, for the gamma distribution (a) and the PVF distribution with |$m=\frac {1}{2}$| (b); different values of frailty variance and auto-correlation are shown.
3.2. Marginal bivariate distribution
Supplementary material available at Biostatistics online (Section A) derives the marginal bivariate distribution induced by the dynamic frailty process and properties of the cross-ratio function.
3.3. Frailty distribution given event history
Supplementary material available at Biostatistics online (Section B) derives the conditional distribution of the |${\rm d}B(u,v)$|'s, and hence of |$Z(t+s)$|, given |$T \geq t$|, and, more generally, given observed survival data from a cluster sharing the same dynamic frailty are given. The latter results are useful for fitting the model using an EM-algorithm, which is the topic of the next section.
4. Fitting the model
4.1. Set-up
Section 4.2 will develop an EM-algorithm with two versions of the E-step, to be used depending on the size of the data set, in particular on the number of events in the clusters. The first version is feasible only for clusters with a very limited number of events, while the second version, a Monte Carlo EM algorithm, is feasible for clusters with moderate numbers of events.
4.2. EM algorithm
The approach followed is profile maximum likelihood, where the parameters |$h_0(\cdot )$| and |$\boldsymbol {\gamma }$| of the conditional Cox model are estimated by (Monte Carlo) EM for fixed values of the parameters |$\boldsymbol {\theta } = (\alpha , \beta , \lambda )$| of the frailty process |$Z(\cdot )$|. The profile log-likelihood |$\ell _{{\rm prof}}(\boldsymbol {\theta }) = \sup _{h_0,\boldsymbol {\gamma }} \ell (h_0,\boldsymbol {\gamma },\boldsymbol {\theta })$| is obtained and maximized with respect to |$\boldsymbol {\theta }$|. The same approach was used by Nielsen and others (1992) in the context of the shared gamma frailty model. Two versions of the E-step will be developed, one for clusters with small number of events, the other for clusters for larger number of events.
4.2.1. Exact E-step
Both numerator and denominator can be expressed in terms of derivatives of the Laplace transform |$\mathcal {L}(c_1,\ldots ,c_L)$| of the finite-dimensional distribution of the frailty process from (2.6). Details are given in supplementary material available at Biostatistics online (Section C).
4.2.2. Monte Carlo E-step
Calculation of the conditional expectation of |$Z_g(t_j^* )$| using the method described above becomes prohibitively slow for |$>$|5–10 events in a cluster. Supplementary material available at Biostatistics online (Section D) describes a stochastic E-step that can be used for moderately large number of events per cluster.
4.2.3. Estimation of the frailty parameters
After convergence of the EM algorithm for fixed values of |$\boldsymbol {\theta } = (\alpha , \beta , \lambda )$|, maximum likelihood estimators |$\hat {h}_0(\boldsymbol {\theta })$| and |$\hat {\boldsymbol {\gamma }}(\boldsymbol {\theta })$| are obtained and the profile log-likelihood |$\ell _{{\rm prof}}(\boldsymbol {\theta }) = \ell (\hat {h}_0(\boldsymbol {\theta }),\hat {\boldsymbol {\gamma }}(\boldsymbol {\theta }),\boldsymbol {\theta })$| can be calculated. Since |$\ell _{{\rm prof}}(\boldsymbol {\theta })$| is a function of a limited number of parameters only it can simply be optimized numerically.
Calculation of the profile log-likelihood, or for that matter, of the marginal log-likelihood in general during the EM algorithm, is not directly possible if the Monte Carlo method is used in the E-step. In this case, the MCMC draws may be used to approximate it, see Supplementary material available at Biostatistics online (Section E).
4.3. Standard errors
Supplementary material available at Biostatistics online (Section F) contains details on obtaining the covariance matrix of the estimators.
5. Simulation
A simulation study was performed to evaluate whether it is practically feasible to fit the dynamic frailty models, and in particular to see to what extent it is possible to retrieve information about the value of |$\rho$| for the auto-regressive correlated gamma process. Details about the set-up and the results can be found in Supplementary material available at Biostatistics online (Section G).
6. Application
We consider a litter-matched tumorigenesis experiment, which is discussed in Mantel and others (1977). The data (rats) are part of the survival package in R. It consists of 50 all-female litters of three rats each. In each of the litters, one (random) is given anti-tumor treatment, while the other two serve as controls and are given placebo. The outcome of the experiment is time in weeks to tumor recurrence. A total of 40 tumor recurrence events are observed during follow-up. There were 23 litters without recurrences, 15 with one recurrence, 11 with two recurrences and 1 with three recurrences. A Cox model without frailty gives an estimated regression coefficient of 0.899, with a standard error (SE) of 0.317, for the treatment effect, with a log partial likelihood of |$-208.8452$|. A shared gamma frailty model yields an estimate of 0.906, with an SE of 0.319 for the treatment effect. The estimated frailty variance is 0.474. The (marginal) log partial likelihood equals |$-208.0774$|, which yields a likelihood ratio statistic of 1.5356, and a |$p$|-value of 0.78.
The purpose of the frailty term is to account for unobserved heterogeneity, caused by possible differences between litters that were not controlled by the experiment and not captured by the covariates in the model. Such unobserved differences could either be time-varying (e.g., temporal changes in experimental conditions) or their effect could be time-dependent. Alternatively, a time-varying frailty could capture violations of the proportional hazards assumption of the treatment that are different from those captured by a time-fixed frailty model (Section 3.1). To explore these issues, we fitted a series of auto-correlated frailty processes, all with |$g(u,v) = \lambda ^2 \exp (-\lambda (v-u))$|, yielding |${{\rm cor}}(Z(s),Z(t)) = \exp (-\lambda |s-t|)$|.
First, a Cox model with auto-correlated gamma frailty process was fitted, i.e., the gamma distribution was used (|$\psi (c;\beta ) = \ln (\beta +c) - \ln (\beta )$|). To ensure unit mean |$\mathbf {E} Z(t) \equiv 1$|, |$\alpha$| and |$\beta$| were restricted to |$\alpha =\beta$|, both equal to the inverse of the variance |${{\rm var}}(Z(t))$|. Figure 3 shows a contour plot of the profile log partial likelihood for varying |$\beta$| and |$\lambda$|. For this gamma correlated frailty model, the profile log partial likelihood was maximized at |$\beta$| is 0.834. The estimated frailty variance, 1.199 (=1/|$\hat {\beta }$|), is now much bigger than in the shared frailty model. The estimated value of |$\lambda$| was 0.0388, and hence the estimated correlation was |$\hat {\rho } = 0.962$|, which (since time is in days) amounts to a yearly correlation of 0.13. The model with correlated gamma frailty gives an estimate of 0.910 of the treatment effect, with an SE of 0.334. The marginal log partial likelihood is |$-207.4655$|. The difference in |$-2* $|log-likelihood, compared with the shared frailty model, is 1.2238, which, with one degree of freedom, would amount to a |$p$|-value of 0.73. Note, however, that the shared frailty model corresponds to |$\rho =1$|, which lies on the boundary of the parameter space. Figure 2 in Supplementary material available at Biostatistics online (Section H) shows the posterior mean gamma frailties for each of the 50 L, at the event time points.
Contour plot of the profile log partial likelihood for varying |$\beta$| and |$\lambda$|, for the rat data.
Contour plot of the profile log partial likelihood for varying |$\beta$| and |$\lambda$|, for the rat data.
In Table 2, we compare the results of using different underlying frailty distributions. The first is the gamma distribution, discussed above. Two examples from the power variance function (PVF) family, for which |$\psi (c;\beta ) = |1 - ({\beta }/({\beta +c}))^m|$|, are included, with |$m=-0.5$| (the inverse Gaussian distribution) and |$m=1$|. The restrictions on |$\alpha$| and |$\beta$| and the relation with |${{\rm var}}(Z(t))$| are now given by |$\beta = ({m+1})/{{{\rm var}}(Z(t))}$|, and |$\alpha = {\beta }/{|m|}$|. The last distribution is the positive stable distribution, with |$\psi (c;\beta ) = c^\beta$|, for |$\beta \in (0,1)$|. Since the positive stable distribution does not have finite expectation or variance, we chose the restriction |$\alpha \beta =1$|. This parametrization was chosen such that the conditional mean |$\mathbf {E}(Z(t) | T \geq t)$| would be |$\alpha \beta =1$| for a shared frailty at |$t$| for which |$H_{\rm c}(t) = \int _0^t h_{\rm c}(s) \,{\rm d}s = 1$|, and its conditional variance |${{\rm var}}(Z(t) | T \geq t)$| would be |$1-\beta$|. For the positive stable distribution, the variance is not defined, but the estimate of |$\beta$| was 0.896, with a 95% confidence interval of 0.305–0.984. In principle, several PVF models with different values of |$m$| could be fitted, and the one with the highest log partial likelihood could be chosen, but there is very little information on |$m$| in the data. Results for estimates of |$\gamma$| and |$\lambda$| are remarkably consistent across different underlying frailty distributions, with the exception of the positive stable distribution.
Parameter estimates and log partial likelihoods of the auto-correlated frailty model with different underlying frailty distributions, for the rat data
| Distribution . | |$\hat {\gamma }$| (SE) . | Variance (95% CI) . | |$\hat {\lambda }$| (95% CI) . | Log partial likelihood . |
|---|---|---|---|---|
| Gamma | 0.910 (0.334) | 1.199 (0.205–7.029) | 0.039 (0.004–0.356) | |$-$|207.4655 |
| |${\rm PVF}(-0.5)$| | 0.921 (0.335) | 1.464 (0.166–12.92) | 0.039 (0.004–0.373) | |$-$|207.5718 |
| PVF(1) | 0.905 (0.334) | 1.094 (0.223–5.365) | 0.039 (0.004–0.342) | |$-$|207.3987 |
| Positive stable | 0.965 (0.345) | – | 0.023 (0.001–0.589) | |$-$|208.2296 |
| Distribution . | |$\hat {\gamma }$| (SE) . | Variance (95% CI) . | |$\hat {\lambda }$| (95% CI) . | Log partial likelihood . |
|---|---|---|---|---|
| Gamma | 0.910 (0.334) | 1.199 (0.205–7.029) | 0.039 (0.004–0.356) | |$-$|207.4655 |
| |${\rm PVF}(-0.5)$| | 0.921 (0.335) | 1.464 (0.166–12.92) | 0.039 (0.004–0.373) | |$-$|207.5718 |
| PVF(1) | 0.905 (0.334) | 1.094 (0.223–5.365) | 0.039 (0.004–0.342) | |$-$|207.3987 |
| Positive stable | 0.965 (0.345) | – | 0.023 (0.001–0.589) | |$-$|208.2296 |
Parameter estimates and log partial likelihoods of the auto-correlated frailty model with different underlying frailty distributions, for the rat data
| Distribution . | |$\hat {\gamma }$| (SE) . | Variance (95% CI) . | |$\hat {\lambda }$| (95% CI) . | Log partial likelihood . |
|---|---|---|---|---|
| Gamma | 0.910 (0.334) | 1.199 (0.205–7.029) | 0.039 (0.004–0.356) | |$-$|207.4655 |
| |${\rm PVF}(-0.5)$| | 0.921 (0.335) | 1.464 (0.166–12.92) | 0.039 (0.004–0.373) | |$-$|207.5718 |
| PVF(1) | 0.905 (0.334) | 1.094 (0.223–5.365) | 0.039 (0.004–0.342) | |$-$|207.3987 |
| Positive stable | 0.965 (0.345) | – | 0.023 (0.001–0.589) | |$-$|208.2296 |
| Distribution . | |$\hat {\gamma }$| (SE) . | Variance (95% CI) . | |$\hat {\lambda }$| (95% CI) . | Log partial likelihood . |
|---|---|---|---|---|
| Gamma | 0.910 (0.334) | 1.199 (0.205–7.029) | 0.039 (0.004–0.356) | |$-$|207.4655 |
| |${\rm PVF}(-0.5)$| | 0.921 (0.335) | 1.464 (0.166–12.92) | 0.039 (0.004–0.373) | |$-$|207.5718 |
| PVF(1) | 0.905 (0.334) | 1.094 (0.223–5.365) | 0.039 (0.004–0.342) | |$-$|207.3987 |
| Positive stable | 0.965 (0.345) | – | 0.023 (0.001–0.589) | |$-$|208.2296 |
7. Discussion
In this paper, we have extended the auto-correlated gamma frailty models of Henderson and Shimakura (2003) and Fiocco and others (2009), developed for longitudinal count data and discrete survival data, to continuous survival data. The construction in continuous time of these frailty processes developed in this paper is based on compound birth–death processes and bears resemblance to the construction of gamma processes (Cinlar, 1980) and to the Lévy-type processes of Gjessing and others (2003).
The model in Fiocco and others (2009) was developed for longitudinal count data |$\mathbf {Y} = (Y_1,\ldots ,Y_T)$|, and specified that, conditionally on a vector of gamma distributed gamma random variables |$\mathbf {Z} = (Z_1,\ldots ,Z_T)$|, |$Y_t \,|\, Z_t \sim {\rm Po}(Z_t \mu _t)$|, for |$t=1,\ldots ,T$|, where |$\boldsymbol {\mu } = (\mu _1,\ldots ,\mu _T)$| is a vector of means, possibly depending on covariates. The model can be applied to survival data with piecewise constant (baseline) hazards by exploiting the fact that the log-likelihoods of the longitudinal count model and the piecewise constant hazards model coincide, up to a constant (see Fiocco and others (2009) for an application of the model to survival data). The frailty terms satisfy |$\mathbf {E} Z_t = 1$|, |${{\rm var}}(Z_t) = \sigma ^2$|, and |${{\rm cor}}(Z_s,Z_t) = \rho ^{|s-t|}$|, where |$s,t = 1,\ldots ,T$|. The construction of frailty processes in continuous time makes it possible to apply the auto-correlated gamma frailty model directly to continuous survival data with unspecified baseline hazard. Also, in this paper, a method for obtaining maximum likelihood estimates using EM is developed, while Fiocco and others (2009) used composite likelihood methods, as did Henderson and Shimakura (2003).
The construction presented in this paper extends the auto-correlated gamma frailty model of Fiocco and others (2009) in two more ways. First, a wide class of positive infinitely divisible frailty distributions is allowed, containing the gamma distribution, the power variance family (which in turn contains the inverse Gaussian distribution), and the positive stable distribution as special cases. Second, more general correlation structures can be defined by choosing different functions |$g(u,v)$| in the construction of Section 2.
The frailty processes considered in this paper are stationary in the sense that the distribution of |$Z(t)$| does not vary over |$t$|. The assumption of constant variance may not always be realistic in practice—although one should not confuse the variance of |$Z(t)$| with the variance of |$Z(t)$|among the survivors at |$t$|, the relative frailty variance in the terminology of Farrington and others (2012). Models where |${{\rm var}}(Z(t))$| is allowed to vary with |$t$| may be obtained by removing the restriction |$\nu (t) \equiv \nu$| in the construction of Section 2. More research is needed to determine how useful and manageable these more flexible models are.
Formula (3.2) of the marginal hazard, and its special case (3.4) for the gamma distribution, is reminiscent of the Burr model, describing the marginal hazard |$h(t) = {h_0(t) \exp (\mathbf {x}^\top \boldsymbol {\gamma })}/({1 + \theta ^{-1} H_0(t) \exp (\mathbf {x}^\top \boldsymbol {\gamma })})$| of a shared gamma frailty model with frailty variance |$\theta ^{-1}$|. As proposed in Perperoglou and others (2006) (see also van Houwelingen and Putter, 2012, Section 6.2), as an alternative to frailty models, one could use the so-called relaxed Burr model |$h(t) = {h_0(t) \exp (\mathbf {x}^\top \boldsymbol {\gamma })}/({1 +F(t;\theta ) \exp (\mathbf {x}^\top \boldsymbol {\gamma })})$|, with |$F(t;\theta )$| any nonnegative function, as definition of a model. A richer class of relaxed Burr models could be imagined using (3.4) as starting point, and replacing |$\tilde {H}_{\rm c}(u,v;t)$| by a more general nonnegative function |$F(u,v;t)$|.
We presented an EM-algorithm to estimate the parameters in the auto-correlated frailty process. Restricting attention to the gamma distribution, the formulas for the conditional expectation of the frailty at time |$t_j^* $|, |$\mathbf {E}(Z_g(t_j^* ) \,|\, {\rm data})$|, given the data of cluster |$g$|, resemble those in Petersen and others (1996, e.g., (23)–(25)). Petersen and others (1996) studied estimation in a class of correlated gamma frailty models proposed by Yashin and others (1995), using an EM-algorithm, and showed that the posterior distribution of the frailties given the data is a mixture of independent gamma random variables. The methods developed in this paper allow generalization of the correlated gamma frailties to correlated frailties of random variables within the wider class of infinitely divisible distributions considered in this paper.
Supplementary material
Supplementary Material is available at http://biostatistics.oxfordjournals.org.
Acknowledgement
Conflict of Interest: None declared.
References



