-
PDF
- Split View
-
Views
-
Cite
Cite
Ying Sheng, Yifei Sun, Chiung-Yu Huang, Mi-Ok Kim, Synthesizing External Aggregated Information in the Presence of Population Heterogeneity: A Penalized Empirical Likelihood Approach, Biometrics, Volume 78, Issue 2, June 2022, Pages 679–690, https://doi.org/10.1111/biom.13429
- Share Icon Share
Abstract
With the increasing availability of data in the public domain, there has been a growing interest in exploiting information from external sources to improve the analysis of smaller scale studies. An emerging challenge in the era of big data is that the subject-level data are high dimensional, but the external information is at an aggregate level and of a lower dimension. Moreover, heterogeneity and uncertainty in the auxiliary information are often not accounted for in information synthesis. In this paper, we propose a unified framework to summarize various forms of aggregated information via estimating equations and develop a penalized empirical likelihood approach to incorporate such information in logistic regression. When the homogeneity assumption is violated, we extend the method to account for population heterogeneity among different sources of information. When the uncertainty in the external information is not negligible, we propose a variance estimator adjusting for the uncertainty. The proposed estimators are asymptotically more efficient than the conventional penalized maximum likelihood estimator and enjoy the oracle property even with a diverging number of predictors. Simulation studies show that the proposed approaches yield higher accuracy in variable selection compared with competitors. We illustrate the proposed methodologies with a pediatric kidney transplant study.
1 Introduction
Personalized medicine has gained a great deal of attention over the past decade. The objective of personalized medicine is to tailor the treatment/intervention plan based on individual characteristics. To this end, summarizing evidence regarding treatment effect heterogeneity is a crucial step. In practice, however, individual studies are usually underpowered to detect important interactions or perform subgroup analyses. To increase the power, researchers have employed meta-analysis to synthesize information from different data sources. Traditional meta-analysis methods concerned with combining aggregate study-level data, such as odds ratios and mean differences. The major drawback of aggregate data analyses is that covariate-treatment interactions are usually not provided in the reports of primary analysis findings, making it difficult to conduct subgroup analysis (Simmonds and Higgins, 2007). An increasingly popular approach is meta-analysis of individual patient data (IPD), where the raw data from each study are obtained and analyzed directly. However, acquiring data from different institutions and generating a consistent data format across studies usually require a substantial amount of time and resource. Moreover, critical information about factors that influence the choice of treatment and clinical outcome may not be available for all studies.
In this paper, we present a unified framework for synthesizing information from a mixture of subject-level data and aggregate data. Various approaches have been proposed in the literature to leverage information from external sources to improve efficiency in analyzing smaller scale studies (Thall and Simon, 1990; Boonstra et al., 2016) and to build more accurate risk prediction models (Gail et al., 1989; Costantino et al., 1999; Chen et al., 2006; Gail, 2011; Liu et al., 2014; Chatterjee et al., 2016). An emerging challenge in the era of big data is that the subject-level information, such as genomics and proteomics profile, is high dimensional, whereas external information is typically at an aggregate level and low dimensional. Although meta-analysis has gained its popularity in the discovery of weak or sparse genetic features (Evangelou and Ioannidis, 2013; He et al., 2016; Pasaniuc and Price, 2017), existing approaches are not readily applicable when the external information is at an aggregate level and of a lower dimension. To tackle this problem, we propose a unified penalized empirical likelihood (PEL) approach for synthesizing information from different sources in a high-dimensional setting.
The empirical likelihood method was originally developed to obtain confidence regions that respect the boundaries of the support of the target parameter (Owen, 1988, 1990; Qin and Lawless, 1994). It has been applied to incorporate auxiliary information to improve estimation efficiency (Qin, 2000; Wu and Sitter, 2001; Chen et al., 2002; Chen and Qin, 2014; Qin et al., 2015; Huang et al., 2016; Han and Lawless, 2019). In a high-dimensional setting, Chen et al. (2009) and Hjort et al. (2009) studied the asymptotic properties of empirical likelihood-based inferences, while others, including Tang and Leng (2010), Leng and Tang (2012), and Chang et al. (2018), considered PEL methods for parameter estimation and variable selection. The existing work, however, is not applicable to a mixture of high-dimensional subject-level data and low-dimensional aggregate data. In this paper, we demonstrate that various forms of auxiliary aggregated information can be re-expressed as population estimating equations under a postulated logistic regression model for the subject-level data and thus can be readily incorporated as constraints under the proposed PEL framework.
The issue of whether the results from an external study are applicable to the current study arises when combining information from different sources. In particular, study design and inclusion and exclusion criteria often vary across studies, leading to population heterogeneity among different sources of information. Such heterogeneity, if not addressed properly, can lead to biased estimates of predictor effects. To tackle this problem, we construct a PEL ratio test statistic to check if the external information is consistent with the subject-level data. When the homogeneity assumption fails, we postulate a semiparametric density ratio model to account for population heterogeneity between different sources of information. The regression parameters in the density ratio model quantify the degree of heterogeneity in the predictor distributions and can be incorporated in the estimating equations derived from the auxiliary information. Moreover, as will be shown later, the additional set of parameters in the density ratio model is identifiable with a sufficient number of summary statistics of the predictors in the external studies. The proposed PEL approach yields higher variable selection accuracy and improved estimation efficiency; moreover, it enjoys the oracle property in the sense that it can achieve sparsity and optimal estimation efficiency simultaneously.
Finally, it is possible that the external aggregated information was derived from studies of similar sample sizes as the internal study and hence uncertainty in the auxiliary information is not negligible. In this case, we study the asymptotic results to show explicitly the impact of uncertainty on the efficiency gain of the proposed PEL approach. We further propose a modified variance estimator that properly accounts for the uncertainty in the external information. To the best of our knowledge, it is the first paper that considers the population heterogeneity and uncertainty in the auxiliary information when combining information from both high-dimensional subject-level data and low-dimensional aggregate data.
2 Proposed Methodologies
2.1 Different Forms of Auxiliary Information
Let denote a binary outcome of interest. Consider a logistic regression model

where contains p predictors and
is a
vector of regression parameters. To lay the groundwork, we begin with a brief discussion of the general method for summarizing different forms of auxiliary information. For ease of discussion, we assume that the external aggregated information is consistent with the subject-level data, that is, the internal and external studies were conducted on the same population. Moreover, we further assume that the auxiliary information was derived from large databases so that its uncertainty can be ignored. The two assumptions will be relaxed later in Sections 2.3 and 2.4. Our goal is to approximate the population moments by the sample moments with the available subject-level data. We will then treat the sample moment estimating equations as constraints under the empirical likelihood framework to synthesize information from external sources. We give three examples for illustration.
Example 1. For a binary outcome, the most common form of auxiliary information is the event rate in subgroups. For example, in kidney transplant studies, the acute rejection rates in patients who received living versus cadaveric donor organs are usually reported. Note that the subgroups are not necessarily mutually exclusive. Write , where ϕ is the event rate in the subgroup and Ω is the subgroup whose event rate was reported in an external study. This equation can be re-expressed as
. By double expectation and under Model (1), the auxiliary information can be expressed by the population estimating equation
, where
.
Example 2. Summary statistics for the predictors are commonly available in study reports. For example, the proportion of patients who received cadaveric transplants and the mean donor age are usually available in kidney transplant research. Obviously, summary statistics can be expressed as population estimating equations , where
. Common choices of
include
,
, and
, representing, respectively, the proportion, mean, and variance of the predictor in the target population. Note that in this example,
does not involve
directly. However, efficiency gain may be expected under the empirical likelihood framework due to correlation between constraints formed using the same set of subject-level data.
Example 3. Many publications also report the results of univariate and multivariate logistic regression analyses. However, covariates included in these models may not be the same as those considered in Model (1). For example, in studying the effect of steroid avoidance protocols on the risk of acute rejection in kidney transplant patients, the previous studies may have included a smaller set of potential confounders than what are considered in the current data analysis. Write , where
and
are disjoint subsets of
with dimensions K and
, respectively. Consider the case that the coefficient
of
in a reduced logistic regression model
was available. It is known that the true regression coefficient
is the solution to the expected score equation derived from the reduced model, that is,
. Under the full model (1) and following the law of total expectation, the auxiliary information can be expressed as
, where
.
2.2 PEL for Synthesizing Auxiliary Information
In this section, we introduce a unified PEL method that can synthesize various forms of external aggregated information in variable selection and parameter estimation. Suppose the subject-level data , are independent and identically distributed (i.i.d.) realizations of
. Denote by G the marginal distribution function of
. Consider the log full likelihood
, where
is the log conditional likelihood of Y given
and
is the log marginal likelihood of
. Denote by
the jump size of G at
,
, and write
. Then the log marginal likelihood can be re-expressed as
. We are interested in a sparse high-dimensional setting where p is allowed to increase with n. Write
, where
is a
vector of parameters that includes the intercept β0 and the nonzero coefficients, and
is a collection of zero parameters.
As illustrated above, various forms of aggregated information can be summarized as a system of K-dimensional population estimating equations , where
and
. Note that, although the subject-level data are high dimensional, the external information is often of a low dimension and thus we assume that K is fixed and finite. To combine information from different sources, one can minimize the negative log full likelihood
subject to the constraints,

The first two constraints ensure G is a proper distribution function. The last constraint is constructed by approximating the population estimating equation derived from the external aggregated information, , using the observed subject-level data. For stability in implementations, a nested optimization algorithm based on the corresponding dual problem is recommended to solve the constrained minimization problem (Chen et al., 2002; Imbens, 2002; Donald et al., 2003; Han and Lawless, 2019).
With a diverging number of predictors, direct minimization of subject to the constraints often leads to estimators with large variances; as a result, the fitted model could have poor prediction performance. To tackle this problem, we consider incorporating the adaptive least absolute shrinkage and selection operator (lasso) penalty (Zou, 2006) into the constrained minimization for variable selection and parameter estimation simultaneously. Specifically, the adaptive lasso PEL estimator is defined by

where is a tuning parameter and
are adaptive weights. Following Zou and Zhang (2009), we propose to use the adaptive weights
, where
is a constant and
is the penalized maximum likelihood estimator (MLE) with elastic net (Zou and Hastie, 2005). Since
is a consistent estimator of
, the value of
reasonably reflects the importance of
. With proper weights, the adaptive lasso method puts smaller penalties on important predictors than unimportant ones and thus the important predictors are more likely to be selected.
In what follows, we tackle the penalized constrained minimization problem in (2) by solving the corresponding dual problem. The Lagrange function for the penalized constrained minimization problem in (2) is given by , where η and
are Lagrange multipliers. With some calculations, we can derive
and
, where the Lagrange multiplier
satisfies
. Hence the penalized negative constrained log full likelihood function can be expressed as
, where, up to a constant,

Arguing as in Newey and Smith (2004), the penalized constrained minimization can be carried out equivalently by

The large sample properties of are summarized below in Theorem 1 and Theorem 2, with the proofs given in the Supporting Information.
Theorem 1. Under conditions (C1)–(C3) in the Appendix, suppose that as ,
,
and
, then
.
Theorem 2. Write , where
and
are estimators of
and
, respectively. Let
be the
predictors corresponding to
. Suppose that the conditions specified in Theorem 1 are satisfied and
as
. Moreover, assume that
and
are positive definite. Then, as
, we have (i)
with probability tending to 1; and (ii)
converges in distribution to N(0, 1), where
is an arbitrary
vector satisfying
and
.
Under the proposed PEL framework, both the dimensionality of predictors and the number of nonzero coefficients are allowed to increase with the sample size n; moreover, the minimal signal strength is allowed to decrease with n. Theorem 1 shows that the proposed estimator
is a root-
consistent estimator and Theorem 2 establishes the sparsity property and asymptotic normality. By Theorem 2, the proposed estimator
enjoys the oracle property; that is,
can be estimated as zero with probability tending to one and
is asymptotically as efficient as the empirical likelihood estimator for estimating
by incorporating auxiliary information as if we knew
in advance. Define

which is the adaptive lasso penalized MLE without incorporating the auxiliary information. Write , where
and
are estimators of
and
, respectively. Following Zou and Zhang (2009), we can show that
converges in distribution to N(0, 1) as
. Hence Theorem 2 implies that the proposed PEL estimator
is asymptotically more efficient than the conventional penalized MLE
.
Applying the local quadratic algorithm (Fan and Li, 2001; Fan and Peng, 2004), the covariance matrix of can be estimated by

For the nonzero parameters, the local quadratic algorithm is applied to approximate the adaptive lasso penalty and thus leads to in (6). As
, we have
and (6) converges in probability to the asymptotic covariance matrix in Theorem 2. More details are given in the Supporting Information.
2.3 Synthesizing Auxiliary Information in the Presence of Population Heterogeneity
The premise of information synthesis is that the underlying relationship between the response and predictors is the same between different data sources. The above estimation procedure additionally assumes homogeneity in the predictor distribution and thus motivates us to approximate the population estimating equations by the sample estimating equations. Due to differences in the inclusion and exclusion criteria, however, this assumption may not hold. Following Qin and Lawless (1994, 1995), we develop a PEL ratio test to check the homogeneity assumption, that is, . Note that when
,
is minimized by
, where
is defined by (3) and
is defined by (5). Therefore, we consider a PEL ratio test with the test statistics
, where
is the solution to
. By Theorem 3 below, we reject the null hypothesis of homogeneity at a type I error rate of 5% if
, where
is the 95th percentile of the chi-squared distribution with K degrees of freedom.
Theorem 3. Under the conditions specified in Theorem 2 and the null hypothesis that , the test statistic
converges in distribution to a χ2 random variable with K degrees of freedom as
.
When the homogeneity assumption fails to hold, we extend the proposed procedure to allow the predictors in the external source data to have a different distribution. In most applications, a subvector of accounts for the predictor distribution heterogeneity between the data sources. In practice, summary statistics available for the predictors in the external source usually inform the selection of the subvector that accounts for the predictor distributional differences. With
denoting the fixed-length p0-dimensional predictors accounting for the predictor distributional differences, we propose a semiparametric density ratio model

where and
denote the density functions of
in the external data and the subject-level data, respectively, with
unspecified. The model is equivalent to imposing a logistic regression model for the odds of membership in the external study relative to that in the internal study, where nonzero regression coefficients in the logistic regression model only involve that in the subvector
. The coefficient
characterizes the degree of heterogeneity in covariate distribution relative to the internal study. Obviously,
indicates fidelity to the homogeneity assumption. For model identifiability, we assume that
. In most situations, aggregated information in the form of summary statistics for predictors is often available for checking the heterogeneity in the predictor distribution. Such aggregated information, as illustrated below, can be readily incorporated under the proposed PEL framework and thus the identifiability condition can be easily met.
As illustrated in Section 2.1, the external auxiliary information can be summarized by population estimating equations , where the expectation
is evaluated with respect to the covariate distribution in the external study. In other words, the external information is given by
. Under the density ratio model (7), the population moment constraints can be expressed as
, where
and the expectation is evaluated with respect to the covariate distribution in the internal study.
To synthesize information from a mixture of the subject-level and aggregate data in the presence of population heterogeneity, we consider minimizing the penalized negative log full likelihood subject to the constraints

The third constraint follows from the density ratio model (7) such that and the last constraint is obtained by approximating
using the observed subject-level data in the presence of population heterogeneity. In this case, the Lagrange function for the penalized constrained minimization is given by
, where
, and
are Lagrange multipliers. Some calculations lead to
and
, where ξ0 and
are determined by
and\break
. Then the penalized negative constrained log full likelihood function is
, where, up to a constant,
. Hence the PEL estimator that accounts for the population heterogeneity is defined as

Write , where
and
are the corresponding estimators for
and
, respectively. Theorem 4 below shows that the proposed estimator
also enjoys the oracle property and is asymptotically more efficient than the conventional penalized MLE
defined by (5). The proof is given in the Supporting Information.
Theorem 4. Under the conditions specified in Theorem 2, as , we have (i)
with probability tending to 1 and (ii)
converges in distribution to N(0, 1) for an arbitrary
vector
satisfying
with
and
being defined in the Supporting Information.
2.4 Synthesizing Auxiliary Information with Uncertainty
We have assumed in the previous discussions that the auxiliary information comes from a large database where the sample size of the external data, denoted by m, satisfies as
. Therefore, the variability in the estimates derived from the external data is much smaller than that in the estimation of
using the subject-level data and hence can be ignored. We now consider the case where m is of the same order of n, that is, assume that
converges to a constant κ as
. In this case, incorporating uncertainty of the auxiliary information in the proposed inference procedure is desired. Denote by
the root-m consistent estimator of the population parameter
using the external data and assume that
is asymptotically normal. Along the same line as in Section 2.2, one can replace
with
in the derivation and estimate
by

The large sample properties of the proposed estimator are summarized below in Theorem 5, with the proof given in the Supporting Information.
Theorem 5. Under the conditions specified in Theorem 2, as , we have (i)
with probability tending to 1 and (ii)
converges in distribution to N(0, 1) for an arbitrary
vector
satisfying
with
and
being defined in the Supporting Information.
By Theorem 5, the proposed estimator enjoys the oracle property and is asymptotically more efficient than the penalized MLE
given in (5). The efficiency improvement decreases with κ. When
, that is, κ is a very small constant,
is approximately
and thus
enjoys a substantial efficiency gain. When
, the efficiency gain is close to 0. Moreover, we propose a modified variance estimator that properly adjusts for the uncertainty in the auxiliary information. Specifically, the covariance matrix of nonzero coefficients can be consistently estimated by

where ,
, and
are given in the Supporting Information.
3 Simulations and Data Analysis
3.1 Numerical Simulations
Simulation studies were conducted to evaluate the finite-sample performances of the proposed PEL approaches and the penalized maximum likelihood (PML) approach that does not incorporate the auxiliary information. In each simulation setting, we generated X1 from a Bernoulli distribution with and
independently from the standard normal distribution with
. The binary outcome Y was generated from the logistic regression model
, where
,
, and thus the first five predictors are considered important. We assumed
under the density ratio model (7). In each simulation, we generated 500 simulated data sets, each with a sample size of 200.
We first assumed that the homogeneity assumption holds, that is, in the density ratio model, and considered two forms of external aggregated information: (I) the conditional probabilities of
for two subgroups
and
and the means of X1 and X2 and (II) the regression coefficients in a reduced logistic model with predictors
. For the PEL method and its extension that accounts for the population heterogeneity (PEL
), the sample size of the external study was large and the uncertainty in the auxiliary information was ignored. For the external information form (I), the probabilities of
for the two subgroups are approximately 0.31 and 0.38, and the means of X1 and X2 are approximately 0.5 and 0. For the external information form (II), the coefficients in the reduced model given
are approximately ( − 0.95, 0.28, 0.19). For the approach that accounts for uncertainty in the auxiliary information (PEL*), the auxiliary information was estimated from an independent sample with sample size
.
Table 1 summarizes the variable selection results by percentages of selecting each important predictor (i.e., true positive percentages), and the average percentage of selecting unimportant predictors (i.e., average false positive percentage). As shown in Table 1, the proposed PEL approaches yield higher variable selection accuracy when compared with the PML approach that does not incorporate auxiliary information. The variable selection accuracy improvement is most clear over X1 and X2 where the auxiliary information was available, ranging from 6.6% to 12.0% in selecting X1 and from 4.0% to 8.8% in selecting X2. Table 2 summarizes the parameter estimation results. As shown in Table 2, all three proposed PEL approaches yield a substantial efficiency gain over the PML approach. The relative efficiency ranges from 1.81 to 2.29 in estimating β1 and ranges from 1.21 to 1.66 in estimating β2.
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 43.0 | 39.4 | 41.6 |
X2 | 43.2 | 49.4 | 51.4 | 47.2 |
X3 | 43.2 | 47.0 | 44.8 | 46.2 |
X4 | 76.0 | 81.4 | 77.2 | 81.6 |
X5 | 76.2 | 80.2 | 79.0 | 80.8 |
Average FP | 20.7 | 20.6 | 20.5 | 21.1 |
External information form (II) | ||||
X1 | 32.8 | 43.4 | 42.4 | 44.8 |
X2 | 43.2 | 48.4 | 52.0 | 49.0 |
X3 | 43.2 | 44.0 | 47.8 | 44.4 |
X4 | 76.0 | 81.8 | 77.4 | 82.0 |
X5 | 76.2 | 80.2 | 78.8 | 80.0 |
Average FP | 20.7 | 21.0 | 21.2 | 20.8 |
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 43.0 | 39.4 | 41.6 |
X2 | 43.2 | 49.4 | 51.4 | 47.2 |
X3 | 43.2 | 47.0 | 44.8 | 46.2 |
X4 | 76.0 | 81.4 | 77.2 | 81.6 |
X5 | 76.2 | 80.2 | 79.0 | 80.8 |
Average FP | 20.7 | 20.6 | 20.5 | 21.1 |
External information form (II) | ||||
X1 | 32.8 | 43.4 | 42.4 | 44.8 |
X2 | 43.2 | 48.4 | 52.0 | 49.0 |
X3 | 43.2 | 44.0 | 47.8 | 44.4 |
X4 | 76.0 | 81.8 | 77.4 | 82.0 |
X5 | 76.2 | 80.2 | 78.8 | 80.0 |
Average FP | 20.7 | 21.0 | 21.2 | 20.8 |
Note. PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; FP, false positive (%).
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 43.0 | 39.4 | 41.6 |
X2 | 43.2 | 49.4 | 51.4 | 47.2 |
X3 | 43.2 | 47.0 | 44.8 | 46.2 |
X4 | 76.0 | 81.4 | 77.2 | 81.6 |
X5 | 76.2 | 80.2 | 79.0 | 80.8 |
Average FP | 20.7 | 20.6 | 20.5 | 21.1 |
External information form (II) | ||||
X1 | 32.8 | 43.4 | 42.4 | 44.8 |
X2 | 43.2 | 48.4 | 52.0 | 49.0 |
X3 | 43.2 | 44.0 | 47.8 | 44.4 |
X4 | 76.0 | 81.8 | 77.4 | 82.0 |
X5 | 76.2 | 80.2 | 78.8 | 80.0 |
Average FP | 20.7 | 21.0 | 21.2 | 20.8 |
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 43.0 | 39.4 | 41.6 |
X2 | 43.2 | 49.4 | 51.4 | 47.2 |
X3 | 43.2 | 47.0 | 44.8 | 46.2 |
X4 | 76.0 | 81.4 | 77.2 | 81.6 |
X5 | 76.2 | 80.2 | 79.0 | 80.8 |
Average FP | 20.7 | 20.6 | 20.5 | 21.1 |
External information form (II) | ||||
X1 | 32.8 | 43.4 | 42.4 | 44.8 |
X2 | 43.2 | 48.4 | 52.0 | 49.0 |
X3 | 43.2 | 44.0 | 47.8 | 44.4 |
X4 | 76.0 | 81.8 | 77.4 | 82.0 |
X5 | 76.2 | 80.2 | 78.8 | 80.0 |
Average FP | 20.7 | 21.0 | 21.2 | 20.8 |
Note. PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; FP, false positive (%).
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −18 | 20 | 24 | −18 | 20 | 26 | −17 | 21 | 25 |
β2 | −8 | 18 | −7 | 16 | 7 | −7 | 16 | 9 | −7 | 16 | 10 |
β3 | −8 | 18 | −7 | 16 | 16 | −8 | 16 | 16 | −8 | 16 | 16 |
β4 | −7 | 25 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
β5 | −7 | 24 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −18 | 19 | 8 | −19 | 19 | 25 | −17 | 19 | 12 |
β2 | −8 | 18 | −9 | 14 | 5 | −9 | 14 | 12 | −9 | 14 | 8 |
β3 | −8 | 18 | −9 | 15 | 16 | −8 | 16 | 16 | −10 | 15 | 16 |
β4 | −7 | 25 | −11 | 19 | 16 | −10 | 22 | 16 | −12 | 19 | 17 |
β5 | −7 | 24 | −10 | 21 | 17 | −10 | 21 | 16 | −10 | 21 | 18 |
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −18 | 20 | 24 | −18 | 20 | 26 | −17 | 21 | 25 |
β2 | −8 | 18 | −7 | 16 | 7 | −7 | 16 | 9 | −7 | 16 | 10 |
β3 | −8 | 18 | −7 | 16 | 16 | −8 | 16 | 16 | −8 | 16 | 16 |
β4 | −7 | 25 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
β5 | −7 | 24 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −18 | 19 | 8 | −19 | 19 | 25 | −17 | 19 | 12 |
β2 | −8 | 18 | −9 | 14 | 5 | −9 | 14 | 12 | −9 | 14 | 8 |
β3 | −8 | 18 | −9 | 15 | 16 | −8 | 16 | 16 | −10 | 15 | 16 |
β4 | −7 | 25 | −11 | 19 | 16 | −10 | 22 | 16 | −12 | 19 | 17 |
β5 | −7 | 24 | −10 | 21 | 17 | −10 | 21 | 16 | −10 | 21 | 18 |
Note. are the regression parameters with the true values of (0.3,0.2,0.2,0.4,0.4); PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL
, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; Bias, SD, and SEE, empirical bias (× 100), empirical standard deviation (× 100), and empirical mean of standard error estimates (× 100).
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −18 | 20 | 24 | −18 | 20 | 26 | −17 | 21 | 25 |
β2 | −8 | 18 | −7 | 16 | 7 | −7 | 16 | 9 | −7 | 16 | 10 |
β3 | −8 | 18 | −7 | 16 | 16 | −8 | 16 | 16 | −8 | 16 | 16 |
β4 | −7 | 25 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
β5 | −7 | 24 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −18 | 19 | 8 | −19 | 19 | 25 | −17 | 19 | 12 |
β2 | −8 | 18 | −9 | 14 | 5 | −9 | 14 | 12 | −9 | 14 | 8 |
β3 | −8 | 18 | −9 | 15 | 16 | −8 | 16 | 16 | −10 | 15 | 16 |
β4 | −7 | 25 | −11 | 19 | 16 | −10 | 22 | 16 | −12 | 19 | 17 |
β5 | −7 | 24 | −10 | 21 | 17 | −10 | 21 | 16 | −10 | 21 | 18 |
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −18 | 20 | 24 | −18 | 20 | 26 | −17 | 21 | 25 |
β2 | −8 | 18 | −7 | 16 | 7 | −7 | 16 | 9 | −7 | 16 | 10 |
β3 | −8 | 18 | −7 | 16 | 16 | −8 | 16 | 16 | −8 | 16 | 16 |
β4 | −7 | 25 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
β5 | −7 | 24 | −10 | 20 | 16 | −10 | 21 | 16 | −10 | 20 | 17 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −18 | 19 | 8 | −19 | 19 | 25 | −17 | 19 | 12 |
β2 | −8 | 18 | −9 | 14 | 5 | −9 | 14 | 12 | −9 | 14 | 8 |
β3 | −8 | 18 | −9 | 15 | 16 | −8 | 16 | 16 | −10 | 15 | 16 |
β4 | −7 | 25 | −11 | 19 | 16 | −10 | 22 | 16 | −12 | 19 | 17 |
β5 | −7 | 24 | −10 | 21 | 17 | −10 | 21 | 16 | −10 | 21 | 18 |
Note. are the regression parameters with the true values of (0.3,0.2,0.2,0.4,0.4); PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL
, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; Bias, SD, and SEE, empirical bias (× 100), empirical standard deviation (× 100), and empirical mean of standard error estimates (× 100).
We then considered the case that the population heterogeneity is present. Two important predictors, , and one unimportant predictor, X20, accounted for the predictor distributional differences in the density ratio model with
. For the proposed PEL approach and its extension PEL
, the uncertainty in the auxiliary information was ignored. Under the heterogeneous population setting, for the external information form (I), the probabilities of
for the two subgroups are approximately 0.32 and 0.39, and the means of X1 and X2 are approximately 0.62 and 0.5. For the external information form (II), the coefficients in the reduced model given
are approximately ( − 0.92, 0.27, 0.18). For the proposed PEL* method, the auxiliary information was estimated from independent samples of
subjects in each simulation. Tables 3 and 4 summarize the variable selection and parameter estimation results, respectively. As expected, the proposed PEL
method performs well, maintaining variable selection accuracy and estimation efficiency gains over the PML method. The variable selection improvement ranges from 8.8% to 9% in selecting X1 and from 6.0% to 7.0% in selecting X2. The relative efficiency ranges from 2.17 to 2.48 in estimating β1 and ranges from 1.07 to 1.59 in estimating β2. Two approaches that proposed under the homogeneity assumption (PEL and PEL*), however, yield larger biases.
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 42.4 | 41.8 | 42.8 |
X2 | 43.2 | 45.2 | 50.2 | 47.8 |
X3 | 43.2 | 45.4 | 44.6 | 46.6 |
X4 | 76.0 | 79.6 | 78.2 | 81.6 |
X5 | 76.2 | 79.4 | 79.2 | 81.8 |
Average FP | 20.7 | 22.2 | 21.4 | 21.9 |
External information form (II) | ||||
X1 | 32.8 | 39.6 | 41.6 | 39.6 |
X2 | 43.2 | 46.2 | 49.2 | 46.2 |
X3 | 43.2 | 42.2 | 45.2 | 42.2 |
X4 | 76.0 | 79.6 | 77.2 | 79.4 |
X5 | 76.2 | 77.4 | 79.0 | 78.0 |
Average FP | 20.7 | 22.9 | 21.1 | 22.9 |
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 42.4 | 41.8 | 42.8 |
X2 | 43.2 | 45.2 | 50.2 | 47.8 |
X3 | 43.2 | 45.4 | 44.6 | 46.6 |
X4 | 76.0 | 79.6 | 78.2 | 81.6 |
X5 | 76.2 | 79.4 | 79.2 | 81.8 |
Average FP | 20.7 | 22.2 | 21.4 | 21.9 |
External information form (II) | ||||
X1 | 32.8 | 39.6 | 41.6 | 39.6 |
X2 | 43.2 | 46.2 | 49.2 | 46.2 |
X3 | 43.2 | 42.2 | 45.2 | 42.2 |
X4 | 76.0 | 79.6 | 77.2 | 79.4 |
X5 | 76.2 | 77.4 | 79.0 | 78.0 |
Average FP | 20.7 | 22.9 | 21.1 | 22.9 |
Note. PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; FP, false positive (%).
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 42.4 | 41.8 | 42.8 |
X2 | 43.2 | 45.2 | 50.2 | 47.8 |
X3 | 43.2 | 45.4 | 44.6 | 46.6 |
X4 | 76.0 | 79.6 | 78.2 | 81.6 |
X5 | 76.2 | 79.4 | 79.2 | 81.8 |
Average FP | 20.7 | 22.2 | 21.4 | 21.9 |
External information form (II) | ||||
X1 | 32.8 | 39.6 | 41.6 | 39.6 |
X2 | 43.2 | 46.2 | 49.2 | 46.2 |
X3 | 43.2 | 42.2 | 45.2 | 42.2 |
X4 | 76.0 | 79.6 | 77.2 | 79.4 |
X5 | 76.2 | 77.4 | 79.0 | 78.0 |
Average FP | 20.7 | 22.9 | 21.1 | 22.9 |
. | PML . | PEL . | PEL![]() | PEL* . |
---|---|---|---|---|
External information form (I) | ||||
X1 | 32.8 | 42.4 | 41.8 | 42.8 |
X2 | 43.2 | 45.2 | 50.2 | 47.8 |
X3 | 43.2 | 45.4 | 44.6 | 46.6 |
X4 | 76.0 | 79.6 | 78.2 | 81.6 |
X5 | 76.2 | 79.4 | 79.2 | 81.8 |
Average FP | 20.7 | 22.2 | 21.4 | 21.9 |
External information form (II) | ||||
X1 | 32.8 | 39.6 | 41.6 | 39.6 |
X2 | 43.2 | 46.2 | 49.2 | 46.2 |
X3 | 43.2 | 42.2 | 45.2 | 42.2 |
X4 | 76.0 | 79.6 | 77.2 | 79.4 |
X5 | 76.2 | 77.4 | 79.0 | 78.0 |
Average FP | 20.7 | 22.9 | 21.1 | 22.9 |
Note. PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; FP, false positive (%).
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −20 | 17 | 23 | −18 | 19 | 26 | −20 | 17 | 24 |
β2 | −8 | 18 | −12 | 11 | 6 | −7 | 17 | 12 | −12 | 11 | 9 |
β3 | −8 | 18 | −12 | 11 | 16 | −9 | 15 | 16 | −12 | 11 | 16 |
β4 | −7 | 25 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 13 | 17 |
β5 | −7 | 24 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 14 | 16 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −21 | 16 | 7 | −19 | 18 | 26 | −20 | 17 | 10 |
β2 | −8 | 18 | −13 | 11 | 4 | −9 | 14 | 12 | −13 | 10 | 7 |
β3 | −8 | 18 | −13 | 11 | 16 | −8 | 16 | 16 | −13 | 11 | 17 |
β4 | −7 | 25 | −19 | 17 | 16 | −10 | 22 | 16 | −19 | 17 | 16 |
β5 | −7 | 24 | −18 | 18 | 16 | −11 | 21 | 16 | −18 | 18 | 17 |
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −20 | 17 | 23 | −18 | 19 | 26 | −20 | 17 | 24 |
β2 | −8 | 18 | −12 | 11 | 6 | −7 | 17 | 12 | −12 | 11 | 9 |
β3 | −8 | 18 | −12 | 11 | 16 | −9 | 15 | 16 | −12 | 11 | 16 |
β4 | −7 | 25 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 13 | 17 |
β5 | −7 | 24 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 14 | 16 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −21 | 16 | 7 | −19 | 18 | 26 | −20 | 17 | 10 |
β2 | −8 | 18 | −13 | 11 | 4 | −9 | 14 | 12 | −13 | 10 | 7 |
β3 | −8 | 18 | −13 | 11 | 16 | −8 | 16 | 16 | −13 | 11 | 17 |
β4 | −7 | 25 | −19 | 17 | 16 | −10 | 22 | 16 | −19 | 17 | 16 |
β5 | −7 | 24 | −18 | 18 | 16 | −11 | 21 | 16 | −18 | 18 | 17 |
Note. are the regression parameters with the true values of (0.3,0.2,0.2,0.4,0.4); PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL
, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; Bias, SD, and SEE, empirical bias (× 100), empirical standard deviation (× 100), and empirical mean of standard error estimates (× 100).
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −20 | 17 | 23 | −18 | 19 | 26 | −20 | 17 | 24 |
β2 | −8 | 18 | −12 | 11 | 6 | −7 | 17 | 12 | −12 | 11 | 9 |
β3 | −8 | 18 | −12 | 11 | 16 | −9 | 15 | 16 | −12 | 11 | 16 |
β4 | −7 | 25 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 13 | 17 |
β5 | −7 | 24 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 14 | 16 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −21 | 16 | 7 | −19 | 18 | 26 | −20 | 17 | 10 |
β2 | −8 | 18 | −13 | 11 | 4 | −9 | 14 | 12 | −13 | 10 | 7 |
β3 | −8 | 18 | −13 | 11 | 16 | −8 | 16 | 16 | −13 | 11 | 17 |
β4 | −7 | 25 | −19 | 17 | 16 | −10 | 22 | 16 | −19 | 17 | 16 |
β5 | −7 | 24 | −18 | 18 | 16 | −11 | 21 | 16 | −18 | 18 | 17 |
Coef . | PML . | PEL . | PEL![]() | PEL* . | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Bias . | SD . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | Bias . | SD . | SEE . | |
External information form (I) | |||||||||||
β1 | −16 | 28 | −20 | 17 | 23 | −18 | 19 | 26 | −20 | 17 | 24 |
β2 | −8 | 18 | −12 | 11 | 6 | −7 | 17 | 12 | −12 | 11 | 9 |
β3 | −8 | 18 | −12 | 11 | 16 | −9 | 15 | 16 | −12 | 11 | 16 |
β4 | −7 | 25 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 13 | 17 |
β5 | −7 | 24 | −21 | 14 | 16 | −11 | 21 | 16 | −21 | 14 | 16 |
External information form (II) | |||||||||||
β1 | −16 | 28 | −21 | 16 | 7 | −19 | 18 | 26 | −20 | 17 | 10 |
β2 | −8 | 18 | −13 | 11 | 4 | −9 | 14 | 12 | −13 | 10 | 7 |
β3 | −8 | 18 | −13 | 11 | 16 | −8 | 16 | 16 | −13 | 11 | 17 |
β4 | −7 | 25 | −19 | 17 | 16 | −10 | 22 | 16 | −19 | 17 | 16 |
β5 | −7 | 24 | −18 | 18 | 16 | −11 | 21 | 16 | −18 | 18 | 17 |
Note. are the regression parameters with the true values of (0.3,0.2,0.2,0.4,0.4); PML, the adaptive lasso penalized maximum likelihood approach; PEL, the adaptive lasso penalized empirical likelihood approach; PEL
, the adaptive lasso penalized empirical likelihood approach accounting for population heterogeneity; PEL*, the adaptive lasso penalized empirical likelihood approach accounting for uncertainty in the auxiliary information; Bias, SD, and SEE, empirical bias (× 100), empirical standard deviation (× 100), and empirical mean of standard error estimates (× 100).
3.2 Pediatric Kidney Transplantation Data Analysis
End-stage renal disease (ESRD) is the last stage of chronic kidney disease. The incidence of ESRD in children and adolescents, 0 to 21 years of age, is steadily decreasing from 17.5 per million in 2004 to 13.8 per million population in 2016. Over the past few decades, kidney transplantation (KT) has emerged as the optimal treatment of ESRD, providing a significant survival advantage over dialysis. Approximately 800 children in the United States undergo KT, representing about 5% of all kidney transplants in the nation each year. The outcome of pediatric KT has been improved significantly owing to the advances in immunosuppressants, including tacrolimus (TAC) and mycophenolate mofetil (MMF), and surgical techniques. However, rejection after transplant in children remains a serious concern, as it can lead to significant morbidity, graft loss, and death. We investigated important predictors of acute rejection in the pediatric population post wide adoption of the advanced immunosuppressants.
We analyzed data collected at transplant centers in California from July 2004 to June 2014. The binary outcome acute rejection was defined as recorded rejection episode within 6 months post transplantation. Of a total of 833 eligible pediatric kidney transplants, 13 were undetermined cases due to loss to follow-up and were ignored. Among the remaining 820 pediatric kidney transplant recipients (<18 years of age) who received TAC and MMF immunosuppression for prevention of rejection, 61 (7.4%) experienced early acute rejection. We considered 18 potential predictors, including demographic and immunological factors and key interaction terms with donor type, and applied the proposed method for variable selection. We exploited proportions of deceased donors and older donors along with the subgroup acute rejection rates by donor type reported in the entire Organ Procurement and Transplantation Network (OPTN) data. The rate of acute rejection was 9.07% among deceased donor transplants and 6.28% among living donor transplants. Also 61.16% were deceased donor transplants and 25.89% received organs from donors older than 35 years.
We applied the PEL ratio test proposed in Theorem 3 to test the homogeneity assumption. The null hypothesis was rejected at a type I error rate of 5% () and the heterogeneity between the subject-level predictor data collected in California and the entire OPTN data were suggested. We hence applied the proposed PEL
approach in Section 2.3 to account for the population heterogeneity. Four predictors (deceased donor, African American [AA] race, induction therapy, and the per center total number of pediatric transplants conducted during the study period [>100 versus ⩽100]) were included in the density ratio model (7) to account for heterogeneity in the predictor distributions. The selection was based on comparison of the reported summary statistics of the predictors in the entire OPTN data (Nehus et al., 2017) with the corresponding sample statistics in the California data (see Table S1 in the Supporting Information). For comparison, we also analyzed the data with the PML method without incorporating the auxiliary information.
Coefficient estimates derived using penalization methods change as a function of the tuning parameter in the penalty term. Figure 1 shows the coefficient estimates as functions of for the PML approach and the proposed PEL
approach. Notably the coefficient estimate curve of the donor age (≥35 years) derived using the proposed PEL
method decreases more slowly than that derived using the PML method. The coefficient estimates with the tuning parameter selected using the Bayesian information criteria (BIC)-type criteria are noted by where their respective curves cross the vertical lines in the figure. The PML method selects four predictors: delayed graft function, panel reactive antibody (
), deceased donor, and its interaction with recipient age (≥6 years). By incorporating the external aggregated information, the proposed PEL
method additionally selects donor age (≥35 years) as an important predictor. Donor age has been recognized as a prognostic factor. Starting from October 2005, the OPTN has implemented a new allocation policy, known as Share-35 (S35), which gives high priority access to pediatric recipients for kidney transplants from deceased donors ages <35 years.

The solution paths for the pediatric kidney transplant study
Note: The vertical lines indicate the tuning parameter values selected for the two approaches using the BIC-type criteria. Each curve represents the coefficient estimate trajectory for each predictor (labeled on the right) as a function of . PRA, panel reactive antibodies; induction therapy: include Interleukin-2 receptor antibody and lymphocyte depleting agents; number of pediatric transplants, per center total number of pediatric transplants performed during the study period; HLA mismatch, human leukocyte antigen mismatches.
We refitted the model with the selected five important variables only, using the proposed extended empirical likelihood (EL) approach without penalization. The results obtained using the usual maximum likelihood (ML) approach were reported for comparison. The bootstrap method was used to estimate the standard errors of the regression coefficient estimates and the corresponding confidence intervals. As shown in Table 5, the two methods yield similar coefficient estimates. More importantly, the proposed EL
method yields a substantial efficiency gain, with the relative efficiency ranging from 1.334 to 4.561. The result is well expected since we incorporated the external aggregated information. It is worthwhile to point out that the effects of the two known important predictors, donor age (≥35 years) and delayed graft function, do not reach statistical significance when applying the ML method but are found to be statistically significant when applying the proposed EL
method. Specifically, donor age (≥35 years) and delayed graft function are significantly associated with higher acute rejection rates, with odds ratios of
(95% CI, [1.226, 2.743]) and
(95% CI, [1.811, 5.490]), respectively, estimated by the EL
method.
. | ML . | EL![]() | ||||
---|---|---|---|---|---|---|
Est . | BSE . | 95% CI . | Est . | BSE . | 95% CI . | |
Deceased donor | 1.067 | 0.425 | ( 0.017, 1.645) | 1.094 | 0.199 | ( 0.738, 1.534) |
Donor age ≥35 years | 0.675 | 0.343 | (−0.207, 1.061) | 0.742 | 0.197 | ( 0.204, 1.009) |
Panel reactive antibody ![]() | 0.784 | 0.272 | ( 0.078, 1.182) | 0.810 | 0.236 | ( 0.330, 1.217) |
Delayed graft function | 1.086 | 0.576 | (−0.314, 1.887) | 1.092 | 0.272 | ( 0.594, 1.703) |
Deceased donor & recipient age ≥6 years | −0.881 | 0.319 | (−1.288, −0.055) | −0.872 | 0.230 | (−1.349, −0.416) |
. | ML . | EL![]() | ||||
---|---|---|---|---|---|---|
Est . | BSE . | 95% CI . | Est . | BSE . | 95% CI . | |
Deceased donor | 1.067 | 0.425 | ( 0.017, 1.645) | 1.094 | 0.199 | ( 0.738, 1.534) |
Donor age ≥35 years | 0.675 | 0.343 | (−0.207, 1.061) | 0.742 | 0.197 | ( 0.204, 1.009) |
Panel reactive antibody ![]() | 0.784 | 0.272 | ( 0.078, 1.182) | 0.810 | 0.236 | ( 0.330, 1.217) |
Delayed graft function | 1.086 | 0.576 | (−0.314, 1.887) | 1.092 | 0.272 | ( 0.594, 1.703) |
Deceased donor & recipient age ≥6 years | −0.881 | 0.319 | (−1.288, −0.055) | −0.872 | 0.230 | (−1.349, −0.416) |
Note. ML, the unpenalized maximum likelihood approach; EL, the unpenalized extended empirical likelihood approach with auxiliary information; Est, the estimated regression coefficient; BSE, bootstrap standard error given by the standard deviation of the 500 estimates; 95% CI, the 95% bootstrap confidence interval given by the 2.5% and 97.5% of the 500 estimates.
. | ML . | EL![]() | ||||
---|---|---|---|---|---|---|
Est . | BSE . | 95% CI . | Est . | BSE . | 95% CI . | |
Deceased donor | 1.067 | 0.425 | ( 0.017, 1.645) | 1.094 | 0.199 | ( 0.738, 1.534) |
Donor age ≥35 years | 0.675 | 0.343 | (−0.207, 1.061) | 0.742 | 0.197 | ( 0.204, 1.009) |
Panel reactive antibody ![]() | 0.784 | 0.272 | ( 0.078, 1.182) | 0.810 | 0.236 | ( 0.330, 1.217) |
Delayed graft function | 1.086 | 0.576 | (−0.314, 1.887) | 1.092 | 0.272 | ( 0.594, 1.703) |
Deceased donor & recipient age ≥6 years | −0.881 | 0.319 | (−1.288, −0.055) | −0.872 | 0.230 | (−1.349, −0.416) |
. | ML . | EL![]() | ||||
---|---|---|---|---|---|---|
Est . | BSE . | 95% CI . | Est . | BSE . | 95% CI . | |
Deceased donor | 1.067 | 0.425 | ( 0.017, 1.645) | 1.094 | 0.199 | ( 0.738, 1.534) |
Donor age ≥35 years | 0.675 | 0.343 | (−0.207, 1.061) | 0.742 | 0.197 | ( 0.204, 1.009) |
Panel reactive antibody ![]() | 0.784 | 0.272 | ( 0.078, 1.182) | 0.810 | 0.236 | ( 0.330, 1.217) |
Delayed graft function | 1.086 | 0.576 | (−0.314, 1.887) | 1.092 | 0.272 | ( 0.594, 1.703) |
Deceased donor & recipient age ≥6 years | −0.881 | 0.319 | (−1.288, −0.055) | −0.872 | 0.230 | (−1.349, −0.416) |
Note. ML, the unpenalized maximum likelihood approach; EL, the unpenalized extended empirical likelihood approach with auxiliary information; Est, the estimated regression coefficient; BSE, bootstrap standard error given by the standard deviation of the 500 estimates; 95% CI, the 95% bootstrap confidence interval given by the 2.5% and 97.5% of the 500 estimates.
4 Discussion
In this paper, we have proposed a PEL approach to improve variable selection accuracy and estimation efficiency by synthesizing information from a mixture of high-dimensional subject-level data and low-dimensional aggregate data. The proposed approach is extended to account for the population heterogeneity and uncertainty in the auxiliary information. Although we focus on the logistic regression, the proposed approaches can be easily extended to handle more general parametric models. Moreover, our work assumes that the covariates in the density ratio model are of a fixed dimension and are correctly specified. When such knowledge is not available, one can set
and impose a sparsity assumption that only a small subset of
accounts for the heterogeneity in the predictor distributions. Then we can include a proper penalty for
in the penalized log likelihood for variable selection in
. This will be investigated in our future research.
Data Availability Statement
The data that support the findings of the pediatric kidney transplant data analysis in this paper are available from the Organ Procurement and Transplantation Network (OPTN). The general public can submit a request to OPTN for the data at the following URL: https://optn.transplant.hrsa.gov/data/request-data/.
Acknowledgments
This work was supported by National Institutes of Health grant R01CA193888. This work was supported in part by Health Resources and Services Administration contract 234-2005-37011C. The authors thank Dr. Edward at the Cincinnati Children's Hospital Medical Center for providing clinical insight to the analysis of the pediatric kidney transplant study.
References
Appendix
(C1) The covariate vector is bounded with probability one. The true value
lies in a compact subset of
.
(C2) The functions and
are continuous in a neighborhood of the true value
. Moreover, the functions
,
, and
are bounded by some integrable functions in this neighborhood.
(C3) Let and
denote the smallest and largest eigenvalues of a positive definite matrix M, respectively. Assume that there exist constants
and
such that

Conditions (C1) and (C2) are commonly imposed for estimating functions under the empirical likelihood framework. Condition (C3) assumes that the Fisher information matrix under the logistic regression model is positive definite and its eigenvalues are bounded. Similar conditions have been assumed in Fan and Peng (2004) and Zou and Zhang (2009) under different model assumptions.
Supplementary data
The computation algorithm and the R code for implementing the proposed approaches in Section 2, additional numerical simulations and additional results of data analysis in Section 3, and proofs of Theorems 1-5 referenced in Section 2 are available with this paper at the Biometrics website on Wiley Online Library.