Abstract

Motivation

The flexibility of a Bayesian framework is promising for GWAS, but current approaches can benefit from more informative prior models. We introduce a novel Bayesian approach to GWAS, called Structured and Non-Local Priors (SNLPs) GWAS, that improves over existing methods in two important ways. First, we describe a model that allows for a marker’s gene-parent membership and other characteristics to influence its probability of association with an outcome. Second, we describe a non-local alternative model for differential minor allele rates at each marker, in which the null and alternative hypotheses have no common support.

Results

We employ a non-parametric model that allows for clustering of the genes in tandem with a regression model for marker-level covariates, and demonstrate how incorporating these additional characteristics can improve power. We further demonstrate that our non-local alternative model gives symmetric rates of convergence for the null and alternative hypotheses, whereas commonly used local alternative models have asymptotic rates that favor the alternative hypothesis over the null. We demonstrate the robustness and flexibility of our structured and non-local model for different data generating scenarios and signal-to-noise ratios. We apply our Bayesian GWAS method to single nucleotide polymorphisms data collected from a pool of Alzheimer’s disease and cognitively normal patients from the Alzheimer’s Database Neuroimaging Initiative.

Availability and implementation

R code to perform the SNLPs method is available at https://github.com/lockEF/BayesianScreening.

1 Introduction

Detecting associations between particularly influential genetic variants and a complex pathogenesis is of major interest within chronic disease research, and innovations in genome-wide association studies (GWAS) are desirable. A common practice in GWAS is massive univariate linear modeling, where each genetic marker (i.e. single nucleotide polymorphism, or SNP) is independently fit as a linear predictor to explain the variability in a given phenotype. Due to the large number of tests conducted in a genome-wide analysis (GWA), procedures that correct Type-I error rate cut-offs for multiple comparisons are commonly used. These strategies include controlling the family-wise error rate or false discovery rate (Benjamini and Hochberg, 1995; Holm, 1979; Hochberg and Tamhane, 1987). In GWAS, the massive number of tests requires a very stringent Type-I error rate correction. Kamboh et al. (2012) conducted a meta-analysis of GWAS on around 2.5 million SNPs with a Bonferroni correction to control family-wise error rate, and detected six SNPs associated with Alzheimer’s disease. Then, they mapped the critical values of each marker against the location of that marker on its corresponding gene (Kamboh et al., 2012). The location of the SNPs within a gene correlated with the significance level from their associated tests. This example suggests that a more informed data-dependent correction may yield increased power to detect marker-phenotype associations.

More contemporary research on multiple testing has incorporated a Bayesian perspective. Bayesian approaches to multiple testing follow two noticeable patterns: selection or shrinkage. Bayesian model selection uses posterior probabilities to select among different hypotheses given the data, and multiple testing can be treated with a hierarchical model on the prior probabilities for multiple related tests (Scott and Berger, 2010). Bayesian hierarchical modeling of effect sizes is similarly useful under multiple testing scenarios, by appropriately shrinking effects toward the null value. Bayesian hierarchical selection and shrinkage have both been considered in the GWAS context, and a key challenge is to incorporate additional characteristics for a more descriptive and powerful approach to multiple testing. These additional characteristics, such as the genomic position of a marker and its functional annotation, have been shown to influence a marker’s association with a given phenotype of interest (Kamboh et al., 2012; Lewinger et al., 2007; Zablocki et al., 2017). Yi et al. (2014) considered Bayesian model shrinking in the GWAS context and used additional marker level characteristics to influence the shrinkage of the parameters, and collected the few markers with high signal. Lewinger et al. (2007) introduced a fully Bayesian scheme for marker selection in which additional characteristics, such as SNP annotation, influence the prior probability of the null hypothesis for each marker. Alternatively, Zablocki et al. (2014) described a Bayesian framework that allows covariates to modulate a false discovery rate adjustment; Zablocki et al. (2017) followed up their work by relaxing distributional assumptions and uses B-splines for a more flexible modeling approach. Lock and Dunson (2017) described a hierarchical model in which the prior probability of association for a given marker depends on its parent gene, and the gene-level effects are modeled flexibly with a non-parametric Dirichlet Process (DP) prior. We build on this hierarchical DP structure in this work, to allow for clustering of the parent gene effect while also incorporating functional information on the markers.

For Bayesian model selection, results are often sensitive to the specification of priors under the null and alternative hypotheses, and thus these must be considered carefully. Classical frequentist hypotheses specifications often have a point null while the alternative is anything but that point null. However, in the Bayesian framework it is often the case that the null hypothesis is a point null while the alternative hypothesis is a more diffuse distribution. A local alternative prior includes the null in its support. It has been shown that the asymptotic rates of evidence in favor of either hypothesis differs exponentially under such a scenario (Johnson and Rossell, 2010; Lock and Dunson, 2015). Non-local priors (henceforth NLP) were introduced by Johnson and Rossell (2010, 2012) to resolve this issue with Bayesian alternative hypothesis specification, and they are commonly used for model selection in a regression context. The use of NLPs in GWAS for multivariate regression modeling and a continuous outcome was explored by Sanyal et al. (2019). We introduce a NLP for two-sample tests with categorical data that is appropriate for GWAS screening studies. NLPs are sensitive to user-specified hyperparameters, and in this article we explore the influence of these hyperparameters and propose a method of marginal likelihood maximization to select the hyperparameters in our context.

Our primary methodological contributions are twofold: (i) to incorporate prior knowledge on markers to estimate probabilities of phenotype associations more accurately and improve interpretation, and (ii) to introduce a NLP model for the two-sample testing context for Bayesian GWAS applications. For (i), we build on Lock and Dunson (2017) and extend the non-parametric Bayesian method for parent-gene clustering to incorporate additive effects for additional marker-level data. For (ii), we use a probit link function to model the difference in minor allele rates between two samples via a NLP. We describe extensive simulation studies to assess both contributions, demonstrating improved performance while also highlighting caveats in their implementation. Lastly, we illustrate our proposed model on the SNP level data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) and discuss the SNPs and genes that are found to be associated with Alzheimer’s disease (www.adni-info.org).

2 Model

2.1 GWA modeling framework

In this section we describe our general framework for Bayesian GWAS and discuss our contributions. There are many different forms of genetic marker data, however, we focus on the minor allele proportion of presence (either one or two minor alleles present) for given SNPs (henceforth minor allele rates). The SNPs may be grouped by the gene region that they fall within (i.e. their gene-parent). We consider the common scenario of screening for markers that are associated with the lack of or presence of a given phenotype (represented by 0 and 1, respectively). Denote Nij0 and Nij1 as the total number of control and case study participants, respectively, for genetic marker j within parent i. Denote the number of individuals with the minor allele present in each group as sij0 and sij1, which follow a binomial distribution with underlying minor allele rates pij0 and pij1, respectively, as opposed to the minor allele frequencies. That is,
sij0Bin(Nij0,pij0)andsij1Bin(Nij1,pij1),
where Bin(n,p) denotes the Binomial distribution with total count parameter n and probability of success p. We specify null and alternative hypotheses, Hij0 and HijA, that define prior models for pij0 and pij1. Then the marginal likelihoods under each hypothesis can be used to find the posterior probability of association for each marker. Let πij be the prior probability of HijA, then
P(HijA|sij0,sij1)=πij·P(sij0,sij1|HijA)πij·P(sij0,sij1|HijA)+(1πij)·P(sij0,sij1|Hij0)
(1)
where P(sij0,sij1|Hij0) and P(sij0,sij1|HijA) refer to the marginal likelihood under the null and alternative hypotheses for marker j within parent i, respectively. If there is not prior information on the SNPs then πij has been set to 0.5, reflecting that either hypothesis has 50% of being true a priori (Stephens and Balding, 2009).

In Section 2.2 we discuss implementing non-local priors for the likelihoods under Hij0 and HijA in Equation (1). In Section 2.3 we elaborate on using a structured model for πij in Equation (1) that incorporates marker-level covariates and the parent gene of each marker.

2.2 Incorporating non-local hypotheses

In our model we incorporate the non-local prior to enforce dissimilarity in a genetic marker’s minor allele rates between the patient classes, under the alternative hypothesis. Under Hij0: pij0=pij1=pijU(0,1), that is, we assume that the probability of a minor allele is the same for both groups, with a uniform prior over the unit interval. Under HijA: pij0=Φ(α0ijα1ij),pij1=Φ(α0ij+α1ij), where α0ijN(0,1) and α1ijNLP(τ,k,v) where
NLP(α1|τ,k,v)=kτv/2Γ(v/2k)[α12](v+1)/2exp[((α1)2τ)k].
(2)

Here, Φ denotes the cumulative distribution function of a standard normal distribution while N(·,·) means the Normal distribution with a mean and a variance parameter, respectively. Note that if hypothetically α1ij=0, then the alternative model is equivalent to the null. However, α1ij=0 is not within the support of the prior; this is the key characteristic of the non-local prior as it enforces pij0 and pij1 to be unequal. The specification of the hyperparameters {τ,k,v} is discussed in Section 3.2.1.

To our knowledge, there has not been an investigation of non-local priors within our context. However, non-local alternative priors generally yield exponential rates of convergence toward a true null or true alternative (Johnson and Rossell, 2010), whereas local alternative priors do not. We verify this with a simple simulation study in Section 4.3.

In addition to the non-local alternative model, we consider a straightforward local model following Stephens and Balding (2009). Under Hij0:pij0=pij1=pijU(0,1) and under HijA:pij0U(0,1) and pij1U(0,1) are treated independently. Although pij1 and pij0 are independent under the alternative, the alternative still supports the possibility that pij0=pij1, which is consistent with the null hypothesis. This is the key distinction between the local and non-local models; the former does not explicitly enforce inequality while the latter does. The marginal likelihoods under the null and local alternative have closed forms and are supplied in Section 3.1.

2.2.1 Exploring the non-local density

Here we investigate the implication of the hyperparameter settings for the non-local density on the magnitude of the difference in minor allele rates between classes. First, recall that Φ(α0α1)=p0 and Φ(α0+α1)=p1, and thus α1=12(Φ1(p1)Φ1(p0)). This provides a natural point of comparison to the local model, wherein the minor allele rates p0 and p1 follow independent U(0, 1) distributions; under these assumptions, α1=12(Φ1(p1)Φ1(p0))N(0,12). Figure 1 shows the density for α1 under different specifications of the non-local alternative prior, and for the local alternative. The value of τ largely controls the range of support for the non-local prior and the size of the ‘gap’ with negligible density near 0; the value of V controls the variability near the boundaries of this gap. As τ changes but is rather small, the density of the non-local parameter α1 also substantially changes. In comparison, the local model is rather diffuse over this support. The effect of the hyperparameters in the non-local density illustrate the sensitivity to the values of τ and V, and we caution against arbitrarily selecting their values. We devised a likelihood maximization scheme, described in Section 3.2.1, to empirically nominate a candidate set of values V, τ and k.

Fig. 1.

Non-local densities with varying parameter values V and small τ with k =1. The distribution for α1 under the local alternative hypothesis is provided for comparison

2.3 Incorporating structured dependence

Here we describe our approach to incorporate genomic structure and prior information to infer πij in Equation (1). Our approach generalizes to any constructed likelihood per marker under each hypothesis, P(sij0,sij1|Hij0) and P(sij0,sij1|HijA), such as those in Section 2.2. Let X=[Xβ,Xθ] be a design matrix of additional prior data for each marker. In our context, Xβ includes categorical indicators giving the functional classification for each SNP, and Xθ is a gene-parent affiliation matrix (i.e. an ANOVA design matrix of gene affiliation for each SNP) and Θ=[β,θ]. We use a probit link to model the marker prior probabilities as a function of covariates: πij=Φ(xijTΘ), where Φ is the standard normal CDF. We place a N(0, 1) prior on the coefficients β. For the gene-level effects θi, we use a flexible Dirichlet process prior with concentration parameter α and a N(0, 1) base distribution. We denote this distribution using the stick-breaking construction: ω|αGEM(α),ϕki.i.dN(0,1), with Si|ωMult(ω) and θi|Si,{ϕk}F(ϕki), where Mult(·) denotes the multinomial distribution, and GEM is the distribution for the stick-breaking construction of a Dirichlet process with concentration parameter α (Pitman et al., 2002). To flexibly infer the concentration of the gene effects we use a Γ(a,b) prior on α, where Γ(α,β) is the gamma distribution with shape and scale parameters α and β. We set a=b=1.1 for these hyperparameters. Lastly, we truncate the number of stick-breaks for the DP model at 20 (Sethuraman, 1994).

We point out certain aspects of our model that build off existing work on covariate modulated prior probabilities for GWAS. Without gene-level effects (θ=0) our model is analogous to that in Lewinger et al. (2007), who used a logit rather than a probit link for the prior probabilities πij. We used a probit link primarily to facilitate efficient posterior computation (see Section 3.2). Moreover, without additional marker level covariates (β=0), our model is analogous to that in Lock and Dunson (2017), who used a Dirichlet process prior with U(0, 1) base distribution to model gene-level posterior probabilities. As U(0, 1) is equivalent to a N(0, 1) distribution with a probit link, we have directly extended the gene-level model of Lock and Dunson (2017) to allow for additional marker covariates.

3 Estimation

3.1 Marginal likelihood estimation

Under the null hypothesis for the non-local and local models of Section 2.2 the marginal likelihood is
P(sij0,sij1|Hij0)=β(1+sij1+sij0,1+sma,ij1+sma,ij0)(Nij0sij0)(Nij1sij1),
where β(·,·) denotes the Beta coefficient, and the subscript ma denotes major allele count: sma,ij0=Nij0sij0 and sma,ij1=Nij1sij1. The marginal likelihood under the alternative hypothesis for the local model is also calculated directly:
P(sij0,sij1|HijA)=β(1+sij1,1+sma,ij1)β(1+sij0,1+sma,ij0)(Nij0sij0)(Nij1sij1).
Under the non-local alternative the marginal likelihood is
P(sij0,sij1|HijA)=Bin(sij0;Nij0,pij0)·Bin(sij1;Nij1,pij1)
·N(α0ij;0,1)·NLP(α1ij;τ,v,k)α0ijα1ij,
where pij0=Φ(α0ijα1ij) and pij1=Φ(α0ij+α1ij). We compute this intractable integral via a two-step Monte Carlo approach. First, we generate T values for each of α0 and α1 from their prior; that is, we generate α0 from a N(0, 1) distribution, and α1 from the specified NLP via a Metropolis-Hastings approach. For each simulated pair (α0(t),α1(t)) we compute p0(t)=Φ(α0(t)α1(t)) and pij1(t)=Φ(α0(t)+α1(t)), and approximate the marginal likelihood empirically:
P(sij0,sij1|HijA)1Tt=1TBin(sij0;Nij0,p0(t))·Bin(sij1;Nij1,p1(t)).

3.2 Posterior estimation

We compute the full posterior under the structured prior using a Gibbs sampling approach, with a latent variable augmentation for the probit link as described in Albert and Chib (1993). That is, we introduce Zij as latent variables that depend on the model indicators Yij: ZijN(xijTΘ,1) with xijTΘ=xijTβ+θi, where Yij = 1 if Zij > 0 and Yij = 0 otherwise. To sample from the DP model on the gene effects we use its stick-breaking construction (Sethuraman, 1994), and sample from the full conditionals as described in Dunson (2010). The full Gibbs sampling scheme proceeds as follows:

  • (1) Designate Alternative Markers:

The prior probability of association for marker j within parent i is
πij=Φ(xijTΘ)andΦis the Normal CDF.
Then the posterior probability of association is
P(HijA|sij0,sij1)=πijP(sij0,sij1|HijA)πijP(sij0,sij1|HijA)+(1πij)P(sij0,sij1|Hij0)

and YijBern(P(HijA|sij0,sij1)).

  • (2) Update DP parameters forθ:

(A) First we allocate each gene-parent effect θi to one of the DP atoms h, denoted by Si=h, from a closed form multinomial conditional posterior with probabilities:
Pr(Si=h|)={VhΠl<h(1Vl)}jniN(Zijx(β)ijTβ;θh,1)r=1H{VrΠs<r(1Vs)}jniN(Zijx(β)ijTβ;θr,1),
where h=1,,H, ni are the number of markers assigned to gene i, θh is the effect for atom h and H is the number of stick-breaks.
(B) We update the stick-breaking weights from the conjugate posterior distribution:
(Vh|)indBeta(1+i=1GI(Si=h),α+i=1GI(Si>h)),
where I(A) is the indicator function (if A is true then I(A) = 1, and I(A) = 0 otherwise), G is the number of genes and VH = 1.
(C) Update the DP atoms (random intercepts for each cluster) by sampling from their Normal posteriors
(θh|)indN(1+i:Si=hj=1niZijx(β)ijTβ1+i:Si=hni,11+i:Si=hni).

(D) Reassign θi=θSi.

  • (3) Update DP concentrationα via a Metropolis-Hastings step with a Γ(1.1,1.1) prior.

  • (4) Update additional regression parameters β via
    (β|)N(β^,(Iβ+XβXβ)1)

where β^ is the conditional posterior mean
β^=(Iβ+XβXβ)1Xβ(ZXθθ).
  • (5) Update Latent Variable Z by a truncated normal distribution depending on Yij.
    ZijN+(xijTΘ,1)ifYij=1;ZijN(xijTΘ,1)ifYij=0,
    where N+(·) and N(·) refer to truncated from the left and right at 0, respectively.

We take the mean over Gibbs samples to obtain point estimates for model parameters and to obtain the posterior probability of association for each marker.

3.2.1 Log-marginal likelihood maximization for non-local parameter determination

We select the non-local alternative hyperparameters for given data through a computationally efficient marginal likelihood empirical maximization technique. Given a dataset, we compute the marginal likelihoods under the null and alternative hypotheses for each marker, as explained in Section 3.1. Then we designate for marker ij, I(HijA)=1 if P(sij0,sij1|HijA)>P(sij0,sij1|Hij0), and I(HijA)=0 otherwise. The likelihood for all markers is computed as
P(sij0,sij1)=P(sij0,sij1|HijA)I(HijA)+P(sij0,sij1|Hij0)(1I(HijA))
(3)

We then take the average of log-likelihood over all markers, given the NLP parameters. We evaluate this expression over a grid of V and τ values while holding k =1, and the pair of NLP parameters yielding the highest average of the log-likelihood is the chosen set of parameters for the subsequent full analysis that incorporates the hierarchical estimation of the prior probability of association.

4 Simulation studies

4.1 Structured prior: gene-parent effects only

Here we describe a simulation study in which the prior probability that a marker is associated with a given phenotype depends on its gene-parent. We consider several scenarios, with varying levels of sparsity in the number of associated markers and the gene structure. We design our simulation studies to explore the performance of our proposed method with respect to these different levels of association. Namely, we consider four cases where the prior probabilities are drawn from (i) an all null distribution, (ii) a bimodal distribution in which 80% of genes have no association and the markers in the remaining 20% are all associated, (iii) a Beta(0.2,1) distribution for parent prior probabilities and (iv) a uniform distribution for the parent prior probabilities. For each scenario, we simulate data according to the local model in Section 2.2, with 100 case and control samples each (N0=N1=100), 500 parent genes, and the number of markers for each gene uniformly generated between 2 and 30.

Under each scenario, we compare three structured prior models: (i) our proposed hierarchical DP model, (ii) a separate model where parent level effects are modeled separately with independent N(0, 1) distributions (i.e. the parent probabilities are independent U(0, 1) after probit transformation) and (3) a joint model where all markers share the same prior probability, irrespective of their parent gene, with a U(0, 1) prior on this probability.

Table 1 gives the mean absolute error (MAE) for the inferred gene-level probabilities and the marker-level posterior probabilities for each scenario. Our hierarchical DP model outperforms the separate and joint approaches for the Beta(1,0.2) and Bimodal scenarios. The Uniform generative model matches the separate inference model, however, the results for the DP model are comparable to the separate model under this scenario and both substantially outperform the joint model. Similarly, the joint model is appropriate for the All null case because all parent genes have the same signal, yet the DP model is again comparable while substantially outperforming the separate model. Together, these scenarios reflect the advantages of implementing a flexible prior that accommodates different distributions of parent-gene effects, such as the DP. Moreover, the difference in the marker MAEs illustrate how a well-specified prior for the parent-gene effects can lead to more accurate posterior probabilities for the markers.

Table 1.

Mean absolute error (MAE) in gene parent probabilities, and marker posterior probabilities, for four different simulation scenarios and three models for the marker prior probabilities

All nullBimodalBeta(0.2, 1)Uniform
Parent MAE
DP0.00020.00110.07050.1196
Separate0.10850.12200.11010.1176
Joint0.00030.31500.18700.2587
Marker MAE
DP0.00080.01930.05310.1531
Separate0.02860.04870.06970.1531
Joint0.00040.11510.07620.2052
All nullBimodalBeta(0.2, 1)Uniform
Parent MAE
DP0.00020.00110.07050.1196
Separate0.10850.12200.11010.1176
Joint0.00030.31500.18700.2587
Marker MAE
DP0.00080.01930.05310.1531
Separate0.02860.04870.06970.1531
Joint0.00040.11510.07620.2052
Table 1.

Mean absolute error (MAE) in gene parent probabilities, and marker posterior probabilities, for four different simulation scenarios and three models for the marker prior probabilities

All nullBimodalBeta(0.2, 1)Uniform
Parent MAE
DP0.00020.00110.07050.1196
Separate0.10850.12200.11010.1176
Joint0.00030.31500.18700.2587
Marker MAE
DP0.00080.01930.05310.1531
Separate0.02860.04870.06970.1531
Joint0.00040.11510.07620.2052
All nullBimodalBeta(0.2, 1)Uniform
Parent MAE
DP0.00020.00110.07050.1196
Separate0.10850.12200.11010.1176
Joint0.00030.31500.18700.2587
Marker MAE
DP0.00080.01930.05310.1531
Separate0.02860.04870.06970.1531
Joint0.00040.11510.07620.2052

4.2 Structured prior: gene-parent and covariate effects

We evaluate the addition of covariates in the hierarchical setting by simulating true marker associations that are influenced by the assigned gene-parents as well as six additional covariates (six categorical variables that reflect SNP annotation). We simulated data as in the Uniform scenario of Section 4.1, but with Γ(2,1) effects for the six additional marker-level variables. We perform 20 replications of the simulation, and we compare our model with covariates to the same model with covariate effects set to 0. In the cases considered above, we estimate Bayesian credible interval coverage of the parent and covariate effects (when applicable) and mean absolute error (MAE) in marker posterior probability for comparing models under similar data settings. All simulations are done with 2000 collected samples with 1000 burn-in for convergence to the posterior distributions.

When gene effects are drawn from a standard normal distribution, the addition of marker-level covariates lowered MAE in marker posterior probability by about 0.03 (Table 2). We highlight that under similar circumstances, for a model that is misspecified when covariates truly contributed to posterior probabilities of marker association, the MAE of that model was around 0.1719. Coverage for gene-parent effects were less than the nominal under either setting. This likely reflects the discrepancy of the generative model (with independent and distinct gene effects) and our DP model (which assumes some clustering of the genes). However, as shown in Section 4.1, the accuracy of the results under the DP model is comparable to those under the generative model. We did not explore the estimation accuracy of our model for continuous marker-level covariates.

Table 2.

Adding covariates with parent hierarchical structure: the model with local priors

Number of markersNumber of genes
5001000
Parent with 6 categorical effects
150.1363 ± 0.00430.1389 ± 0.0030
300.1348 ± 0.00330.1316 ± 0.0042
500.1259 ± 0.00270.1281 ± 0.0038
Parent effects only
150.1642 ± 0.00120.1625 ± 0.0006
300.1551 ± 0.00090.1552 ± 0.0006
500.1518 ± 0.00100.1512 ± 0.0004
Number of markersNumber of genes
5001000
Parent with 6 categorical effects
150.1363 ± 0.00430.1389 ± 0.0030
300.1348 ± 0.00330.1316 ± 0.0042
500.1259 ± 0.00270.1281 ± 0.0038
Parent effects only
150.1642 ± 0.00120.1625 ± 0.0006
300.1551 ± 0.00090.1552 ± 0.0006
500.1518 ± 0.00100.1512 ± 0.0004

Note: The true parent effects are drawn from a N(0, 1). True categorical effects are drawn from a Γ(2,1) distribution. Each model is applied to simulated data with matching data generation specifications (with and without 6 categorical effects, respectively). These measures are mean absolute errors over 20 simulations ± their standard errors.

Table 2.

Adding covariates with parent hierarchical structure: the model with local priors

Number of markersNumber of genes
5001000
Parent with 6 categorical effects
150.1363 ± 0.00430.1389 ± 0.0030
300.1348 ± 0.00330.1316 ± 0.0042
500.1259 ± 0.00270.1281 ± 0.0038
Parent effects only
150.1642 ± 0.00120.1625 ± 0.0006
300.1551 ± 0.00090.1552 ± 0.0006
500.1518 ± 0.00100.1512 ± 0.0004
Number of markersNumber of genes
5001000
Parent with 6 categorical effects
150.1363 ± 0.00430.1389 ± 0.0030
300.1348 ± 0.00330.1316 ± 0.0042
500.1259 ± 0.00270.1281 ± 0.0038
Parent effects only
150.1642 ± 0.00120.1625 ± 0.0006
300.1551 ± 0.00090.1552 ± 0.0006
500.1518 ± 0.00100.1512 ± 0.0004

Note: The true parent effects are drawn from a N(0, 1). True categorical effects are drawn from a Γ(2,1) distribution. Each model is applied to simulated data with matching data generation specifications (with and without 6 categorical effects, respectively). These measures are mean absolute errors over 20 simulations ± their standard errors.

4.3 Non-local alternative: asymptotic rates

We generate p0 and p1 under either the null model or the non-local alternative model, and generate data s0 and s1 for a specified sample size N0=N1=N. We calculate marginal likelihoods under each hypothesis via importance sampling (see Section 3.1). Figure 2 shows the log Bayes factors for increasing sample size N, demonstrative of exponential convergence under both the null and the alternative.

Fig. 2.

Non-local prior and asymptotic rates of convergence: Data were generated with non-local parameters τ=2,V=4 and k=1.

4.4 Non-local alternative: sensitivity and comparison

To evaluate the non-local alternative model, we simulate data as in the Uniform scenario of Section 4.1, but using the non-local alternative of Section 2.2 to generate minor allele counts. As potential hyperparameters for the non-local model, we consider all possible combinations of the values V:{2.0,2.50,3.0,3.50,4.0} and τ:{0.10,0.45,0.80,1.15,1.50}, with k =1, For estimation we also consider the same parameter combinations for the non-local model, as well as the local alternative model. We collect the 80% and 90% credible coverage and the mean absolute error in marker posterior probability of association for twenty simulations of each scenario.

Figure 3 displays the marker MAE for each generative scenario and for each estimation model. The results show that the non-local alternative generally performs substantially better when the true generative model is non-local, even if the hyperparameters are misspecified. For τ=0.10 and 0.45, τ reflects possible effect sizes in associated probabilities, before transforming to the probit scale. Most often, for data generated with τ=0.10, the model with the specified τ^=0.10 outperforms models that use other values for τ^ with respect to mean absolute error. This pattern is also exhibited with τ=0.45 for the data and the specified model parameter. However, our model exhibits a flip in performance once the true τ is large enough, when the true τ changes from 0.10 to 0.45. For the case that the data were truly generated from an NLP distribution with τ larger than 0.45, the local model performs consistently worse than the non-local model. This reflects that larger τ values imply easier detection of associated markers by the non-local prior, as V and τ can be viewed as a joint effect size modulator for the difference between pij0 and pij1 (See Equation 2). As V grows and τ shrinks, we approach a scenario where there is little difference between pij0 and pij1, thus approaching the behavior of the model specified under the local prior. With smaller values of V, the model not considering non-local priors improved slightly in terms of the three aforementioned statistics for true τ set at its smallest value.

Fig. 3.

Non-local model implemented on non-local data: Mean Absolute Error (MAE) is shown for each pair of the parameter settings that generated the data and the parameter specifications for the model, V = {2.0,2.5,3.0,3.5,4.0} and τ = {0.10,0.45,0.80,1.15,1.50}. The ° refers to the results from the Local Model for comparison

We also consider the robustness of the non-local alternative prior when data are generated under the local model. The resulting MAE for marker posterior probabilities under different specifications of the non-local prior are shown in Table 3. The non-local prior gave comparable performance to the (correctly specified) local prior, under certain values of its hyperparameters, demonstrating the robustness of the non-local model even when the generative distribution under the alternative does not have a non-local density.

Table 3.

Non-local prior implemented on local generated data: MAE for marker posterior probabilities are shown for non-local prior specifications V and τ

τ / V2.02.53.03.54.0
0.100.15080.15650.16360.17170.1809
0.450.15820.15480.15200.14990.1481
0.800.17510.17100.16690.16390.1607
1.150.18870.18400.17990.17620.1728
1.500.20020.19430.19080.18710.1831
τ / V2.02.53.03.54.0
0.100.15080.15650.16360.17170.1809
0.450.15820.15480.15200.14990.1481
0.800.17510.17100.16690.16390.1607
1.150.18870.18400.17990.17620.1728
1.500.20020.19430.19080.18710.1831

Note: The local model had a MAE of 0.1551.

Table 3.

Non-local prior implemented on local generated data: MAE for marker posterior probabilities are shown for non-local prior specifications V and τ

τ / V2.02.53.03.54.0
0.100.15080.15650.16360.17170.1809
0.450.15820.15480.15200.14990.1481
0.800.17510.17100.16690.16390.1607
1.150.18870.18400.17990.17620.1728
1.500.20020.19430.19080.18710.1831
τ / V2.02.53.03.54.0
0.100.15080.15650.16360.17170.1809
0.450.15820.15480.15200.14990.1481
0.800.17510.17100.16690.16390.1607
1.150.18870.18400.17990.17620.1728
1.500.20020.19430.19080.18710.1831

Note: The local model had a MAE of 0.1551.

4.5 Non-local alternative: hyperparameter selection

We evaluate our marginal likelihood maximization technique to determine the true set of hyperparameters for the NLP. First, we generate data with half of the markers from the null and half of the markers from a non-local alternative, with 15 000 total markers, N0=100 controls and N1=100 cases. After generative data, we empirically estimate the NLP parameters as described in Section 3.2.1. We ran 30 simulations for each combination of: V: 2.0, 2.50, 3.0, 3.50, 4.0; τ: 0.10, 0.45, 0.80, 1.15, 1.50; and k =1, for data generation and modeling.

The proposed log-likelihood maximization method detected the appropriate set of parameters in most of the cases over the thirty simulations (see Table 4). τ is accurately estimated in most of the simulations; smaller values of τ are estimated to be the true value more often than for larger values of τ. This reflects that the degree to which the case and control proportions of minor allele presence differ can be quite small and be estimated quite accurately for this method. In general, the estimated values for V^ were less accurate, but still tended to be correct in a majority of simulations.

Table 4.

Non-local prior parameter selection results

V / τ0.100.450.801.151.50
(a)
2.090.00%93.33%76.67%60.00%76.67%
2.533.33%83.33%86.67%76.67%56.67%
3.010.00%80.00%80.00%76.67%80.00%
3.56.66%80.00%86.67%70.00%80.00%
4.00.00%70.00%83.33%80.00%83.33%
(b)
τ /  τ^0.100.450.801.151.50
0.101473000
0.450145500
0.8002134140
1.1500212325
1.5000022128
(c)
V /  V^2.02.53.03.54.0
2.012421203
2.534921121
3.030398105
3.50839976
4.001124295
V / τ0.100.450.801.151.50
(a)
2.090.00%93.33%76.67%60.00%76.67%
2.533.33%83.33%86.67%76.67%56.67%
3.010.00%80.00%80.00%76.67%80.00%
3.56.66%80.00%86.67%70.00%80.00%
4.00.00%70.00%83.33%80.00%83.33%
(b)
τ /  τ^0.100.450.801.151.50
0.101473000
0.450145500
0.8002134140
1.1500212325
1.5000022128
(c)
V /  V^2.02.53.03.54.0
2.012421203
2.534921121
3.030398105
3.50839976
4.001124295

Note: (a) Percentages for the log-likelihood maximization technique correctly specifying the non-local parameter values V and τ for the data generation out of 30 simulations for each scenario. (b) Total counts specifying τ correctly and incorrectly; rows and columns correspond to true and chosen τ, respectively. (c) Total counts specifying V correctly and incorrectly; rows and columns correspond to true and chosen V, respectively.

Table 4.

Non-local prior parameter selection results

V / τ0.100.450.801.151.50
(a)
2.090.00%93.33%76.67%60.00%76.67%
2.533.33%83.33%86.67%76.67%56.67%
3.010.00%80.00%80.00%76.67%80.00%
3.56.66%80.00%86.67%70.00%80.00%
4.00.00%70.00%83.33%80.00%83.33%
(b)
τ /  τ^0.100.450.801.151.50
0.101473000
0.450145500
0.8002134140
1.1500212325
1.5000022128
(c)
V /  V^2.02.53.03.54.0
2.012421203
2.534921121
3.030398105
3.50839976
4.001124295
V / τ0.100.450.801.151.50
(a)
2.090.00%93.33%76.67%60.00%76.67%
2.533.33%83.33%86.67%76.67%56.67%
3.010.00%80.00%80.00%76.67%80.00%
3.56.66%80.00%86.67%70.00%80.00%
4.00.00%70.00%83.33%80.00%83.33%
(b)
τ /  τ^0.100.450.801.151.50
0.101473000
0.450145500
0.8002134140
1.1500212325
1.5000022128
(c)
V /  V^2.02.53.03.54.0
2.012421203
2.534921121
3.030398105
3.50839976
4.001124295

Note: (a) Percentages for the log-likelihood maximization technique correctly specifying the non-local parameter values V and τ for the data generation out of 30 simulations for each scenario. (b) Total counts specifying τ correctly and incorrectly; rows and columns correspond to true and chosen τ, respectively. (c) Total counts specifying V correctly and incorrectly; rows and columns correspond to true and chosen V, respectively.

5 Application to ADNI data

The Alzheimer’s Disease Neuroimaging Initiative is a multi-center effort in identifying genomic, metabolomic, and neurological markers that associate with Alzheimer’s disease progression. We obtained our samples from the ADNI1 period of this initiative. These samples were genotyped using the Illumina Human610-Quad BeadChip and GenomeStudio v2009.1 was used to process the intensity data (www.adni-info.org). We collected 584 994 SNPs without functional annotation from 179 patients with Alzheimer’s disease (AD) and 214 with cognitively normal status (CN). Then we used the BioMart package in R version 3.4.1, to gather marker consequences and gene assignments for each SNP (R Core Team, 2017). We omitted SNPs that had duplicate entries to rid of the ambiguity in gene and marker consequence assignments. The counts of SNP consequence categories are: 77 splice cites, 497 5′UTR, 4861 3′UTR, 3324 non-synonymous, 1716 synonymous and 267 550 introns. We omitted intergenic SNPs from this analysis. There are 18 957 unique genes for 278 025 SNPs, and we apply our model onto this data. For up-to-date information, please go to adni.loni.usc.edu.

We applied our proposed hierarchical gene-parent and marker-level covariate adjusted model with non-local priors to the 278 025 SNPs from ADNI to identify markers that have high posterior probability of association with AD. From the SNP annotations collected from BioMart, we created an ANOVA-like binary matrix for SNP annotation (i.e. synonymous, intronic, etc.). For our analysis, we designated intronic SNPs as the reference group. We applied our maximum log-likelihood strategy to determine the optimal set of non-local prior parameters to model the ADNI data. We evaluated the mean log-likelihoods of the data over a large grid of values for τ and V, where τ:0.001,0.01,0.05,0.075,0.1,0.2,0.45,0.8,1.15,1.5 and V:2,3,4,5,6,7,8,9,10. Following this procedure, we analyzed the ADNI data with our hierarchical model that employs the previously determined NLP parameters in the alternative marginal likelihood estimation. 20 000 samples of each of the parameters and the posterior probability of association for markers were collected after a burn-in of 10 000 samples.

The maximum log-likelihood strategy for choosing the optimal set of non-local parameters yielded V^=8 and τ^=0.075, with a mean log-likelihood of −8.23; in comparison, the local model resulted in a mean log-likelihood of −8.35. Under our chosen non-local specification, the top SNP was rs2075650 with a posterior probability of association of 88.14% [see Table 5a for all detected SNPs with a PPA of at least 10%]. This SNP is located on TOMM40, near APOE4; this is a gene repeatedly observed to be significantly associated with Alzheimer’s disease in GWA studies. In comparison, not one SNP is identified when using Fisher’s exact test with the various multiple testing corrections. We only report rs2075650 as a significant marker with a modest expected false discovery rate, and a Holm adjusted P-value cut-off smaller than 10%. The flexible prior on the marker-gene membership yielded uniform atom-effect convergence to one or two values around −3.3 over the collected samples. This reflects that the probability of a random SNP within a given gene being associated with Alzheimer’s disease is very small. SNP annotation effects are summarized in Table 5b. Within Table 5a we include the difference in proportions of minor allele presence between the case and controls calculated from the raw data. These values may provide guidance on how to interpret one SNP’s relationship with Alzheimer’s disease. For instance, rs2075650 has the highest probability of association and has a positive risk difference, meaning there were more cases annotated with minor allele presence, 1 or 2 minor alleles, as compared to the controls. One may infer that this SNP has a noxious role within the pathogenesis of AD.

Table 5.

(a) Detected SNPs associated with Alzheimer’s disease with posterior probability of association (PPA) of 10% or higher; gene ID, SNP placement on gene, posterior probability of association (PPA), and difference in proportions of minor allele presence between cases and controls (risk difference). (b) Probabilities of associating with Alzheimer’s disease for each SNP annotation

SNP rsIDGeneConsequencePPARisk difference
(a)
rs2075650TOMM40Intronic0.88140.2550
rs906283PIEZO2Intronic0.2612−0.1994
rs1461707LINC00907Intronic0.1856−0.2079
rs11072463PMLIntronic0.1499−0.2028
rs7191801HS3ST4Intronic0.1365−0.2022
rs2883782MYO3BIntronic0.13450.1734
rs1981542RP11-556I14.2Intronic0.1201−0.1925
rs708036FARS2Intronic0.11370.1916
rs1155490PREX2Intronic0.1074−0.1697
(b)
Percentile2.5%50.0%Posterior mean97.5%
3′UTR0.00044%0.00794%0.01093%0.04124%
5′UTR0.00001%0.01048%0.04901%0.36228%
Non-synonymous0.00095%0.03277%0.05830%0.26069%
Splice-site0.00002%0.02068%0.20888%1.65663%
Synonymous0.00087%0.03988%0.09586%0.48173%
SNP rsIDGeneConsequencePPARisk difference
(a)
rs2075650TOMM40Intronic0.88140.2550
rs906283PIEZO2Intronic0.2612−0.1994
rs1461707LINC00907Intronic0.1856−0.2079
rs11072463PMLIntronic0.1499−0.2028
rs7191801HS3ST4Intronic0.1365−0.2022
rs2883782MYO3BIntronic0.13450.1734
rs1981542RP11-556I14.2Intronic0.1201−0.1925
rs708036FARS2Intronic0.11370.1916
rs1155490PREX2Intronic0.1074−0.1697
(b)
Percentile2.5%50.0%Posterior mean97.5%
3′UTR0.00044%0.00794%0.01093%0.04124%
5′UTR0.00001%0.01048%0.04901%0.36228%
Non-synonymous0.00095%0.03277%0.05830%0.26069%
Splice-site0.00002%0.02068%0.20888%1.65663%
Synonymous0.00087%0.03988%0.09586%0.48173%

Each measure is a cumulative normal measure of the linear expression Φ(Xβ+θ¯(m)) where θ¯(m) is the average gene effect for MCMC sample m

Table 5.

(a) Detected SNPs associated with Alzheimer’s disease with posterior probability of association (PPA) of 10% or higher; gene ID, SNP placement on gene, posterior probability of association (PPA), and difference in proportions of minor allele presence between cases and controls (risk difference). (b) Probabilities of associating with Alzheimer’s disease for each SNP annotation

SNP rsIDGeneConsequencePPARisk difference
(a)
rs2075650TOMM40Intronic0.88140.2550
rs906283PIEZO2Intronic0.2612−0.1994
rs1461707LINC00907Intronic0.1856−0.2079
rs11072463PMLIntronic0.1499−0.2028
rs7191801HS3ST4Intronic0.1365−0.2022
rs2883782MYO3BIntronic0.13450.1734
rs1981542RP11-556I14.2Intronic0.1201−0.1925
rs708036FARS2Intronic0.11370.1916
rs1155490PREX2Intronic0.1074−0.1697
(b)
Percentile2.5%50.0%Posterior mean97.5%
3′UTR0.00044%0.00794%0.01093%0.04124%
5′UTR0.00001%0.01048%0.04901%0.36228%
Non-synonymous0.00095%0.03277%0.05830%0.26069%
Splice-site0.00002%0.02068%0.20888%1.65663%
Synonymous0.00087%0.03988%0.09586%0.48173%
SNP rsIDGeneConsequencePPARisk difference
(a)
rs2075650TOMM40Intronic0.88140.2550
rs906283PIEZO2Intronic0.2612−0.1994
rs1461707LINC00907Intronic0.1856−0.2079
rs11072463PMLIntronic0.1499−0.2028
rs7191801HS3ST4Intronic0.1365−0.2022
rs2883782MYO3BIntronic0.13450.1734
rs1981542RP11-556I14.2Intronic0.1201−0.1925
rs708036FARS2Intronic0.11370.1916
rs1155490PREX2Intronic0.1074−0.1697
(b)
Percentile2.5%50.0%Posterior mean97.5%
3′UTR0.00044%0.00794%0.01093%0.04124%
5′UTR0.00001%0.01048%0.04901%0.36228%
Non-synonymous0.00095%0.03277%0.05830%0.26069%
Splice-site0.00002%0.02068%0.20888%1.65663%
Synonymous0.00087%0.03988%0.09586%0.48173%

Each measure is a cumulative normal measure of the linear expression Φ(Xβ+θ¯(m)) where θ¯(m) is the average gene effect for MCMC sample m

6 Discussion

Current research has also aligned with our findings for Alzheimer’s disease. rs2075650 has been repeatedly noted as a target SNP to be in association with Alzheimer’s disease with other SNPs along its gene TOMM40. TOMM40 has been detected to associate with regions of white matter strength that are observed to integrate memory with the rest of the brain (Lyall et al., 2014; Kamboh et al., 2012). Although, we did not find other variants along the neighborhood of TOMM40 and APOE, our model complemented more recent literature on AD pathogenesis. For instance, we identified rs1461707 where there is an apparent shortage in descriptive research of this variant that resides on a long intergenic non-protein coding RNA between PIK3C3 and RIT2. Contemporary work suggests that differential expression of regulatory long non-coding RNA (lncRNA) fill important roles in the neurogenesis of late onset Alzheimer’s disease; equally as important, lncRNA have been shown in animal models to regulate expressions of genes associated with synaptic plasticity (Deng et al., 2017; Idda et al., 2018; Pereira Fernandes et al., 2018). rs906283 is located along the PIEZ02; depletion in expression of this gene has been shown to convert activated to non-responding subsets of dorsal-root ganglia sensory neurons for adult mice, hinting towards a disruptive relationship with somatosensory-cellular processes (Coste et al., 2010). One may hypothesize that low expression of this gene may be associated with the lack in balance or lack in pain perception that dementia patients experience, and visual diseases (Vounou et al., 2012). Finally, PML, the gene that contains rs11072463 has also been acknowledged from another ADNI GWAS, and noted in other studies to be crucial in central nervous integrity and activity (Moon et al., 2015). Altogether, these results indicate possible merit in incorporating multiple sub-traits of the phenotype of interest within GWA studies.

Supplementing models that evaluate genetic variants’ relationships with a phenotype with the additional information on those genetic variants have been previously shown to boost detection power. Additionally, due to the independence between the two components of our model, the prior probability modeling can be easily utilized for other likelihood contexts. We observed in our application that markers were assigned multiple gene-parents or consequences and this issue can be obviated with the integration of the marker location along the gene. Lewinger et al. (2007) discussed including a separate modeling step to account for linkage disequilibrium by borrowing information between neighboring SNPs.

We increased efficiency in estimating probabilities of marker-phenotype associations with the employment of non-local priors. Appropriately choosing τ provides most of the influence in accurately detecting alternative markers; when true τ is small which we observe in the GWAS context, then fluctuation in V provides negligible change to alternative marginal likelihood evaluation. We emphasize that for the data analysis we used a marginal log-likelihood maximization approach to choose the best fitting value for τ. One shortcoming is preset values for τ. Fixed choices for τ may not yield the appropriate profile for τ that external data can provide. We can address this by letting τ depend on patient-level clinical characteristics, similar to the approach by Lee et al. (2017). These additions of demographic variables encourages us to incorporate estimation of the marginal likelihoods as additional steps in the entire estimation scheme, to estimate the effect of patient demographics on SNP posterior probability of association. This is a promising goal that may alleviate the ambiguity and slight inefficiency in estimating a given data’s NLP parameters.

Acknowledgements

Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through contributions from: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research funds ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

Funding

This work was supported by T32 GM108557: NIGMS T32 Interdisciplinary Biostatistics Training in Genetics and Genomics grant, and by NIH grant 1R01-GM130622.

Conflict of Interest: none declared.

References

Albert
 
J.H.
,
Chib
S.
(
1993
)
Bayesian analysis of binary and polychotomous response data
.
J. Am. Stat. Assoc
.,
88
,
669
679
.

Benjamini
 
Y.
,
Hochberg
Y.
(
1995
)
Controlling the false discovery rate: a practical and powerful approach to multiple testing
.
J. R. Stat. Soc. Ser. B (Methodological)
,
57
,
289
300
. pages

Coste
 
B.
 et al.  (
2010
)
Piezo1 and piezo2 are essential components of distinct mechanically activated cation channels
.
Science
,
330
,
55
60
.

Deng
 
Y.
 et al.  (
2017
)
Plasma long noncoding RNA 51a as a stable biomarker of Alzheimer’s disease
.
Int. J. Clin. Exp. Pathol
.,
10
,
4694
4699
.

Dunson
 
D.B.
(
2010
)
Nonparametric Bayes applications to biostatistics. Bayesian nonparametrics
,
28
,
223
273
.

Hochberg
 
Y.
,
Tamhane
A.C.
(
1987
)
Multiple Comparison Procedures
.
John Wiley & Sons, Inc
.,
New York, NY, USA
.

Holm
 
S.
(
1979
)
A simple sequentially rejective multiple test procedure
.
Scand. J. Stat
.,
6
,
65
70
.

Idda
 
M.L.
 et al.  (
2018
)
Noncoding RNAs in Alzheimer’s disease
.
Wiley Interdiscipl. Rev. RNA
,
9
,
e1463.

Johnson
 
V.E.
,
Rossell
D.
(
2010
)
On the use of non-local prior densities in Bayesian hypothesis tests
.
J. R. Stat. Soc. Ser. B (Stat. Methodol.)
,
72
,
143
170
.

Johnson
 
V.E.
,
Rossell
D.
(
2012
)
Bayesian model selection in high-dimensional settings
.
J. Am. Stat. Assoc
.,
107
,
649
660
.

Kamboh
 
M.
 et al.  (
2012
)
Genome-wide association study of Alzheimer’s disease
.
Transl. Psychiatry
,
2
,
e117.

Lee
 
E.
 et al.  (
2017
)
Single-nucleotide polymorphisms are associated with cognitive decline at Alzheimer’s disease conversion within mild cognitive impairment patients
.
Alzheimer’s Dementia
.,
8
,
86
95
.

Lewinger
 
J.P.
 et al.  (
2007
)
Hierarchical Bayes prioritization of marker associations from a genome-wide association scan for further investigation
.
Genet. Epidemiol
.,
31
,
871
882
.

Lock
 
E.F.
,
Dunson
D.B.
(
2015
)
Shared kernel Bayesian screening
.
Biometrika
,
102
,
829
842
.

Lock
 
E.F.
,
Dunson
D.B.
(
2017
)
Bayesian genome-and epigenome-wide association studies with gene level dependence
.
Biometrics
,
73
,
1018
1028
.

Lyall
 
D.M.
 et al.  (
2014
)
Alzheimer’s disease susceptibility genes APOE and TOMM40, and brain white matter integrity in the Lothian Birth Cohort 1936
.
Neurobiol. Aging
,
35
,
1513.e25.

Moon
 
S.W.
 et al.  (
2015
)
Structural neuroimaging genetics interactions in Alzheimer’s disease
.
J. Alzheimer’s Dis
.,
48
,
1051
1063
.

Pereira Fernandes
 
D.
 et al.  (
2018
)
Long non-coding RNAs in neuronal aging
.
Noncoding RNA
,
4
,
12.

Pitman
 
J.
 et al.  (
2002
) Combinatorial stochastic processes. Technical report, Technical Report 621, Dept. Statistics, UC Berkeley, 2002. Lecture notes.

R Core Team (

2017
)
R: A Language and Environment for Statistical Computing
.
R Foundation for Statistical Computing
,
Vienna, Austria
.

Sanyal
 
N.
 et al.  (
2019
)
GWASinlps: non-local prior based iterative SNP selection tool for genome-wide association studies
.
Bioinformatics
,
35
,
1
11
.

Scott
 
J.G.
,
Berger
J.O.
(
2010
)
Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem
.
Ann. Stat
.,
38
,
2587
2619
.

Sethuraman
 
J.
(
1994
)
A constructive definition of Dirichlet priors
.
Stat. Sin
.,
4
:
639
650
.

Stephens
 
M.
,
Balding
D.J.
(
2009
)
Bayesian statistical methods for genetic association studies
.
Nat. Rev. Genet
.,
10
,
681.

Vounou
 
M.
 et al.  (
2012
)
Sparse reduced-rank regression detects genetic associations with voxel-wise longitudinal phenotypes in Alzheimer’s disease
.
Neuroimage
,
60
,
700
716
.

Yi
 
N.
 et al.  (
2014
)
Multiple comparisons in genetic association studies: a hierarchical modeling approach
.
Stat. Appl. Genet. Mol. Biol
.,
13
,
35
48
.

Zablocki
 
R.W.
 et al.  (
2017
) Semiparametric covariate-modulated local false discovery rate for genome-wide association studies. Ann. Appl. Stat.,
11
,
2252
2269
.

Zablocki
 
R.W.
 et al.  (
2014
)
Covariate-modulated local false discovery rate for genome-wide association studies
.
Bioinformatics
,
30
,
2098
2104
.

Author notes

Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: adni.loni.usc.edu

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
Associate Editor: John Hancock
John Hancock
Associate Editor
Search for other works by this author on: