Abstract

Motivation

In the context of genome-wide association studies (GWAS), there is a variety of statistical techniques in order to conduct the analysis, but, in most cases, the underlying genetic model is usually unknown. Under these circumstances, the classical Cochran-Armitage trend test (CATT) is suboptimal. Robust procedures that maximize the power and preserve the nominal type I error rate are preferable. Moreover, performing a meta-analysis using robust procedures is of great interest and has never been addressed in the past. The primary goal of this work is to implement several robust methods for analysis and meta-analysis in the statistical package Stata and subsequently to make the software available to the scientific community.

Results

The CATT under a recessive, additive and dominant model of inheritance as well as robust methods based on the Maximum Efficiency Robust Test statistic, the MAX statistic and the MIN2 were implemented in Stata. Concerning MAX and MIN2, we calculated their asymptotic null distributions relying on numerical integration resulting in a great gain in computational time without losing accuracy. All the aforementioned approaches were employed in a fixed or a random effects meta-analysis setting using summary data with weights equal to the reciprocal of the combined cases and controls. Overall, this is the first complete effort to implement procedures for analysis and meta-analysis in GWAS using Stata.

Availability and Implementation

A Stata program and a web-server are freely available for academic users at http://www.compgen.org/tools/GWAR

Supplementary information

Supplementary data are available at Bioinformatics online.

1 Introduction

In genome-wide association studies (GWAS), millions of single nucleotide polymorphisms (SNP) are read in a single genome (Manolio, 2010). Statistical issues in GWAS analysis include quality control, adjustment for population stratification, selection of the optimal statistical test for the analysis and tackling the problem of multiple testing. Some excellent overviews of these issues can be found in (Teo, 2008; Ziegler et al., 2008). Although GWAS have identified many variants associated with complex diseases, these currently explain little of the risk variability for most diseases. Common variants have weak effects and detection of signals requires large sample sizes (Chapman et al., 2011). As single GWAS are often underpowered, meta-analysis, being the statistical procedure for the collection and combination of the available information (Normand, 1999; Petiti, 1994; Trikalinos et al., 2008), increases power and decreases false positive findings. Therefore, most genetic risk variants discovered in the past few years have originated from large-scale meta-analyses of GWAS (Zeggini and Ioannidis, 2009).

In GWAS, the analysis is conducted using a variety of statistical techniques (Balding, 2006; Sasieni, 1997). However, when the genetic model (recessive, additive, dominant and so on) is unknown, robust tests, which retain high power across all plausible models, are preferable (Bagos, 2013; Freidlin et al., 2002). Some of the robust methods are based on the theory of efficiency robust procedures (Freidlin et al., 1999; Gastwirth, 1985), namely the Maximum Efficiency Robust Test (MERT) and the maximum (MAX) test (Freidlin et al., 2002). However, the use of the MAX statistic introduces an extra complexity in the analysis, as under the null hypothesis of no association, it does not follow the standard normal distribution, resulting in computationally intensive resampling-based procedures in order to estimate its P-value. Later, an asymptotic distribution was derived by simplifying the computation of the respective P-values and efficient, faster algorithms were developed (Zang et al., 2010). MAX is uniformly more powerful than Pearson’s chi-square statistic (χ22) when the underlying genetic model is recessive, additive, multiplicative and dominant (Joo et al., 2009). In the MIN2 approach, a combination of test statistics is used heuristically and the asymptotic null distribution is derived (Joo et al., 2009). It was shown that MAX has greater efficient robustness than χ22 and MIN2 when the genetic models are restricted to recessive, additive and dominant (Joo et al., 2009).

Several methods and software tools have been developed for meta-analysis of GWAS [e.g. METAL (Willer et al., 2010) and GWAMA (Magi and Morris, 2010)]. These methods have been employed successfully, with hundreds of GWAS meta-analyses being published (Panagiotou et al., 2013). However, all these tools perform standard meta-analysis assuming the genetic model beforehand (usually, the additive one). A comprehensive comparison of tools is reviewed elsewhere (Evangelou and Ioannidis, 2013). Some of the robust methods allow for a stratified analysis, which can be formulated as a fixed-effect meta-analysis (Hothorn and Hothorn, 2009; Pan et al., 2011; Zang and Fung, 2010, 2011; Zheng and Tian, 2006), whereas other approaches are direct extensions of the traditional methods of meta-analysis under random-effects models (Pereira et al., 2011; Zintzaras, 2010). Another approach is to perform a separate analysis for each study, using the robust method of choice (i.e. MAX, MERT or MIN2) and then combine the individual results by pooling the Z-values or the effect sizes in a fixed or random effects method, as described by Zhou et al. (2011). This approach, which is closer to the norm followed for meta-analysis of GWAS (de Bakker et al., 2008), requires only summary estimates and was adopted in this work.

The objective of this work is to present efficient implementations of the Cochran-Armitage trend test (CATT) under a recessive, additive and dominant model of inheritance and, most importantly, of the robust methods MERT (Freidlin et al., 2002), MAX (Zang et al., 2010) and MIN2 (Joo et al., 2009) in Stata and to offer an easy-to-use, fast and reliable software. All the aforementioned approaches were also employed in a fixed or a random effects meta-analysis setting for summary data.

2 Materials and methods

2.1 Notation

Consider a case-control association study with a biallelic locus (i.e. a SNP) with alleles A and B having frequencies PA and PB, respectively. The three possible genotypes are denoted by g0 = AA, g1 = AB and g2 = BB with respective frequencies πj = P(gj) for j = 0,1,2. We further denote the disease penetrances by fj = P(case|gj) and the disease prevalence by k = P(case) = Σfjπj. In a case-control study, r cases and s controls are sampled independently and the observed genotype counts (r0, r1, r2) and (s0, s1, s2) follow multinomial distributions, mul(r; p0, p1, p2) and mul(s; q0, q1, q2), respectively (Table 1), with pj = P(gj|case) = fjπj/k and qj = P(gj|control) = (1−fjj/(1−k) for j = 0,1,2. The null hypothesis is equivalent to H0: f0 = f1 =  f2 = k. Assuming that B is the risk allele, we may denote the genotype relative risks by γj = fj/f0 for i = 1,2 and f0 > 0. Then, the null hypothesis can be reformulated as H0: γ2 = γ1 = 1 against H1: γ2 ≥ γ1 ≥ 1 with at least one inequality. Standard models for disease penetrance that imply a specific relationship between genotype and disease include common dominant and common recessive models, multiplicative or additive models. Thus, the underlying genetic model of inheritance can be dominant, recessive, multiplicative or additive, if γ2 = γ1 > 1, γ2 > γ1 = 1, γ2 = 2γ1−1 > 1 and γ2 = γ12  > 1, respectively. The multiplicative and additive models are asymptotically equivalent and in many cases are indistinguishable (hence the general term co-dominant may be used sometimes) (Bagos, 2013; Clarke et al., 2011). Another less frequently encountered situation occurs when γ1 > γ2 = 1, in which case the genetic model is termed overdominant.

Table 1

The 2 × 3 contingency table with the distribution of cases and controls in a traditional GWAS concerning a single biallelic locus

AA (g0)AB (g1)BB (g2)Total
Cases r0 r1 r2 r 
Controls s0 s1 s2 s 
Total n0 n1 n2 n 
AA (g0)AB (g1)BB (g2)Total
Cases r0 r1 r2 r 
Controls s0 s1 s2 s 
Total n0 n1 n2 n 
Table 1

The 2 × 3 contingency table with the distribution of cases and controls in a traditional GWAS concerning a single biallelic locus

AA (g0)AB (g1)BB (g2)Total
Cases r0 r1 r2 r 
Controls s0 s1 s2 s 
Total n0 n1 n2 n 
AA (g0)AB (g1)BB (g2)Total
Cases r0 r1 r2 r 
Controls s0 s1 s2 s 
Total n0 n1 n2 n 

2.2 Analysis of GWAS

There is a variety of methods for analyzing data from GWAS (Balding, 2006; Langefeld and Fingerlin, 2007). Perhaps the most common analysis of genotypes in case-control studies under a genetic model-free (GMF) perspective is to examine the association between the rows and columns of the 2 × 3 contingency table (Table 1). This can be performed using the traditional Pearson’s chi-square statistic (χ22), which is distributed following a chi-square distribution with two degrees of freedom (Balding, 2006; Langefeld and Fingerlin, 2007). Another option would be to calculate the two odds-ratios that correspond to the comparison of the genotypes carrying the risk allele (i.e. g1 and g2) against g0. Such tests can be performed in a logistic regression framework using the case/control status as the dependent variable (Balding, 2006). Alternatively, one may compute the odds-ratios using the summary counts of the contingency table (Chen and Chatterjee, 2007). Asymptotically, these methods are equivalent to χ22, but the major disadvantage of all these GMF methods lies in the lack of power.

At the same time, methods that are based on prior assumption on the genetic model also exist. These methods involve some sort of re-arrangement of the data or a differential scoring of the genotypes (Sasieni, 1997; Zheng et al., 2003). For instance, if we re-arrange the data of Table 1 by grouping genotypes g1 and g2 and perform a χ12, we explicitly assume a dominant model of inheritance; if we group g0 and g1 and compare them to g2, we are testing the recessive model. An allele-based comparison implies an additive genetic model under the assumption of HWE. Nevertheless, the most commonly used test is CATT (Armitage, 1955; Cochran, 1954), which is a score test for linear trend of the proportions in the contingency table. Assuming a general scoring scheme xi = (x0, x1, x2) for the genotypes (g0, g1, g2), a Z-test is formulated as:
ZCATT(x)=UxvarH0(Ux)=ni=02xi(srirsi)rsn[ni=02xi2ni(i=02xini)2]N(0,1)
(1)
where Ux is the score statistic and varHo(Ux) its variance evaluated under H0. The scores are usually set to xi = (0, ½, 1), in which case the statistic tests the additive model of inheritance (ZCATT(½)). CATT is invariant under linear transformations and, additionally, it does not rely on the assumption of HWE. It should be noted that the corresponding CATT is the most powerful statistic under the dominant, the additive and the recessive model, a fact that explains its widespread use in GWAS (Zheng et al., 2003). The CATT(½) is routinely used in real life applications, since it is more powerful when compared against the χ22 and does not assume HWE.

2.3 Robust analysis of GWAS

The most prominent methods for robust analysis are based on the theory of efficiency robust procedures (Freidlin et al., 1999; Gastwirth, 1985), namely the MERT and the MAX test (Freidlin et al., 2002). For a family of optimal tests, MERT is a linear combination of the two optimal tests for the extreme members of the family, i.e., the tests with the minimum correlation (ρ). Two conditions need to be satisfied; (i) ρ ≥ 0, and (ii) ρ + 1 must be less than the sum of the other two correlations. Henceforth, we focus on the optimal tests for recessive, additive and dominant models. Freidlin et al. (2002) showed that conditions (i) and (ii) hold if recessive and dominant relationships are modeled. Thus, MERT is expressed as a linear combination of the optimal CATTs for the recessive and dominant models and is asymptotically normally distributed:
ZMERT=ZCATT(0)+ZCATT(1)2(1+ρCATT(0,1))N(0,1)
(2)
In Eq. (2), ZCATT(0) and ZCATT(1) denote the Z-values under a recessive and dominant model respectively and ρCATT(0,1) denotes the correlation for (ZCATT(0), ZCATT(1)) (Freidlin et al., 2002). Expressions for the correlations among the three trend tests were correctly presented in (Zang et al., 2010). The MAX test is based on the simple idea to test all three possible models and choose the one with the highest score (note that we use the absolute value for completeness, in case A is the risk allele):
ZMAX=max(|ZCATT(0)|,|ZCATT(1/2)|,|ZCATT(1)|)
(3)
However, for calculating the P-value (i.e. P(|ZMAX|>t)), we need to take into account the correlations in order to avoid inflation of type I error rate. Freidlin and coworkers estimated the correlations of the three test statistics under H0 (Freidlin et al., 2002), but, even in that case, simulations or numerical integration are needed; this makes the method computationally intensive. MAX is more powerful, but MERT is easier to apply since the approximate P-value can be obtained from a standard normal distribution (Freidlin et al., 2002). Traditionally, Monte Carlo simulations were used for P-value calculations of MAX, but several less computationally intensive approximations (rhombus formula) have been described in Li et al. (2008). Recently, Zang et al. found that ZCATT(0), ZCATT(½) and ZCATT(1) are linearly dependent, a result that allowed them to develop faster algorithms for calculating the statistical significance of MAX (Zang et al., 2010). Thus, the P-value for the MAX statistic is given by:
P(ZMAX<t)=20t(1ω1)/ω0Φ(tρCATT(0,1)z01ρCATT(0,1)2)ϕ(z0)dz0+2t(1ω1)/ω0tΦ((tω0z0)/ω1ρCATT(0,1)z01ρCATT(0,1)2)ϕ(z0)dz020tΦ(tρCATT(0,1)z01ρCATT(0,1)2)ϕ(z0)dz0
(4)
where:
ω0=ρCATT(0,1/2)ρCATT(0,1)ρCATT(1/2,1)1ρCATT(0,1)2
(5)
ω1=ρCATT(1/2,1)ρCATT(0,1)ρCATT(0,1/2)1ρCATT(0,1)2
(6)
In Equation (5–6), ρCATT(0,1/2) denotes the correlation (ZCATT(0), ZCATT(1/2)) and ρCATT(1/2,1) the correlation (ZCATT(1/2), ZCATT(1)). MIN2 is an interesting robust approach that was adopted by investigators of the Wellcome Trust Case-Control Consortium (WTCCC, 2007). They applied the χ22 along with the CATT(½) and, subsequently, chose the minimum of the P-values:
(7)
The use of MIN2 is justified by simulations showing that χ22 has 5% less power compared with MAX and outperforms MERT, except when the additive model holds (Zheng et al., 2006). However, MIN2 is not a proper P-value since the statistics are correlated and multiple tests are performed. Later, Joo et al. (2009) derived the joint distribution needed in order to calculate a proper P-value:
P(ZCATT(1/2)2<t1,Tχ222<t2)=      {112et1212et22+12πt1t2eu2arcsin(2t1u1)du,t1<t21et22,t1>t2
(8)

Unlike MAX, MIN2 is independent of the allele frequency. Additionally, the P-value of MIN2 is strictly increasing in MIN2. Thus, MIN2 can be directly used to rank all SNPs as WTCCC (2007) did. Simulations have shown that MIN2 is slightly less powerful (1–4%) than MAX when the recessive model holds; it outperforms MAX under the additive model, whereas, for the dominant model, both tests perform similarly. We also observe that MAX needs three integrals whereas MIN2 only one and thus it is slower. The actual complexity depends on the number of integration points used (in our implementation we chose 100).

2.4 Meta-analysis of GWAS

Meta-analysis is a method that aims at increasing the power to detect weak genotype effects, estimating the effect size with greater precision and investigating between-studies variability. Meta-analysis of GWAS using summary statistics, in contrast to direct analysis of pooled individual-level data, diminishes the limitations that are imposed by restrictions on sharing individual-level data. Empirical findings (Janssens et al., 2009) suggest that using time-consuming individual data methods for meta-analysis of GWAS does not necessarily lead to increase efficiency (Lin and Zeng, 2010a,b) and thus, summary methods are routinely used for meta-analyses of GWAS (de Bakker et al., 2008; Magi and Morris, 2010; Willer et al., 2010).

Some of the statistics discussed in the previous sections, such as the CATT, are formed as a score test:
Zscore(i)=UβivarH0(Uβi)=UβiI^(βi)
(9)
where Uβi is the score statistic from study i, and I^(βi) the estimate of the information matrix evaluated at H0 (i.e. βi^=0). Other tests (i.e. those based on the log-odds ratios) are formulated as a Wald statistic:
ZWald(i)=β^ivar(β^i)=β^iI^(βi)1=β^iI^(βi)
(10)

Here, β^i is the maximum likelihood estimate and its variance var(β^i) is calculated at β^i. If both tests evaluate the same null hypothesis and if β^i is small in absolute value, the score and Wald statistics are asymptotically equivalent under H0 (i.e. β^i = 0).

Even though optimal methods for combining the results of independent studies that used a Wald statistic are available for years and described in standard textbooks (Normand, 1999; Petiti, 1994), optimal weights for combining the results of score statistics such as the CATT had not been presented until recently (Zhou et al., 2011). Consider a meta-analysis of k studies under a fixed-effect model in which the true effect is common across the k studies (i.e. βi ∼ N(β, si2)). Then, the pooled Z will be given by the weighted average:
Z¯=i=1kwiZii=1kwi
(11)
Zhou and coworkers have shown that, in any case, the optimal weights will be given by wi=I^(βi) (Zhou et al., 2011). It should be noted that when we are pooling Z-values, irrespective of the method by which they have arisen, I^(βi) is usually not available to the meta-analyst. An approach commonly followed in the literature is to weigh the studies proportionally to the study-specific sample sizes (de Bakker et al., 2008; Magi and Morris, 2010; Willer et al., 2010). However, Zhou et al. showed that even though this scheme performs well for most practical applications, it is suboptimal and proposed that a good choice for the weights would be wi=risi/ni=(1/ri+1/si)1. Equation (11) is a general formula and in the case of pooling a score statistic, it reduces to:
Z¯score=i=1kUβii=1kI^(βi)
(12)
which is a well-known expression for the stratified score statistic obtained from the sum of k log-likelihoods (i.e. a Mantel-Haenszel type statistic). In case we pool a Wald statistic, Equation (11) is equivalent to obtaining a pooled estimate of β:
β^=i=1kwiβii=1kwi
(13)

This estimate is the well-known inverse-variance estimate commonly used in meta-analysis, which utilizes the estimated standard errors of the beta coefficients as weights. Asymptotically, weighting by the sample size (or the method proposed by Zhou and coworkers) and the standard errors are equivalent when the trait distribution is identical across samples (such that standard errors are a predictable function of sample size).

In the presence of between-studies heterogeneity, a random-effects model for pooling the estimates is preferable, assuming βi ∼ N(β, si2 + τ2) with τ2 being the between-studies variance estimate. Usually, τ2 is estimated from the data using the method of moments, even though there are several alternatives. The problem, however, with robust statistics is that only a P-value or a Z-score is reported, and not a Wald statistics along with its variance. Thus, we relied on an approximation proposed by Zhou and coworkers (Zhou et al., 2011). They used the same method for modeling the components of the score test and showed that, when βi is small, the asymptotic mean of Uβi can be approximated by βiI^(βi). Thus, after calculating the appropriate weights equal to wi=I^(βi)=(1/ri+1/si)1, we use the Z-score to calculate an effect size, taking the direction of the association into account:
β^i=UβiI^(βi)=ZiI^(βi)=Zi(1/ri+1/si)1
(14)

We can then perform a standard random-effects meta-analysis; we only need to keep in mind that the actual effect sizes calculated by Equation (14), as well as by the meta-analysis, are not valid. Consequently, we have to rely only on the overall Z-score and the overall P-value, which are valid under any circumstances. This method can be regarded as an extension of the Stouffer’s method (Equation 11)), which was one of the first methods that combined P-values. Thus, in principle other methods that combine P-values could be used, but these have serious disadvantages: the direction of the association is not taken into consideration and therefore all P-values have to be one-sided, and, most importantly, they do not account for between-studies heterogeneity.

3 Results

3.1 Implementation

GWAR is implemented in Stata (StataCorp). For the MAX and MIN2 statistics, we implement the methods described in Equations (4) and (8), respectively, that require integration. For this purpose, we use numerical integration with Gauss-Legendre quadrature implemented in the integrate command that uses Mata, and this results in great gain in computing time. For the meta-analysis part, we use the metan command in Stata. Due to integrate, GWAR requires Stata v12 or above. GWAR is implemented as an ado file as well as a web-server available at www.compgen.org/tools/GWAR.

3.2 Usage

There are two programs: gwar and min2gwar. The syntax for gwar is: gwar varlist, Method(string) [SNP(varlist min = 1 max = 1) EFfect(string)]. varlist contains the variables corresponding to the genotypes for cases and controls. The first three variables are the genotypes of controls (e.g. aa0, ab0, bb0), and the remaining ones the genotypes of cases (e.g. aa1, ab1, bb1). Allele b is assumed to be the risk variant. In the single-study mode, the user needs to specify only the genotypes and the method of analysis (MAX, MIN2, MERT as well as the three versions of CATT). In the meta-analysis mode, the user also needs to specify the variable that indicates the SNPs and the requested method of meta-analysis (fixed or random effects). Meta-analysis is performed, using Stata’s metan command, by grouping the observations with the same SNP, so it is the user’s responsibility to ensure that data are in the appropriate format when the program is called. For fast grouping of large datasets the program uses the ftools library, which the user has to install.min2gwar is the version that uses the P-values corresponding to the Pearson′s chi-square test and the CATT(1/2) directly. In the single-study mode, its use is simple (min2gwar p1 p2). In the meta-analysis mode, the user needs also to provide three additional variables, the total number of cases and controls and a variable (−1, +1) for the direction of the association. Similarly, the meta-analysis is performed by grouping the observations with the same SNP, so it is the user's responsibility to ensure that data are in the appropriate format when the program is called.

The web-server provides the functionality for uploading a file, or it can be used as a simple online calculator in which the user can fill the genotype counts and receive the results. For large datasets, we allow uploading compressed files in most popular formats, and we also allow the user to receive the results by email.

3.3 Performance

GWAR is fast, reliable and possesses features currently not available in other similar packages. The benchmark was performed on an Intel Xeon CPU E5-2660v2, 2.20GHz server with 32 GB of RAM using STATA/SE. A GWAS with 100 000 SNPs can be analyzed within 1-2 hours, even using the computationally intensive MAX or MIN2 methods (the analysis using CATT or MERT is performed within seconds), whereas, for a meta-analysis of five studies and 100 000 SNPs, the results are produced depending on the method, from half an hour to four hours (Supplementary Table S1). The accuracy can be seen with a comparison against results known in the literature. We need to mention that, due to the approximate formula employed, GWAR can calculate P-values up to 1018 for MAX and 1052 for MIN2, values that are nevertheless outside of the range of any reasonable simulation method (for the latter one would need more than 1052 repetitions, a number which is untenable).

In order to compare the robust methods (MERT, MAX and MIN2), we performed a simulation study. We simulated 500 cases and 500 controls. We considered different values for the risk allele frequency (RAF) ranging from 0.1 to 0.5, the odds-ratio for the g2 versus g0 comparison (1–2), and the genetic model of inheritance (dominant, additive and recessive). The results indicate that, for each mode of inheritance, the respective CATT is, as one would expect, more powerful compared with every other test statistic (see Supplementary Table S1, Supplementary Fig. S1). For the recessive model, MAX is more powerful than MERT and MIN2, with differences ranging from 0.2 to 13.2%. Moreover, MAX is almost uniformly more powerful than CATT(1/2) which is routinely used in GWAS with a difference in power that exceeds 10% for odds-ratios >1.5. The latter also holds for MERT which can be used as an alternative to reduce complexity that arises from the computation of MAX and MIN2. Under the additive model, MIN2 outperforms MAX, but the difference in power is <3% for odds-ratios <1.5. For the dominant model, MAX and MIN2 have similar performance. MERT is the least powerful, with a difference in power up to 17.4% for RAF = 0.1 when compared against MAX.

3.4 Application to real studies

For illustration purposes, we re-analyzed the 22 SNPs with an estimated P-value [by the rhombus formula for approximating the P-value of MAX (Li et al., 2008)] below 105 from a GWAS of Coronary Artery Disease (CAD) (WTCCC, 2007). The results from the MAX method of analysis in GWAR have a remarkable agreement with those obtained using the rhombus formula (Fig. 1). It should also be noted that both approaches can be used to estimate extremely low P-values (say <107) where resampling procedures are not easily applicable (or they are too demanding).

Fig. 1

The comparison of P-values obtained from the MAX method of analysis in GWAR versus those obtained using the rhombus formula for 22 chosen SNPs from the GWAS of CAD

We also analyzed the data from a GWAS that was designed to identify susceptibility loci for type-2 diabetes (Sladek et al., 2007). We calculated the P-values from the MAX method of analysis in GWAR and compared them to those obtained by the authors using a permutation approach as well as using a proposed asymptotic distribution of the MAX statistic (Gonzalez et al., 2008). We observe an almost perfect correlation among the results with consistently, but only slightly higher P-values obtained by GWAR (Fig. 2).

Fig. 2

The comparison of P-values obtained from the MAX method of analysis in GWAR versus those obtained using a permutation approach and the asymptotic distribution of the MAX statistic (Gonzalez et al., 2008) for 67 SNPs from the GWAS of type-2 diabetes. Four extra SNPs (rs7903146, rs12255372, rs10885409 and rs7904519) were also reported, all with estimated P-values: <3.30E-10 for the permutation approach, 6.36E-19, 9.33E-15, 2.47E-11 and 5.60E-11 for the asymptotic procedure and 3.77E-15, 1.89E-14, 4.21E-11 and 6.44E-11 for GWAR, respectively

We finally analyzed 6 SNPs from a hypertension GWAS (WTCCC, 2007). We calculated the P-values from the MAX and MIN2 method of analysis in GWAR and compared them to those reported by (Joo et al., 2009). The results are identical for MIN2 with some small discrepancies for the MAX test (Table 2).

Table 2

Application to the SNPs associated with hypertension in WTCCC (2007) with MIN2 and MAX robust tests

MIN2 reported in (Joo et al., 2009)MIN2 in gwarMAX reported in (Joo et al., 2009)MAX in gwar
rs2820037 1.31E-06 1.31E-06 3.16E-06 3.23E-06 
rs6997709 1.34E-05 1.34E-05 2.06E-05 2.07E-05 
rs7961152 1.25E-05 1.25E-05 1.80E-05 2.01E-05 
rs11110912 1.56E-05 1.56E-05 8.16E-06 8.15E-06 
rs1937506 1.56E-05 1.56E-05 2.46E-05 2.43E-05 
rs2398162 9.64E-06 9.64E-06 2.50E-06 2.42E-06 
MIN2 reported in (Joo et al., 2009)MIN2 in gwarMAX reported in (Joo et al., 2009)MAX in gwar
rs2820037 1.31E-06 1.31E-06 3.16E-06 3.23E-06 
rs6997709 1.34E-05 1.34E-05 2.06E-05 2.07E-05 
rs7961152 1.25E-05 1.25E-05 1.80E-05 2.01E-05 
rs11110912 1.56E-05 1.56E-05 8.16E-06 8.15E-06 
rs1937506 1.56E-05 1.56E-05 2.46E-05 2.43E-05 
rs2398162 9.64E-06 9.64E-06 2.50E-06 2.42E-06 
Table 2

Application to the SNPs associated with hypertension in WTCCC (2007) with MIN2 and MAX robust tests

MIN2 reported in (Joo et al., 2009)MIN2 in gwarMAX reported in (Joo et al., 2009)MAX in gwar
rs2820037 1.31E-06 1.31E-06 3.16E-06 3.23E-06 
rs6997709 1.34E-05 1.34E-05 2.06E-05 2.07E-05 
rs7961152 1.25E-05 1.25E-05 1.80E-05 2.01E-05 
rs11110912 1.56E-05 1.56E-05 8.16E-06 8.15E-06 
rs1937506 1.56E-05 1.56E-05 2.46E-05 2.43E-05 
rs2398162 9.64E-06 9.64E-06 2.50E-06 2.42E-06 
MIN2 reported in (Joo et al., 2009)MIN2 in gwarMAX reported in (Joo et al., 2009)MAX in gwar
rs2820037 1.31E-06 1.31E-06 3.16E-06 3.23E-06 
rs6997709 1.34E-05 1.34E-05 2.06E-05 2.07E-05 
rs7961152 1.25E-05 1.25E-05 1.80E-05 2.01E-05 
rs11110912 1.56E-05 1.56E-05 8.16E-06 8.15E-06 
rs1937506 1.56E-05 1.56E-05 2.46E-05 2.43E-05 
rs2398162 9.64E-06 9.64E-06 2.50E-06 2.42E-06 

4 Discussion

We presented GWAR, a fast and reliable tool for analysis and meta-analysis of GWAS. We implemented both standard CATTs as well as the most widely used robust methods, namely MERT, MAX and MIN2. For MAX and MIN2, which are also the most powerful among the robust methods, we implemented accurate and fast routines that rely on numerical integration in order to calculate the P-value accurately. GWAR is implemented as an ado file for Stata 12 or higher, as well as a web-server available at www.compgen.org/tools/GWAR.

GWAR offers functionalities not available in other similar tools for the analysis of GWAS (coin (Hothorn and Hothorn, 2009), SNPassoc (Gonzalez et al., 2007), Rassoc (Zang et al., 2010), RobustSNP (So and Sham, 2011), GenABEL (Aulchenko et al., 2007) and Plink (Purcell et al., 2007). First of all, it implements MIN2, which is currently not available in any statistical package. MAX is also available in coin, SNPassoc, Rassoc and RobustSNP, whereas MERT is available only in Rassoc. Even though MAX implementation in Stata is among the second slowest found in these tools (it surpasses only RobustSNP), MERT and the various CATTs are the fastest ones (Supplementary Tables S1 and S3).

GWAR is also unique in terms of meta-analysis since it is the only tool that implements meta-analysis methods using the aforementioned robust statistics. Similar methods are not available in other popular packages for meta-analysis of GWAS, like METAL (Willer et al., 2010), GWAMA (Magi and Morris, 2010), PLINK (Purcell et al., 2007) and MetABEL (Aulchenko et al., 2007). Even for the standard meta-analysis that assumes a model of inheritance beforehand, GWAR is unique since it offers the option for both fixed and random effects meta-analysis. For comparison purposes, we need to mention that only PLINK and GWAMA offer the option for random effects meta-analysis, whereas MetABEL and METAL allow only fixed effect. Moreover, GWAMA, PLINK and METAL, although computational optimal, require precalculated log-odds ratios from the individual studies assuming a specific model of inheritance beforehand. PLINK and METAL offer also an option for Z-score-based P-values meta-analysis, weighted by the sample size. Finally, MetABEL allows for meta-analysis of CATT(1/2) using the qtscore function.

The web-server that we implemented is also unique of its kind since it allows researchers to upload files and receive the results without having to run the software locally. The tool requires only summary genotype counts, and thus security is not a main concern (even though we guarantee the privacy of the results, and in any case the user may not provide the list with the actual SNP ids). Nevertheless, there is theoretical evidence that summary statistics provide the same efficiency of the estimates (Lin and Zeng, 2010a,b) and meta-analysis of GWAS is usually performed using summary estimates from the included studies (Begum et al., 2012; de Bakker et al., 2008; Magi and Morris, 2010; Willer et al., 2010). However, MIN2 statistic can also be used even without the genotype counts. The user may only provide the P-values from the Pearson’s chi-square and the standard CATT (for the single study analysis), or additionally the total number of cases and controls and the direction of the association (for the meta-analysis). With this option, the analyst may need to perform some calculations, but he/she avoids additional complications arising from confidentiality agreements, especially when it comes to large consortia in meta-analysis of GWAS.

Our simulations showed that MAX is more powerful than MIN2 (except when the additive model holds), and MIN2 is always more powerful than MERT. However, the computation time needed is in the reverse order (MERT is faster than MIN2 and MIN2 faster than MAX). All these should be taken into account by the user. For instance, if time is not an issue, MAX should be used. MIN2 is a good compromise (especially if we consider that most associations behave like additive), whereas MERT is fast and could be used for really large datasets, or as an exploratory tool.

Currently, GWAR allows only for analysis of case-control studies. Possible future improvements of GWAR will include the ability to analyse also quantitative traits. To this end however, only the MERT and MAX statistics will be possible to implement, since MIN2 is inherently bound to case-control studies. Another possible extension would be to allow for covariates since it is common that GWAS suffer from several hidden biases that need to be adjusted for. In most GWAS, some type of adjustment is used by allowing for covariates (e.g. gender, age, eigenvectors), but the majority of robust methods are not easily amenable for such modifications. In principle, this could be possible for GWAR by modifying the score statistic and using individual data. This would be a major update of the tool and we plan it for the future. However, one of the most important sources of bias in GWAS, which is population stratification, that can inflate the test statistics, can be estimated and adjusted for, without the modification of the basic method (Clayton et al., 2005). One can generate a quantile–quantile plot of the observed test statistics for disease association versus each of the SNPs included in the analysis. Usually, in such approaches SNPs with extreme deviations from HWE in either controls or cases are excluded, and the same has to be done for the top 10% of the SNPs. The tests can then be ranked from smallest to largest and plotted against their expected values under the global null hypothesis that no true association exists. The slope of the fitted line would produce an estimate of the inflation and suggest adjustments to the significance level. Finally, another possible extension will be to implement the tool in a standalone version (e.g. in Perl programming language), in order to increase the speed and minimize the dependence on external software.

Acknowledgements

The authors also gracefully acknowledge NGP-net COST Action (BM1405) for support and the three reviewers for their valuable comments and suggestions that improved the quality of the article.

Funding

The authors are financially supported by the project ‘Integration of Data from Multiple Sources’ (IntDaMus), which is implemented under the ‘ARISTEIA ΙΙ’ Action of the ‘OPERATIONAL PROGRAMME EDUCATION AND LIFELONG LEARNING’ and is co-funded by the European Social Fund and National Resources.

Conflict of Interest: none declared.

References

Armitage
 
P.
(
1955
)
Tests for linear trends in proportions and frequencies
.
Biometrics
,
11
,
375
386
.

Aulchenko
 
Y.S.
 et al.  (
2007
)
GenABEL: an R library for genome-wide association analysis
.
Bioinformatics
,
23
,
1294
1296
.

Bagos
 
P.G.
(
2013
)
Genetic model selection in genome-wide association studies: robust methods and the use of meta-analysis
.
Stat. Appl. Genet. Mol. Biol
.,
12
,
285
308
.

Balding
 
D.J.
(
2006
)
A tutorial on statistical methods for population association studies
.
Nat. Rev. Genet
.,
7
,
781
791
.

Begum
 
F.
 et al.  (
2012
)
Comprehensive literature review and statistical considerations for GWAS meta-analysis
.
Nucleic Acids Res
.,
40
,
3777
3784
.

Chapman
 
K.
 et al.  (
2011
)
Defining the power limits of genome-wide association scan meta-analyses
.
Genet. Epidemiol
.,
35
,
781
789
.

Chen
 
J.
,
Chatterjee
N.
(
2007
)
Exploiting Hardy-Weinberg equilibrium for efficient screening of single SNP associations from case-control studies
.
Hum. Hered
.,
63
,
196
204
.

Clarke
 
G.M.
 et al.  (
2011
)
Basic statistical analysis in genetic case-control studies
.
Nat. Protoc
.,
6
,
121
133
.

Clayton
 
D.G.
 et al.  (
2005
)
Population structure, differential bias and genomic control in a large-scale, case-control association study
.
Nat. Genet
.,
37
,
1243
1246
.

Cochran
 
W.G.
(
1954
)
Some methods for strengthening the common chi-squared tests
.
Biometrics
,
10
,
417
451
.

de Bakker
 
P.I.
 et al.  (
2008
)
Practical aspects of imputation-driven meta-analysis of genome-wide association studies
.
Hum. Mol. Genet
.,
17
,
R122
R128
.

Evangelou
 
E.
,
Ioannidis
J.P.
(
2013
)
Meta-analysis methods for genome-wide association studies and beyond
.
Nat. Rev. Genet
.,
14
,
379
389
.

Freidlin
 
B.
 et al.  (
1999
)
Efficiency robust tests for survival or ordered categorical data
.
Biometrics
,
55
,
883
886
.

Freidlin
 
B.
 et al.  (
2002
)
Trend tests for case-control studies of genetic markers: power, sample size and robustness
.
Hum. Hered
.,
53
,
146
152
.

Gastwirth
 
J.L.
(
1985
)
The use of maximin efficiency robust tests in combining contingency tables and survival analysis
.
J. Am. Stat. Assoc
.,
80
,
380
384
.

Gonzalez
 
J.R.
 et al.  (
2007
)
SNPassoc: an R package to perform whole genome association studies
.
Bioinformatics
,
23
,
644
645
.

Gonzalez
 
J.R.
 et al.  (
2008
)
Maximizing association statistics over genetic models
.
Genet. Epidemiol
.,
32
,
246
254
.

Hothorn
 
L.A.
,
Hothorn
T.
(
2009
)
Order-restricted scores test for the evaluation of population-based case-control studies when the genetic model is unknown
.
Biom. J
.,
51
,
659
669
.

Janssens
 
A.C.
 et al.  (
2009
)
An empirical comparison of meta-analyses of published gene-disease associations versus consortium analyses
.
Genet. Med
.,
11
,
153
162
.

Joo
 
J.
 et al.  (
2009
)
A robust genome-wide scan statistic of the Wellcome Trust Case-Control Consortium
.
Biometrics
,
65
,
1115
1122
.

Langefeld
 
C.D.
,
Fingerlin
T.E.
(
2007
)
Association methods in human genetics
.
Methods Mol. Biol
.,
404
,
431
460
.

Li
 
Q.
 et al.  (
2008
)
Efficient approximation of P-value of the maximum of correlated tests, with applications to genome-wide association studies
.
Ann. Hum. Genet
.,
72
,
397
406
.

Lin
 
D.Y.
,
Zeng
D.
(
2010a
)
Meta-analysis of genome-wide association studies: no efficiency gain in using individual participant data
.
Genet. Epidemiol
.,
34
,
60
66
.

Lin
 
Y.
,
Zeng
D.
(
2010b
)
On the relative efficiency of using summary statistics versus individual-level data in meta-analysis
.
Biometrika
,
97
,
321
332
.

Magi
 
R.
,
Morris
A.P.
(
2010
)
GWAMA: software for genome-wide association meta-analysis
.
BMC Bioinformatics
,
11
,
288.

Manolio
 
T.A.
(
2010
)
Genomewide association studies and assessment of the risk of disease
.
N. Engl. J. Med
.,
363
,
166
176
.

Normand
 
S.L.
(
1999
)
Meta-analysis: formulating, evaluating, combining, and reporting
.
Stat. Med
.,
18
,
321
359
.

Pan
 
D.
 et al.  (
2011
)
Robust joint analysis allowing for model uncertainty in two-stage genetic association studies
.
BMC Bioinformatics
,
12
,
9.

Panagiotou
 
O.A.
 et al.  (
2013
)
The power of meta-analysis in genome-wide association studies
.
Annu. Rev. Genomics Hum. Genet
.,
14
,
441
465
.

Pereira
 
T.V.
 et al.  (
2011
)
Strategies for genetic model specification in the screening of genome-wide meta-analysis signals for further replication
.
Int. J. Epidemiol
.,
40
,
457
469
.

Petiti
 
D.B.
(
1994
) Meta-analysis decision analysis and cost-effectiveness analysis. In:
Monographs in Epidemiology and Biostatistics
.
Oxford University Press
,
Oxford
.

Purcell
 
S.
 et al.  (
2007
)
PLINK: a tool set for whole-genome association and population-based linkage analyses
.
Am. J. Hum. Genet
.,
81
,
559
575
.

Sasieni
 
P.D.
(
1997
)
From genotypes to genes: doubling the sample size
.
Biometrics
,
53
,
1253
1261
.

Sladek
 
R.
 et al.  (
2007
)
A genome-wide association study identifies novel risk loci for type 2 diabetes
.
Nature
,
445
,
881
885
.

So
 
H.C.
,
Sham
P.C.
(
2011
)
Robust association tests under different genetic models, allowing for binary or quantitative traits and covariates
.
Behav. Genet
.,
41
,
768
775
.

Teo
 
Y.Y.
(
2008
)
Common statistical issues in genome-wide association studies: a review on power, data quality control, genotype calling and population structure
.
Curr. Opin. Lipidol
.,
19
,
133
143
.

Trikalinos
 
T.A.
 et al.  (
2008
)
Meta-analysis methods
.
Adv. Genet
.,
60
,
311
334
.

Willer
 
C.J.
 et al.  (
2010
)
METAL: fast and efficient meta-analysis of genomewide association scans
.
Bioinformatics
,
26
,
2190
2191
.

WTCCC
(
2007
)
Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
.
Nature
,
447
,
661
678
.

Zang
 
Y.
,
Fung
W.K.
(
2010
)
Robust tests for matched case-control genetic association studies
.
BMC Genet
.,
11
,
91.

Zang
 
Y.
,
Fung
W.K.
(
2011
)
Robust Mantel-Haenszel test under genetic model uncertainty allowing for covariates in case-control association studies
.
Genet. Epidemiol
.,
35
,
695
705
.

Zang
 
Y.
 et al.  (
2010
)
Simple algorithms to calculate asymptotic null distribution for robust tests in case-control genetic association studies in R
.
J. Stat. Softw
.,
33
,

Zeggini
 
E.
,
Ioannidis
J.P.
(
2009
)
Meta-analysis in genome-wide association studies
.
Pharmacogenomics
,
10
,
191
201
.

Zheng
 
G.
 et al.  (
2006
)
Comparison of robust tests for genetic association using case-control studies
.
IMS Lect. Notes Monogr. Ser
.,
49
,
253
265
.

Zheng
 
G.
 et al.  (
2003
)
Choice of scores in trend tests for case-control studies of candidate-gene associations
.
Biometric. J
.,
45
,
335
348
.

Zheng
 
G.
,
Tian
X.
(
2006
)
Robust trend tests for genetic association using matched case-control design
.
Stat. Med
.,
25
,
3160
3173
.

Zhou
 
B.
 et al.  (
2011
)
Optimal methods for meta-analysis of genome-wide association studies
.
Genet. Epidemiol
.,
35
,
581
591
.

Ziegler
 
A.
 et al.  (
2008
)
Biostatistical aspects of genome-wide association studies
.
Biom. J
.,
50
,
8
28
.

Zintzaras
 
E.
(
2010
)
The generalized odds ratio as a measure of genetic risk effect in the analysis and meta-analysis of association studies
.
Stat. Appl. Genet. Mol. Biol
.,
9
,
Article21.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/about_us/legal/notices)
Associate Editor: Oliver Stegle
Oliver Stegle
Associate Editor
Search for other works by this author on:

Supplementary data