Abstract

Motivation

Common small-effect genetic variants that contribute to human complex traits and disease are typically identified using traditional fixed-effect (FE) meta-analysis methods. However, the power to detect genetic associations under FE models deteriorates with increasing heterogeneity, so that some small-effect heterogeneous loci might go undetected. A modified random-effects meta-analysis approach (RE2) was previously developed that is more powerful than traditional fixed and random-effects methods at detecting small-effect heterogeneous genetic associations, the method was updated (RE2C) to identify small-effect heterogeneous variants overlooked by traditional fixed-effect meta-analysis. Here, we re-appraise a large-scale meta-analysis of coronary disease with RE2C to search for small-effect genetic signals potentially masked by heterogeneity in a FE meta-analysis.

Results

Our application of RE2C suggests a high sensitivity but low specificity of this approach for discovering small-effect heterogeneous genetic associations. We recommend that reports of small-effect heterogeneous loci discovered with RE2C are accompanied by forest plots and standardized predicted random-effects statistics to reveal the distribution of genetic effect estimates across component studies of meta-analyses, highlighting overly influential outlier studies with the potential to inflate genetic signals.

Availability and implementation

Scripts to calculate standardized predicted random-effects statistics and generate forest plots are available in the getspres R package entitled from https://magosil86.github.io/getspres/.

Supplementary information

Supplementary data are available at Bioinformatics online.

1 Introduction

The conservative nature of the traditional random-effects model (RE), which assumes the presence of heterogeneity under the null, has contributed to the dominance of fixed-effect (FE) meta-analysis methods in the discovery of small-effect variants (per-allele disease odds ratios <1.2 or trait variance <0.2%) (Bush and Moore, 2012; Yang et al., 2011) even at heterogeneous loci. A modification of the traditional random-effects method, RE2, was designed to detect genetic associations both in the presence and absence of heterogeneity, to provide an opportunity to identify small-effect heterogeneous variants that might go unnoticed in a FE meta-analysis (Han and Eskin, 2011). Most users of the RE2 random-effects method employed it to refine associations at significant and suggestive genetic signals identified in FE meta-analyses (Sapkota et al., 2015, 2017; Wyss et al., 2018); in its latest iteration, RE2C (Lee et al., 2017) it reports a subset of variants detected by RE2 where PRE2 PFE. The RE2C update is intended to have a broad application beyond the augmentation of summary association P-values of variants identified in FE meta-analysis to the discovery of additional and potentially novel loci. The RE2 and by extension RE2C random-effects method’s power advantage over traditional fixed and random-effects meta-analysis models is partly attributable to a relaxed null hypothesis, which assumes homogeneity of genetic effects under the null and thereby provides a greater contrast between the null and alternative hypotheses H0: μ=0 and τ2=0 versus H1: μ0 or τ2>0; asymptotically.

Heterogeneity of genetic effects might arise from biologically relevant differences among contributing studies in a meta-analysis, such as diverse: ancestries, linkage disequilibrium patterns, sub-phenotypes, ages-of disease onset, family-history of disease or gender. Alternatively, differences in the direction and/or size of genetic effect-estimates among participating studies in a meta-analysis could reflect genotyping error or population structure (i.e. local admixture), where, for example, the average genetic effect estimate at a variant of interest is inflated by a few outlier studies showing outsized effects while the majority of study effects are marginal. Heterogeneity at individual variants can be explored through forest plots and the calculation of standardized predicted random-effects (SPREs), while heterogeneity patterns across multiple variants can be conveniently inspected through the calculation of M statistics (Magosi et al., 2017). Notably, SPREs are precision weighted residuals that indicate the direction and extent with which individual studies in a meta-analysis deviate from the average genetic effect (Harbord and Higgins, 2008; Magosi et al., 2017), and can be a useful quantitative indicator of whether the average genetic effect at a variant of interest might be unduly influenced by outlier studies showing extreme effects.

In this report, we revisit the CARDIoGRAMplusC4D meta-analysis (60 801 cases and 123 504 controls) of coronary artery disease (CAD) with the RE2C random-effects method, to search for additional CAD loci potentially masked by heterogeneity in the primary FE meta-analysis.

2 Materials and methods

2.1 GWAS datasets

2.1.1 CARDIoGRAMplusC4D

Summary data (i.e. logistic regression coefficients and their corresponding standard errors) were collated from 48 genome-wide association studies of coronary disease risk that comprised individuals from 6 different ancestry groups including: African (n = 1) and Hispanic American (n = 1), East (China and Korea, n = 3) and South (India and Pakistan, n = 4) Asian, Middle Eastern (Lebanese, n = 1) and European (n = 38); meta-analysis was conducted for a set of ∼ 9 million variants with minor allele frequencies >0.005 (CARDIoGRAMplusC4D Consortium, 2015). Design details of each participating CARDIoGRAMplusC4D study are summarized in Supplementary Table S1; the coronary disease phenotype included patients with an inclusive CAD diagnosis (e.g. myocardial infarction, acute coronary syndrome, chronic stable angina or coronary stenosis >50%). Study-level genomic correction (Devlin and Roeder, 1999) was applied to each study to minimize false positives induced by inflated association test statistics. Variant effect-size estimates (β coefficients scaled as loge(odds ratios) from an additive-effects-only association model) in each study were aligned such that the same risk allele was compared across the studies assembled in the meta-analysis. The studies contributing to the CARDIoGRAMplusC4D study obtained ethical approval from the ethics committees of the respective medical faculties, and informed consent was obtained from all participants, summary genetic association data were anonymously meta-analysed and reported here. Membership of the CARDIoGRAMplusC4D Consortium is provided in the Supplementary Text S1. Requests for access to the summary statistics are coordinated by the CARDIoGRAMplusC4D Steering Committee (www.cardiogramplusc4d.org).

2.1.2 UK Biobank

The UK Biobank study (UKBB) is a large-scale prospective study of over half a million participants commissioned to assemble comprehensive data on genotypic, socio-demographic, lifestyle and environmental factors with the aim of developing better strategies for the prevention, diagnosis and treatment of common diseases (Sudlow et al., 2015) such as cardiovascular disease (Littlejohns et al., 2019). Data from an interim release of GWAS genotypes for 296 525 participants were previously merged and analysed with clinical phenotype data that identified 34 541 cases of coronary heart disease and 261 984 controls from England, Scotland and Wales aged 45–69 years (van der Harst and Verweij, 2018). Coronary disease case status was assigned to prevalent and incident cases of myocardial infarction, acute coronary syndromes and associated therapeutic interventions (e.g. revascularization). Association summary statistics (β coefficients scaled as loge(odds ratios) and associated standard errors from an additive-effects-only logistic regression association model) from this analysis were downloaded from the www.cardiomics.net server. Design details of the UK Biobank participants to compare with the CARDIoGRAMplusC4D cohorts are included in Supplementary Table S1.

2.2 RE2 and RE2C meta-analysis

Genetic association meta-analyses are typically performed under a RE when the objective is both to estimate a summary effect (i.e. average genetic effect) across studies in a meta-analysis and measure the amount of heterogeneity. Consider a meta-analysis comprising S studies (s=1, 2, 3,, S) where the genetic effect-size estimate and corresponding standard error of a variant of interest were obtained via regression modelling in each study, and the average genetic effect estimate, μ^ calculated as the inverse-variance weighted mean of the individual study effects:  
μ^= s=1Swsyss=1Sws 
(1)
where ys represents the study effect-size estimate in the sth study, and ws denotes the weight assigned to the sth study, which can be calculated as ws=1σs2+ τ^2. Notably, σs2 and τ^2 represent sampling variance and heterogeneity, respectively.

2.2.1 Traditional RE

The traditional RE tests the null hypothesis that the average genetic effect, μ is zero that is, H0:μ=0 versus H1:μ0, and its summary association test statistic under the null is given by, ZRE2=(μ^SE(μ^))2  χ12 (asymptotically) (Neupane et al., 2012).

2.2.2 Contemporary random-effects model (RE2)

In contrast to the traditional RE which assumes the presence of heterogeneity under the null the RE2 model tests the null hypothesis that the average genetic effect is zero and there is no heterogeneity; that is, H0:μ=0 and τ2=0 versus H1:μ0 or τ2>0 (asymptotically) (Han and Eskin, 2011; Neupane et al., 2012). The summary association test statistic (or likelihood ratio test statistic) for the RE2 model under the null is denoted by:  
SRE2= ZRE (new)2=2log(λ)        =2log(L0(0,0)L1(µ^,^τ2)),
(2)
and approximates a 50:50 mixture of χ12 and χ22 asymptotically in meta-analyses with larger numbers of studies. For meta-analyses with fewer studies (2–50), Han and Eskin provide tabulated RE2 P-values corrected for small sample-size based on the assumption that the studies are equally weighted (i.e. same sample-size). The asymptotic RE2 summary association P-value is denoted by:  
P*RE2=0.5· Pχ12  SRE2+0.5 · Pχ22  SRE2,
after a correction for small samples, the RE2 summary association P-value is given by:  
PRE2= λN,SRE2· P*RE2,
where λN,SRE2 is the small-sample correction factor (Lee et al., 2017).

2.2.3 Updated RE2 model (RE2C)

The RE2C approach is an adaptation of the RE2 model designed to: (i) facilitate discovery of small-effect heterogeneous variants and (ii) minimize redundancy between genetic variants identified by the FE and RE2 models; as it is commonplace to perform an FE analysis prior to a random-effects analysis when conducting genetic association meta-analyses. To reduce redundancies between RE2 and FE analyses the RE2C approach partitions summary association P-values produced by the RE2 model into two groups assigning variants with RE2 P-value ≤ FE P-value the RE2 summary association statistic, SRE2 and zero otherwise (Lee et al., 2017):  
SRE2C=SRE2 if PRE2PFE0 if PRE2>PFE . 
(3)
In contrast to the RE2 summary association statistic the RE2C statistic, SRE2C does not approximate a ‘well-known’ asymptotic distribution; to calculate RE2C P-values the RE2 summary association statistic is decomposed into two component statistics, the first, SFE is equal to the square of the FE summary association statistic, ZFE2 and asymptotically approximates χ12 under the null. The second, SHet tests for the presence of heterogeneity akin to the Q-test of heterogeneity and asymptotically approximates a 50:50 mixture of 0 and χ12 when the number of studies in a meta-analysis is large, for smaller meta-analyses, Lee et al. (2017) provide tabulated empirical distributions of SHet. For each SFE, the RE2C approach searches for SHet such that PRE2  PFE and the resulting lower boundary of SHet is referred to as, SHet.low(SFE, N) where N is the number of studies. Then for an observed RE2C statistic, SRE2Ĉ the range of SFE is divided into K small bins (xi = 1, 2, 3,, K) (e.g. 1000 bins in [0, 50]) and the RE2C summary association P-value is approximated by:  
PRE2C  i=1kP SHet>maxSRE2Ĉ- xi, SHet.lowxi, N·χ12xi·Δx,
such that, PRE2C< PRE2 while PRE2  PFE and where Δx denotes the width of the bins (Lee et al., 2017).

2.3 Evaluation of heterogeneity for individual variants and M statistics

2.3.1 Calculation of SPRE statistics

Standardized predicted random-effect statistics are precision-weighted residuals that capture the direction and extent with which individual genetic effects of studies in a meta-analysis deviate from the average genetic effect at a variant of interest. Consider a genetic association meta-analysis (P), comprising S GWAS (s=1, 2, 3,, S) and V independently associated lead variants (v=1, 2, 3,, V). At each lead variant, study effect-size estimates (and the corresponding standard errors) are analysed with a RE to estimate the average genetic effect and separate the variability observed among study effects into random sampling variation and between-study heterogeneity. A SPRE is then computed for each lead variant such that the SPRE for the vth lead variant in the sth study is:  
SPREsv=ysv- θvσsv2+ τ^v2- spsv2 
(4)
This yields an array of SPREs,  
PS,V=  SPRE1,1 SPRE1,2  SPRE1,v SPRE2,1SPRE2,2  SPRE2,v        SPREs,1 SPREs,2  SPREs,v 
(5)
that can be exploited to reveal systematic genetic differences among studies in the meta-analysis. Specifically, SPREs can be aggregated by study to expose outlier studies showing either consistently stronger or weaker than average genetic effects.

2.3.2 Calculation of M statistics—aggregation of SPREs

SPRE statistics can be aggregated in a variety of ways, a simple approach that both identifies systematic outliers and reveals their direction of effect is to calculate the ‘mean’ aggregate heterogeneity statistic, M. M statistics are computed by calculating the arithmetic mean of SPREs within each study in a meta-analysis so that each study has a single M statistic value and the M statistic value for the sth study is represented by:  
Ms= 1Vv=1VSPREsv.
(6)
Assuming the SPREs of lead variants in each study are mutually independent standard normal random variables, that is  
SPRE  Φ0, 1,
with mean: ESPRE= 0 and variance: VarSPRE= 1 then M is normally distributed,  
Ms  Φ0, 1V,
with mean: EMs= V× 1V × μ =0 and variance: VarMs= V× (1V)2 × σ2 = 1V.

2.3.3 Q-statistic and heterogeneity index

Heterogeneity was also assessed using the Q-statistic (Cochran, 1954) and the heterogeneity index (I2) measure (Higgins and Thompson, 2002); I2 was further used to quantify heterogeneity in M statistics.

3 Results

3.1 RE2C association analysis

Of 9 455 778 variants in a RE2C meta-analysis of 48 CARDIoGRAMplusC4D studies, 4645 showed genome-wide significant associations with coronary disease (PRE2C<5×10−8), yielding 382 loci where lead variants were centered on a genetic distance window of ± 0.5 cM (Table 1).

Table 1.

A summary of RE2C association results from the CARDIoGRAMplusC4D meta-analysis of coronary disease

Description
Number of variants examined in the CARDIoGRAMplusC4D meta-analysis of coronary disease 9 455 778 
Number of variants significantly associated with coronary disease under the RE2C method (PRE2C<5×10−84645 
Number of loci obtained after grouping the 4645 significantly associated variants by a genetic distance window of ±0.5 cM around each lead variant 382 
Number of lead variants that replicated in the UK Biobank (UKBB) prospective study (PUKBB<5×10−524 
Description
Number of variants examined in the CARDIoGRAMplusC4D meta-analysis of coronary disease 9 455 778 
Number of variants significantly associated with coronary disease under the RE2C method (PRE2C<5×10−84645 
Number of loci obtained after grouping the 4645 significantly associated variants by a genetic distance window of ±0.5 cM around each lead variant 382 
Number of lead variants that replicated in the UK Biobank (UKBB) prospective study (PUKBB<5×10−524 

cM, centiMorgans.

Table 1.

A summary of RE2C association results from the CARDIoGRAMplusC4D meta-analysis of coronary disease

Description
Number of variants examined in the CARDIoGRAMplusC4D meta-analysis of coronary disease 9 455 778 
Number of variants significantly associated with coronary disease under the RE2C method (PRE2C<5×10−84645 
Number of loci obtained after grouping the 4645 significantly associated variants by a genetic distance window of ±0.5 cM around each lead variant 382 
Number of lead variants that replicated in the UK Biobank (UKBB) prospective study (PUKBB<5×10−524 
Description
Number of variants examined in the CARDIoGRAMplusC4D meta-analysis of coronary disease 9 455 778 
Number of variants significantly associated with coronary disease under the RE2C method (PRE2C<5×10−84645 
Number of loci obtained after grouping the 4645 significantly associated variants by a genetic distance window of ±0.5 cM around each lead variant 382 
Number of lead variants that replicated in the UK Biobank (UKBB) prospective study (PUKBB<5×10−524 

cM, centiMorgans.

This compares with the conventional FE meta-analysis that revealed 2213 GWAS (PFE<5×10−8) variants in 46 loci, and an RE2 analysis that afforded 5942 GWAS (PRE2<5×10−8) variants in 406 loci (Fig. 1).

Fig. 1.

A flowchart summarizing meta-analysis genetic association results under the RE2 and RE2C random-effects models and the traditional fixed-effect (FE) method

Fig. 1.

A flowchart summarizing meta-analysis genetic association results under the RE2 and RE2C random-effects models and the traditional fixed-effect (FE) method

3.2 Single-variant heterogeneity analysis of 382 novel RE2C loci

Most (85.6%) of the lead variants showed marked heterogeneity (Q-statistic P <1×10−7), with at least half of the lead variants showing relatively high levels of heterogeneity (I2>72.1%) (Supplementary Table S2). Next, we calculated SPREs and generated forest plots to inspect heterogeneity patterns at lead variants of the 382 RE2C loci. Most (90%) of the RE2C lead variants had one or more outlier studies where genetic effect-size estimates deviated substantially (SPRE > 3σ) from the average genetic effect. This empirical threshold to flag overly influential outliers (SPRE > 3σ) was informed by rs2891168 (chromosome 9p21) in the primary CARDIoGRAMplusC4D meta-analysis, where this well-established locus had max SPRE=2.87 (CARDIoGRAMplusC4D Consortium, 2015). An inspection of forest plots for the 382 RE2C lead variants revealed heterogeneity patterns that were grouped into three categories (Supplementary Fig. S1). Most (n = 323) of the lead variants fell in the first category where at least one study showed outsized effects while the majority of the studies showed minimal effects (Supplementary Fig. S2 and Table S3). Lead variants (n = 28) in the second category generally showed heterogeneity patterns with outlier studies showing contrasting effects, in particular the forest plots showed both positive outlier studies (SPRE > +3σ) with the potential to inflate the average genetic effect as well as negative outliers (SPRE< -3σ) that might lower or change the direction of the mean genetic effect, a scenario where dropping either type of outlier would likely induce a false positive or negative signal (Supplementary Fig. S3 and Table S3). The final category comprised 31 lead variants where there was little evidence of overly influential outlier studies consistent with heterogeneity patterns plausibly induced by biologically relevant differences (Supplementary Fig. S4 and Table S3). A general trend that emerged from inspecting heterogeneity patterns at the individual RE2C lead variants was that RE2C P-values became more extreme (i.e. smaller) with increasing levels of heterogeneity (Supplementary Table S2).

3.3 M Statistic, multi-variant heterogeneity analysis

A multi-variant heterogeneity analysis across the 382 RE2C lead variants revealed five significant outlier studies (14, 15, 16, 17 and 18) that systematically showed stronger than average effects (Bonferroni-corrected M statistic P-values <0.05) (Supplementary Fig. S5). A meta-regression of the M statistics found no evidence of systematic heterogeneity patterns due to differences in ancestry, age-of CAD onset and CAD family-history (Supplementary Table S4), design factors that were prominent in our previous analysis of the CARDIoGRAMplusC4D data (Magosi et al., 2017) using lead variants for 46 published loci (CARDIoGRAMplusC4D Consortium, 2015). We note that studies 15, 16, 17 and 18 showed relatively high genomic inflation (1.08<λ<1.38) prior to study-level genomic correction and a meta-regression of the M statistics confirmed varying levels of genomic inflation among contributing studies in the CARDIoGRAMplusC4D meta-analysis as a significant explanatory factor (F4,43=16.68, P=2.52 × 10-8, adjusted-R2=70.55%, I2=88.28%) (Supplementary Table S4).

3.4 Replication in the UK Biobank study

We next explored whether genetic associations between lead variants at the novel RE2C loci and CAD risk could be replicated in a large-scale prospective study based on 296 525 participants (including 34 541 cases of coronary heart disease) from England, Scotland and Wales aged 45–69 years (van der Harst and Verweij, 2018). Only 24 of the 323 RE2C lead variants available in the UK Biobank GWAS were replicated (PUKBB < 5 × 10−5, Supplementary Table S5). All but 3 of the replicated genetic signals had traditional FE meta-analysis P-values that were significant at genome-wide levels (PFE < 5 × 10−8) and just 2 of the 24 showed marked heterogeneity (I2>0.5) (Supplementary Table S5). Furthermore, 3 replicated variants included an influential outlier study in the CARDIoGRAMplusC4D meta-analysis, these 3 variants were also GWAS-significant in the FE meta-analysis (Supplementary Tables S2 and S5). These findings are consistent with Han and Eskin’s (2011) observation that the power of RE2 only exceeded FE meta-analysis for markedly heterogeneous variants.

Finally, a meta-regression model of M statistics for 323 RE2C lead variants in a combined CARDIoGRAMplusC4D and UK Biobank meta-analysis confirmed genomic control inflation as a potential source of systematic heterogeneity in genetic meta-analyses (Supplementary Table S6 and Fig. S6).

4 Discussion

Our application of the RE2C method to the CARDIoGRAMplusC4D meta-analysis dataset highlights the high sensitivity but low specificity of the method as a discovery tool for small-effect heterogeneous genetic associations. Consequently the practical advantage afforded by the improved power of the RE2C method will likely be in augmenting P-values for putative loci highlighted by traditional fixed and random-effects meta-analyses.

Beyond variants that would have otherwise been detected through a traditional FE meta-analysis approach, 21 lead variants that were associated with CAD under the RE2C method (PRE2C < 5 × 10−8) were suggestively associated under the traditional FE method (5×10−8 < PFE  5 × 10−5); and 2 (rs12509595, rs62181365) of these were part of the group of RE2C lead variants that replicated in the UKBB analysis while the remaining 19 fell below the replication threshold (PUKBB<5×10−5) (Supplementary Tables S2 and S5). Of the list of 24 significant RE2C replicated variants in the UKBB analysis, a single lead variant (rs662799) on chromosome 11 showed neither significant nor suggestive association with CAD under the traditional FE method (Q-statistic P =2.4×10−4, I2 = 47%, PFE = 1.28×10−4) (Supplementary Table S4 and Fig. S7). Notably, rs662799 maps to the APOA1-C3-A4-A5 locus, immediately upstream of APOA5, a locus that is strongly associated with higher triglyceride levels (TG) and lower HDL cholesterol (HDL-C) in individuals of East Asian and European ancestry (βTG=0.081, SE(βTG)=0.003, P=4.18 × 10-213; βHDL-C=-2.516, SE(βHDL-C)=0.126, P=1.84 × 10-85) (Lu et al., 2016; Spracklen et al., 2017). APOA5 is a ‘well-known’ CAD-associated locus (e.g. rs964184; CARDIoGRAM Consortium et al., 2011), thus the rs662799 CAD association detected in this RE2C analysis represents a confident positive assignment that can guide future functional genomic experiments to identify the underlying causal variants(s).

Altogether, the majority (n = 331) of lead variants discovered in the CARDIoGRAMplusC4D meta-analysis by the RE2C random-effects method fell outside the scope of tentatively associated CAD risk variants (PFE>5×10−5) (Supplementary Table S2). Significant P-values under the RE2 and RE2C models can represent a non-null average genetic effect and/or considerable heterogeneity (H0: μ=0 and τ2=0 versus H1: μ0 or τ2>0; asymptotically) (Neupane et al., 2012). Therefore, the genome-wide significant RE2C P-values at the 277 lead variants where genetic associations with CAD were irreproducible in the UKBB dataset (PUKBB>5×10−5) and where PFE>5×10−5, likely signify substantial heterogeneity of genetic effects at the individual variants rather than novel CAD signals.

Small-effect genetic associations at variants with relatively high heterogeneity might elicit skepticism regarding the potential reproducibility of such associations. However, there are notable exceptions within the coronary disease landscape, such as rs2891168, the lead variant for the chromosome 9p21 CAD risk locus in the CARDIoGRAMplusC4D data (2015) that shows substantial heterogeneity (Q-statistic P <4.2×10−7; I2= 58%) but with no exceptional outlier studies (i.e. SPRE< 2.6σ), a heterogeneity pattern typified in Supplementary Figure S4. rs2891168 tags one of the strongest associated loci in CARDIoGRAMplusC4D (odds ratio = 1.2, P <2×10−98), a meta-analysis dataset heavily weighted by European (69%), South Asian (20%) and East Asian (7%) data (Supplementary Table S1). Other tagging variants for this locus in strong linkage disequilibrium have been convincingly validated to show comparable strength associations with CAD risk in some non-European populations (e.g. India and Pakistan, Coronary Artery Disease (C4D) Genetics Consortium, 2011; Han Chinese, Lu et al., 2012; multi-ethnic cohorts from East Asia, Han et al., 2017) but not for instance, and to our knowledge in populations of African ancestry. The latter are poorly represented in CARDIoGRAMplusC4D (African Americans form ∼1% of the total data), limiting opportunities to judge the informativity or otherwise of individual loci in this meta-analysis dataset.

Based on our experience of applying RE2C to the CARDIoGRAMplusC4D dataset, we recommend as best practice that reports of small-effect heterogeneous loci discovered with this method be accompanied by forest plots and SPRE statistics to explore the distribution of genetic effect estimates across participating studies. This can highlight overly influential outlier studies with the potential to inflate genetic signals prompting researchers to reflect upon the underlying data that gave rise to novel heterogeneous associations.

Acknowledgements

We are grateful to the CARDIoGRAMplusC4D collaborators (http://www.cardiogramplusc4d.org) for their support during this work.

Funding

This research was supported by a Wellcome Trust core award (090532/Z/09/Z and 203141/Z/16/Z, M.F.), The British Heart Foundation (FS/14/55/30806, J.C.H.), the BHF Centre of Research Excellence, Oxford (RE/13/1/30181, M.F. and J.C.H.), the Government of Botswana (L.E.M.), the European Union Seventh Framework programme (HEALTH-F2-2013-60145, A.G.) and the Wellcome Trust Institutional strategic support fund (M.F.). A.G. participates in the TriPartite Immunometabolism Consortium (TrIC) supported by the Novo Nordisk Foundation (NNF15CC0018486). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the article.

Conflict of Interest: none declared.

References

Bush
W.S.
,
Moore
J.H.
(
2012
)
Chapter 11: genome-wide association studies
.
PLoS Comput. Biol
.,
8
,
e1002822.

CARDIoGRAM Consortium et al.  (

2011
)
Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease
.
Nat. Genet
.,
43
,
333
338
.

CARDIoGRAMplusC4D Consortium. (

2015
)
A comprehensive 1000 genomes-based genome-wide association meta-analysis of coronary artery disease
.
Nat. Genet
.,
47
,
1121
1130
.

Cochran
W.G.
(
1954
)
The combination of estimates from different experiments
.
Biometrics
,
10
,
101
129
.

Coronary Artery Disease (C4D) Genetics Consortium. (

2011
)
A genome-wide association study in Europeans and South Asians identifies five new loci for coronary artery disease
.
Nat. Genet
.,
43
,
339
344
.

Devlin
B.
,
Roeder
K.
(
1999
)
Genomic control for association studies
.
Biometrics
,
55
,
997
1004
.

Han
B.
,
Eskin
E.
(
2011
)
Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies
.
Am. J. Hum. Genet
.,
88
,
586
598
.

Han
Y.
et al.  (
2017
)
Genome-wide association study identifies a missense variant at APOA5 for coronary artery disease in Multi-Ethnic Cohorts from Southeast Asia
.
Sci. Rep
.,
7
,
17921.

Harbord
R.M.
,
Higgins
J.
(
2008
)
Meta-regression in Stata
.
Stata J
.,
8
,
493
519
.

Higgins
J.P.T.
,
Thompson
S.G.
(
2002
)
Quantifying heterogeneity in a meta-analysis
.
Stat. Med
.,
21
,
1539
1558
.

Lee
C.
et al.  (
2017
)
Increasing the power of meta-analysis of genome-wide association studies to detect heterogeneous effects
.
Bioinformatics
,
33
,
i379
i388
.

Littlejohns
T.J.
et al.  (
2019
)
UK Biobank: opportunities for cardiovascular research
.
Eur. Heart J.
,
40
,
1158
1166
.

Lu
X.
et al.  (
2012
)
Genome-wide association study in Han Chinese identifies four new susceptibility loci for coronary artery disease
.
Nat. Genet
.,
44
,
890
894
.

Lu
X.
et al.  (
2016
)
Genetic susceptibility to lipid levels and lipid change over time and risk of incident hyperlipidemia in Chinese populations
.
Circ. Cardiovasc. Genet
.,
9
,
37
44
.

Magosi
L.E.
et al.  (
2017
)
Identifying systematic heterogeneity patterns in genetic association meta-analysis studies
.
PLoS Genet
.,
13
,
e1006755
.

Neupane
B.
et al.  (
2012
)
Meta-analysis of genetic association studies under heterogeneity
.
Eur. J. Hum. Genet
.,
20
,
1174
1181
.

Sapkota
Y.
et al.  (
2015
)
Association between endometriosis and the interleukin 1A (IL1A) locus
.
Hum. Reprod
.,
30
,
239
248
.

Sapkota
Y.
et al.  (
2017
)
Meta-analysis identifies five novel loci associated with endometriosis highlighting key genes involved in hormone metabolism
.
Nat. Commun
.,
8
,
15539.

Spracklen
C.N.
et al.  (
2017
)
Association analyses of East Asian individuals and trans-ancestry analyses with European individuals reveal new loci associated with cholesterol and triglyceride levels
.
Hum. Mol. Genet
.,
26
,
1770
1784
.

Sudlow
C.
et al.  (
2015
)
UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age
.
PLoS Med
.,
12
,
e1001779
.

van der Harst
P.
,
Verweij
N.
(
2018
)
Identification of 64 novel genetic loci provides an expanded view on the genetic architecture of coronary artery disease
.
Circ. Res
.,
122
,
433
443
.

Wyss
A.B.
et al.  (
2018
)
Multiethnic meta-analysis identifies ancestry-specific and cross-ancestry loci for pulmonary function
.
Nat. Commun
.,
9
,
2976.

Yang
J.
et al.  (
2011
)
Genome partitioning of genetic variation for complex traits using common SNPs
.
Nat. Genet
.,
43
,
519
525
.

Author notes

The authors wish it to be known that, in their opinion, Jemma C. Hopewell and Martin Farrall should be regarded as Joint Last Authors.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Associate Editor: Alfonso Valencia
Alfonso Valencia
Associate Editor
Search for other works by this author on:

Supplementary data