-
PDF
- Split View
-
Views
-
Cite
Cite
Claudia P Cabrera, Fu Liang Ng, Hannah L Nicholls, Ajay Gupta, Michael R Barnes, Patricia B Munroe, Mark J Caulfield, Over 1000 genetic loci influencing blood pressure with multiple systems and tissues implicated, Human Molecular Genetics, Volume 28, Issue R2, 15 October 2019, Pages R151–R161, https://doi.org/10.1093/hmg/ddz197
- Share Icon Share
Abstract
High blood pressure (BP) remains the major heritable and modifiable risk factor for cardiovascular disease. Persistent high BP, or hypertension, is a complex trait with both genetic and environmental interactions. Despite swift advances in genomics, translating new discoveries to further our understanding of the underlying molecular mechanisms remains a challenge. More than 500 loci implicated in the regulation of BP have been revealed by genome-wide association studies (GWAS) in 2018 alone, taking the total number of BP genetic loci to over 1000. Even with the large number of loci now associated to BP, the genetic variance explained by all loci together remains low (~5.7%). These genetic associations have elucidated mechanisms and pathways regulating BP, highlighting potential new therapeutic and drug repurposing targets. A large proportion of the BP loci were discovered and reported simultaneously by multiple research groups, creating a knowledge gap, where the reported loci to date have not been investigated in a harmonious way. Here, we review the BP-associated genetic variants reported across GWAS studies and investigate their potential impact on the biological systems using in silico enrichment analyses for pathways, tissues, gene ontology and genetic pleiotropy.
Introduction
Cardiovascular diseases (CVDs), including stroke, renal and heart disease, represent the largest cause of global mortality. The Global Burden of Diseases, Injuries and Risk Factor study calculated ~10.4 million deaths that can be attributed to high systolic BP (SBP) (1) and it is estimated that 25% of the adults have elevated BP (2). To compound this, hypertension is mostly asymptomatic, resulting in the disease being potentially unnoticed until a life-threatening event such as a heart attack or stroke affects the individual.
Individuals with family history of hypertension diagnosed before 55 years of age have been found to have an associated risk to hypertension (odds ratio 2.10 and 1.33 for affected parents and grandparents, respectively). These findings reinforce the hypothesis of genetic predisposition to high BP, independent from the environmental factors (3). BP is a complex polygenic trait, even discussed as a probable omnigenic trait in previous studies (4). The heritability of the common genetic variation in BP has been observed to be ethnicity-dependent. A study in ~8900 European ancestry individuals estimated the heritability of SBP to be ~20% and ~50% for diastolic BP (DBP), while heritability in African ancestry individuals (n = 2860) was estimated at ~27% and ~39%, respectively (5). While the expected heritability is between 20–50%, the recent analyses indicated that the BP variants identified thus far explains only 27% of the genetic contribution (4), suggesting many loci are yet to be identified.
Here, we review the BP genetic associations unravelled by genome-wide association studies (GWAS), their implications at a pathway and systems level, and investigate the pleiotropy effects of reported BP loci. Collectively, these discoveries may bring new mechanistic insights to the treatment of hypertension.
BP and GWAS
Our understanding of genomic regions linked with BP was rapidly expanded with increasingly large and sophisticated GWAS. Up to 2015, there were 64 validated BP loci reported (6–22). Four years later, the number of known BP loci has increased to 1477 (4,23–30), including 1214 lead signals and 263 secondary signals [Supplementary Material, Table S1; (a) lead BP loci; (b) secondary BP loci]. The substantial rise in the reported loci is driven by the collaborative work of international consortia, combined with the availability of public datasets such as UK Biobank (UKB) (31). The UKB data has provided a resource of standardised genotypic and phenotypic data from an unprecedented population size (n = 502 620). Other large key resources include the US Million Veteran Program (n ~ 318 891) (32), the Genetic Epidemiology Research on Adult Health and Aging cohort (n ~ 100 000) (33) and the International Consortium of Blood Pressure (ICBP) (29). To date, the ICBP consortium has accumulated 77 cohorts which include ~299 024 individuals. These studies have paved the way to the biggest analysis reported to date including 1 million people and unravelling 535 novel loci in a single study (4). Figure 1 shows the timeline of some of the key BP GWAS in the last decade and their validated loci.

Timeline of validated BP GWAS associations of the last 10 years. The total sample size (left vertical axis) used for the discovery analysis is represented by the blue long dotted line, while the replication sample size represented by the green short dotted line. The total number of loci is represented by the bars (right vertical axis). The yellow bars represent the number of novel loci reported without independent replication and in pink the total number of novel loci with replication. The numbers in parenthesis after the study (horizontal axis) show the numbers for the novel without replication and the novel loci with replication, respectively (yellow/pink).
BP candidate genes
The common genetic variants identified as associated with BP continue to improve our understanding of the polygenic nature of BP regulation. Nevertheless, the increased number of loci—and therefore genes—makes the functional investigation of each locus impossible. At the present, there are but a handful of genes followed up by experimental studies (34). While a proportion of the candidate genes at BP loci have been fairly well characterized, many others do not have known links to BP regulation. It is the successes relating to the latter group that highlights the potential to expand our knowledge of the pathophysiology of hypertension. One such example is the transcription factor, Nuclear Receptor 2 Family 2 (NR2F2). A murine model with a mutated Nr2f2 protein resulted in lower SBP and DBP, and was more resistant to the hypertensive effects of a high-salt diet. The deletion of a five amino acid hinge region required for the protein–protein interaction with another transcription factor, Fog2 appears to reveal the potential underlying molecular mechanism (35), in particular as the Nr2f2-Fog2 interaction potentially influences expression of the vasorelaxant atrial natriuretic factor (Anf) (36).
Another candidate gene of interest is SH2B3 encoding the lymphocyte adaptor protein LNK. This regulator of cytokine signalling and cell proliferation is predominantly expressed in haematopoietic and endothelial cells. SH2B3-/- mice had exaggerated hypertensive responses to angiotensin II, and it is noteworthy that haematopoietic cells were the apparent primary driver of the observed vascular inflammation and predisposition to hypertension (37). However, extensive pleiotropy is observed in the SH2B3-ATXN2 locus, associations with myocardial infraction, multiple sclerosis, juvenile idiopathic arthritis have been reported (38–40). It should also be recognized that while most of the attention is paid to SH2B3 as the GWAS signal at rs3184504 results in a coding change within SH2B3, this signal falls within a 1-megabase block of linkage disequilibrium (LD) on chromosome 12q24 that encompases at least 15 annotated genes (8).
Other examples include genes that are within pathways with known links to BP regulation, such as ARHGAP42 (Rho GTPase-activating protein 42 gene) in the RhoA pathway. The deletion of ARHGAP42 in a murine model enhanced the hypertensive effects of both L-NAME and deoxycorticosterone acetate–salt treatments (41). This was later supported by human studies of balanced chromosomal rearrangement carriers resulting in ARHGAP42 truncation, leading to age-dependent hypertension (42). Finally, UMOD (uromodulin), recently reviewed in more detail (34), is another notable candidate gene identified by GWAS. UMOD deficient mice have shown to increase sequestration of the target of loop diuretics, the sodium-potassium-chloride co-transporter 2 (KKCC2), resulting in reduced co-transporter activity. This mimics the effect of diuretic drugs, and with the subsequent lower BP and reduced hypertensive response to increased salt intake. The pharmacological inhibition of uromodulin may have a diuretic effect that may perhaps be synergistic with existing pharmacological options (13, 43–45). This has led to a clinical trial on CVD (www.clinicaltrials.gov: NCT03354897) (46), making significant progress towards a novel therapeutic target in hypertension.
Altogether, these are examples of translating GWAS discovery to an improved understanding of biological impact, and providing promise of new therapeutic pathways. However, the identification of a true causal variant and the relevant gene product impacted is rarely straightforward. The lead single nucleotide polymorphism (SNP) typically indicates a chromosomal region usually with tens and sometimes thousands of SNPs in LD (47), but it may also mark further-away regions with long-range chromatin interactions (48). For example, following the identification of a BP-associated SNP near the ANTXR2 (anthrax toxin receptor 2) gene, ANTXR2–/– knockout rats were generated. The knockout rats exhibited similar BP to wild-type rats, at both basal and stimulated states with either angiotensin II infusion of high-salt diet (49). This can highlight the difficulty in identifying the causal variant or gene within a locus, particularly since this region also encompasses other genes of interest such as FGF5 (fibroblast growth factor 5). It is often noted that follow-up mechanistic studies are time- and resource-heavy endeavours. With this consideration, we expand on our previous in silico analyses, aiming to provide candidate genes for prioritization in future mechanistic studies.
In silico analyses: BP loci pathway and tissue enrichment
Pathway and tissue enrichment analyses can give us a snapshot on the interactions and downstream consequences of BP loci at a systems level. For the purposes of this review, we investigated all published validated BP signals (lead + secondary) and their SNPs in high LD (r2 ≥ 0.8). SNPs were annotated to the nearest gene (within 5 kb distance) using bedtools (v2.17) (50), and further characterized using ANNOVAR (51). We performed the following analyses on all the genes annotated to the BP-loci: (a) tissue enrichment using DEPICT (52); (b) gene-set enrichment on pathways using GSEA is the software name GSEA (53); and (c) a permutation based gene-set enrichment analysis on gene ontology (GO) terms using GOfuncR (54,55).
Tissue enrichment analyses—DEPICT
In this tissue enrichment analysis, blood vessels, cardiac and adipose tissues remain in the top most enriched tissues (Table 1), being similar to those presented in Evangelou et al. (4). The addition of more recently published loci (25) to the analysis, led to an increase of enrichment in mostly all tissues described in previous studies, with the endocrine glands (adrenal cortex, adrenal glands and gonads) and the urogenital system presenting the largest differences between previous studies and the current analysis. The analysis highlighting adrenal tissue is unsurprising with hyperaldosteronism being a well-established secondary cause of hypertension (56), and primary hyperaldosteronism estimated to be responsible for up to 10% of hypertension cases (57,58). The enrichment in the urogenital system appears driven by genes relating to the myometrium that predominantly consists of uterine smooth muscle cells, which share many similarities with vascular smooth muscle cells. Tissue enrichment analysis also showed for first time enrichment for the exocrine glands and tissues of the digestive system (Table 1). The underlying mechanism of the enrichment observed in the digestive system may reflect the gastro-endocrine pathways which are discussed later on.
DEPICT tissue enrichment results across all validated BP loci to date and compared with the reported enrichment in Evangelou et al. (4). Results are shown for tissue enrichments with an FDR < 0.05 ordered by nominal –log10 P-value
Name . | Mesh second level term . | Nominal -log10(P-value) . | Evangelou et al. (4)-log10(P-value) . |
---|---|---|---|
Myometrium | Genitalia | 14.66 | 9.50 |
Arteries | Blood vessels | 13.72 | 10.84 |
Cartilage | Cartilage | 10.53 | 9.03 |
Adipose tissue | Connective tissue | 10.36 | 8.01 |
Genitalia female | Genitalia | 10.04 | 4.89 |
Subcutaneous fat | Connective tissue | 9.62 | 7.56 |
Adipose tissue white | Connective tissue | 9.62 | 7.56 |
Genitalia | Genitalia | 9.59 | 4.48 |
Uterus | Genitalia | 9.48 | 4.88 |
Joint capsule | Skeleton | 9.19 | 7.12 |
Joints | Skeleton | 9.19 | 7.12 |
Synovial membrane | Skeleton | 9.19 | 7.12 |
Subcutaneous fat abdominal | Connective tissue | 8.60 | 6.09 |
Abdominal fat | Connective tissue | 8.60 | 6.09 |
Endocrine glands | Endocrine glands | 8.38 | 3.67 |
Fallopian tubes | Genitalia | 7.86 | 3.83 |
Ovary | Genitalia | 7.76 | 3.30 |
Adnexa uteri | Genitalia | 7.75 | 3.31 |
Serous membrane | Membranes | 7.26 | 4.76 |
Gonads | Endocrine glands | 7.07 | 2.92 |
Aortic valve | Heart | 6.97 | 4.91 |
Heart valves | Heart | 6.97 | 4.91 |
Endometrium | Genitalia | 6.97 | 3.44 |
Adipocytes | Connective tissue cells | 6.03 | 4.24 |
Adrenal glands | Endocrine glands | 5.76 | 2.99 |
Adrenal cortex | Endocrine glands | 5.70 | 3.44 |
Blood vessels | Blood vessels | 5.19 | 4.85 |
Stomach | Gastrointestinal tract | 5.12 | 3.37 |
Upper gastrointestinal tract | Gastrointestinal tract | 5.09 | 3.23 |
Heart | Heart | 4.89 | 4.57 |
Stromal cells | Connective tissue cells | 4.45 | 3.56 |
Serum | Blood | 4.40 | 2.92 |
Muscle smooth | Muscles | 4.38 | 3.45 |
Heart atria | Heart | 4.17 | 3.82 |
Heart ventricles | Heart | 4.10 | 4.03 |
Veins | Blood vessels | 4.09 | 3.67 |
Lung | Lung | 3.89 | 2.40 |
Atrial appendage | Heart | 3.87 | 3.43 |
Pancreas | Pancreas | 3.70 | 3.81 |
Mesenchymal stem cells | Stem cells | 3.59 | 2.97 |
Chondrocytes | Connective tissue cells | 3.54 | 2.00 |
Fibroblasts | Connective tissue cells | 3.52 | 2.45 |
Umbilical veins | Blood vessels | 3.50 | 3.13 |
Portal system | Blood vessels | 3.50 | 3.13 |
Esophagus | Gastrointestinal tract | 3.39 | 2.00 |
Urinary bladder | Urinary tract | 3.26 | 2.84 |
Chorion | Membranes | 3.16 | 2.68 |
Extraembryonic membranes | Membranes | 3.16 | 2.68 |
Ileum | Gastrointestinal tract | 3.09 | 3.15 |
Endothelial cells | Epithelial cells | 3.09 | 2.80 |
Urinary tract | Urinary tract | 3.02 | 3.21 |
Osteoblasts | Connective tissue cells | 3.01 | 3.46 |
Exocrine glands | Exocrine glands | 2.92 | |
Kidney cortex | Urinary tract | 2.89 | 2.07 |
Prostate | Exocrine glands | 2.80 | |
Kidney | Urinary tract | 2.76 | 2.97 |
Intestine small | Gastrointestinal tract | 2.75 | 2.81 |
Genitalia male | Genitalia | 2.61 | |
Cecum | Gastrointestinal tract | 2.46 | |
Islets of langerhans | Pancreas | 2.13† | |
Cervical vertebrae | Skeleton | 2.06† | |
Spine | Skeleton | 2.01† | |
Cervix uteri | Genitalia | 1.70† | |
Skin | Skin | 1.70† | |
Lower gastrointestinal tract | Gastrointestinal tract | 1.70† |
Name . | Mesh second level term . | Nominal -log10(P-value) . | Evangelou et al. (4)-log10(P-value) . |
---|---|---|---|
Myometrium | Genitalia | 14.66 | 9.50 |
Arteries | Blood vessels | 13.72 | 10.84 |
Cartilage | Cartilage | 10.53 | 9.03 |
Adipose tissue | Connective tissue | 10.36 | 8.01 |
Genitalia female | Genitalia | 10.04 | 4.89 |
Subcutaneous fat | Connective tissue | 9.62 | 7.56 |
Adipose tissue white | Connective tissue | 9.62 | 7.56 |
Genitalia | Genitalia | 9.59 | 4.48 |
Uterus | Genitalia | 9.48 | 4.88 |
Joint capsule | Skeleton | 9.19 | 7.12 |
Joints | Skeleton | 9.19 | 7.12 |
Synovial membrane | Skeleton | 9.19 | 7.12 |
Subcutaneous fat abdominal | Connective tissue | 8.60 | 6.09 |
Abdominal fat | Connective tissue | 8.60 | 6.09 |
Endocrine glands | Endocrine glands | 8.38 | 3.67 |
Fallopian tubes | Genitalia | 7.86 | 3.83 |
Ovary | Genitalia | 7.76 | 3.30 |
Adnexa uteri | Genitalia | 7.75 | 3.31 |
Serous membrane | Membranes | 7.26 | 4.76 |
Gonads | Endocrine glands | 7.07 | 2.92 |
Aortic valve | Heart | 6.97 | 4.91 |
Heart valves | Heart | 6.97 | 4.91 |
Endometrium | Genitalia | 6.97 | 3.44 |
Adipocytes | Connective tissue cells | 6.03 | 4.24 |
Adrenal glands | Endocrine glands | 5.76 | 2.99 |
Adrenal cortex | Endocrine glands | 5.70 | 3.44 |
Blood vessels | Blood vessels | 5.19 | 4.85 |
Stomach | Gastrointestinal tract | 5.12 | 3.37 |
Upper gastrointestinal tract | Gastrointestinal tract | 5.09 | 3.23 |
Heart | Heart | 4.89 | 4.57 |
Stromal cells | Connective tissue cells | 4.45 | 3.56 |
Serum | Blood | 4.40 | 2.92 |
Muscle smooth | Muscles | 4.38 | 3.45 |
Heart atria | Heart | 4.17 | 3.82 |
Heart ventricles | Heart | 4.10 | 4.03 |
Veins | Blood vessels | 4.09 | 3.67 |
Lung | Lung | 3.89 | 2.40 |
Atrial appendage | Heart | 3.87 | 3.43 |
Pancreas | Pancreas | 3.70 | 3.81 |
Mesenchymal stem cells | Stem cells | 3.59 | 2.97 |
Chondrocytes | Connective tissue cells | 3.54 | 2.00 |
Fibroblasts | Connective tissue cells | 3.52 | 2.45 |
Umbilical veins | Blood vessels | 3.50 | 3.13 |
Portal system | Blood vessels | 3.50 | 3.13 |
Esophagus | Gastrointestinal tract | 3.39 | 2.00 |
Urinary bladder | Urinary tract | 3.26 | 2.84 |
Chorion | Membranes | 3.16 | 2.68 |
Extraembryonic membranes | Membranes | 3.16 | 2.68 |
Ileum | Gastrointestinal tract | 3.09 | 3.15 |
Endothelial cells | Epithelial cells | 3.09 | 2.80 |
Urinary tract | Urinary tract | 3.02 | 3.21 |
Osteoblasts | Connective tissue cells | 3.01 | 3.46 |
Exocrine glands | Exocrine glands | 2.92 | |
Kidney cortex | Urinary tract | 2.89 | 2.07 |
Prostate | Exocrine glands | 2.80 | |
Kidney | Urinary tract | 2.76 | 2.97 |
Intestine small | Gastrointestinal tract | 2.75 | 2.81 |
Genitalia male | Genitalia | 2.61 | |
Cecum | Gastrointestinal tract | 2.46 | |
Islets of langerhans | Pancreas | 2.13† | |
Cervical vertebrae | Skeleton | 2.06† | |
Spine | Skeleton | 2.01† | |
Cervix uteri | Genitalia | 1.70† | |
Skin | Skin | 1.70† | |
Lower gastrointestinal tract | Gastrointestinal tract | 1.70† |
FDR: False discovery rate; significant tissue enrichment threshold set at an FDR < 5%.
†MeSH terms with an FDR ≥ 1% and below 5%, all others represent an FDR < 1% MeSH term: medical subject heading term.
DEPICT tissue enrichment results across all validated BP loci to date and compared with the reported enrichment in Evangelou et al. (4). Results are shown for tissue enrichments with an FDR < 0.05 ordered by nominal –log10 P-value
Name . | Mesh second level term . | Nominal -log10(P-value) . | Evangelou et al. (4)-log10(P-value) . |
---|---|---|---|
Myometrium | Genitalia | 14.66 | 9.50 |
Arteries | Blood vessels | 13.72 | 10.84 |
Cartilage | Cartilage | 10.53 | 9.03 |
Adipose tissue | Connective tissue | 10.36 | 8.01 |
Genitalia female | Genitalia | 10.04 | 4.89 |
Subcutaneous fat | Connective tissue | 9.62 | 7.56 |
Adipose tissue white | Connective tissue | 9.62 | 7.56 |
Genitalia | Genitalia | 9.59 | 4.48 |
Uterus | Genitalia | 9.48 | 4.88 |
Joint capsule | Skeleton | 9.19 | 7.12 |
Joints | Skeleton | 9.19 | 7.12 |
Synovial membrane | Skeleton | 9.19 | 7.12 |
Subcutaneous fat abdominal | Connective tissue | 8.60 | 6.09 |
Abdominal fat | Connective tissue | 8.60 | 6.09 |
Endocrine glands | Endocrine glands | 8.38 | 3.67 |
Fallopian tubes | Genitalia | 7.86 | 3.83 |
Ovary | Genitalia | 7.76 | 3.30 |
Adnexa uteri | Genitalia | 7.75 | 3.31 |
Serous membrane | Membranes | 7.26 | 4.76 |
Gonads | Endocrine glands | 7.07 | 2.92 |
Aortic valve | Heart | 6.97 | 4.91 |
Heart valves | Heart | 6.97 | 4.91 |
Endometrium | Genitalia | 6.97 | 3.44 |
Adipocytes | Connective tissue cells | 6.03 | 4.24 |
Adrenal glands | Endocrine glands | 5.76 | 2.99 |
Adrenal cortex | Endocrine glands | 5.70 | 3.44 |
Blood vessels | Blood vessels | 5.19 | 4.85 |
Stomach | Gastrointestinal tract | 5.12 | 3.37 |
Upper gastrointestinal tract | Gastrointestinal tract | 5.09 | 3.23 |
Heart | Heart | 4.89 | 4.57 |
Stromal cells | Connective tissue cells | 4.45 | 3.56 |
Serum | Blood | 4.40 | 2.92 |
Muscle smooth | Muscles | 4.38 | 3.45 |
Heart atria | Heart | 4.17 | 3.82 |
Heart ventricles | Heart | 4.10 | 4.03 |
Veins | Blood vessels | 4.09 | 3.67 |
Lung | Lung | 3.89 | 2.40 |
Atrial appendage | Heart | 3.87 | 3.43 |
Pancreas | Pancreas | 3.70 | 3.81 |
Mesenchymal stem cells | Stem cells | 3.59 | 2.97 |
Chondrocytes | Connective tissue cells | 3.54 | 2.00 |
Fibroblasts | Connective tissue cells | 3.52 | 2.45 |
Umbilical veins | Blood vessels | 3.50 | 3.13 |
Portal system | Blood vessels | 3.50 | 3.13 |
Esophagus | Gastrointestinal tract | 3.39 | 2.00 |
Urinary bladder | Urinary tract | 3.26 | 2.84 |
Chorion | Membranes | 3.16 | 2.68 |
Extraembryonic membranes | Membranes | 3.16 | 2.68 |
Ileum | Gastrointestinal tract | 3.09 | 3.15 |
Endothelial cells | Epithelial cells | 3.09 | 2.80 |
Urinary tract | Urinary tract | 3.02 | 3.21 |
Osteoblasts | Connective tissue cells | 3.01 | 3.46 |
Exocrine glands | Exocrine glands | 2.92 | |
Kidney cortex | Urinary tract | 2.89 | 2.07 |
Prostate | Exocrine glands | 2.80 | |
Kidney | Urinary tract | 2.76 | 2.97 |
Intestine small | Gastrointestinal tract | 2.75 | 2.81 |
Genitalia male | Genitalia | 2.61 | |
Cecum | Gastrointestinal tract | 2.46 | |
Islets of langerhans | Pancreas | 2.13† | |
Cervical vertebrae | Skeleton | 2.06† | |
Spine | Skeleton | 2.01† | |
Cervix uteri | Genitalia | 1.70† | |
Skin | Skin | 1.70† | |
Lower gastrointestinal tract | Gastrointestinal tract | 1.70† |
Name . | Mesh second level term . | Nominal -log10(P-value) . | Evangelou et al. (4)-log10(P-value) . |
---|---|---|---|
Myometrium | Genitalia | 14.66 | 9.50 |
Arteries | Blood vessels | 13.72 | 10.84 |
Cartilage | Cartilage | 10.53 | 9.03 |
Adipose tissue | Connective tissue | 10.36 | 8.01 |
Genitalia female | Genitalia | 10.04 | 4.89 |
Subcutaneous fat | Connective tissue | 9.62 | 7.56 |
Adipose tissue white | Connective tissue | 9.62 | 7.56 |
Genitalia | Genitalia | 9.59 | 4.48 |
Uterus | Genitalia | 9.48 | 4.88 |
Joint capsule | Skeleton | 9.19 | 7.12 |
Joints | Skeleton | 9.19 | 7.12 |
Synovial membrane | Skeleton | 9.19 | 7.12 |
Subcutaneous fat abdominal | Connective tissue | 8.60 | 6.09 |
Abdominal fat | Connective tissue | 8.60 | 6.09 |
Endocrine glands | Endocrine glands | 8.38 | 3.67 |
Fallopian tubes | Genitalia | 7.86 | 3.83 |
Ovary | Genitalia | 7.76 | 3.30 |
Adnexa uteri | Genitalia | 7.75 | 3.31 |
Serous membrane | Membranes | 7.26 | 4.76 |
Gonads | Endocrine glands | 7.07 | 2.92 |
Aortic valve | Heart | 6.97 | 4.91 |
Heart valves | Heart | 6.97 | 4.91 |
Endometrium | Genitalia | 6.97 | 3.44 |
Adipocytes | Connective tissue cells | 6.03 | 4.24 |
Adrenal glands | Endocrine glands | 5.76 | 2.99 |
Adrenal cortex | Endocrine glands | 5.70 | 3.44 |
Blood vessels | Blood vessels | 5.19 | 4.85 |
Stomach | Gastrointestinal tract | 5.12 | 3.37 |
Upper gastrointestinal tract | Gastrointestinal tract | 5.09 | 3.23 |
Heart | Heart | 4.89 | 4.57 |
Stromal cells | Connective tissue cells | 4.45 | 3.56 |
Serum | Blood | 4.40 | 2.92 |
Muscle smooth | Muscles | 4.38 | 3.45 |
Heart atria | Heart | 4.17 | 3.82 |
Heart ventricles | Heart | 4.10 | 4.03 |
Veins | Blood vessels | 4.09 | 3.67 |
Lung | Lung | 3.89 | 2.40 |
Atrial appendage | Heart | 3.87 | 3.43 |
Pancreas | Pancreas | 3.70 | 3.81 |
Mesenchymal stem cells | Stem cells | 3.59 | 2.97 |
Chondrocytes | Connective tissue cells | 3.54 | 2.00 |
Fibroblasts | Connective tissue cells | 3.52 | 2.45 |
Umbilical veins | Blood vessels | 3.50 | 3.13 |
Portal system | Blood vessels | 3.50 | 3.13 |
Esophagus | Gastrointestinal tract | 3.39 | 2.00 |
Urinary bladder | Urinary tract | 3.26 | 2.84 |
Chorion | Membranes | 3.16 | 2.68 |
Extraembryonic membranes | Membranes | 3.16 | 2.68 |
Ileum | Gastrointestinal tract | 3.09 | 3.15 |
Endothelial cells | Epithelial cells | 3.09 | 2.80 |
Urinary tract | Urinary tract | 3.02 | 3.21 |
Osteoblasts | Connective tissue cells | 3.01 | 3.46 |
Exocrine glands | Exocrine glands | 2.92 | |
Kidney cortex | Urinary tract | 2.89 | 2.07 |
Prostate | Exocrine glands | 2.80 | |
Kidney | Urinary tract | 2.76 | 2.97 |
Intestine small | Gastrointestinal tract | 2.75 | 2.81 |
Genitalia male | Genitalia | 2.61 | |
Cecum | Gastrointestinal tract | 2.46 | |
Islets of langerhans | Pancreas | 2.13† | |
Cervical vertebrae | Skeleton | 2.06† | |
Spine | Skeleton | 2.01† | |
Cervix uteri | Genitalia | 1.70† | |
Skin | Skin | 1.70† | |
Lower gastrointestinal tract | Gastrointestinal tract | 1.70† |
FDR: False discovery rate; significant tissue enrichment threshold set at an FDR < 5%.
†MeSH terms with an FDR ≥ 1% and below 5%, all others represent an FDR < 1% MeSH term: medical subject heading term.
Gene-set enrichment analyses
We revisited the pathway enrichment analysis performed using the GSEA software we conducted in 2015, where only 81 candidate genes in only eight enriched pathways were described (6). In the updated analysis the latest curated list of 1630 candidate BP genes annotated to gene-sets highlights more than 200 enriched pathways (Supplementary Material, Table S2 GSEA). While all eight pathways enriched in 2015 remain significant, the calcium signalling pathway and the Reactome gene set of genes involved in haemostasis present the strongest increase in overlapping genes, where both gene-sets are in the top 20 highest-ranked enriched gene-sets (Table 2).
GSEA: gene set enrichment analysis. Results shown for the top 20 enriched gene sets. All sets observed with an at an FDR threshold below or equal to 0.05
Gene set name . | # Genes in gene set (K) . | Description . | # Genes in overlap (k) . | k/K . | P-value . |
---|---|---|---|---|---|
REACTOME_SIGNALING_BY_GPCR | 920 | Genes involved in signalling by GPCR | 107 | 0.1163 | 4.87E-27 |
REACTOME_GPCR_DOWNSTREAM_SIGNALING | 805 | Genes involved in GPCR downstream signalling | 98 | 0.1217 | 2.32E-26 |
KEGG_PATHWAYS_IN_CANCER | 328 | Pathways in cancer | 51 | 0.1555 | 8.23E-19 |
REACTOME_DEVELOPMENTAL_BIOLOGY | 396 | Genes involved in developmental biology | 54 | 0.1364 | 3.07E-17 |
NABA_MATRISOME | 1028 | Ensemble of genes encoding extracellular matrix and extracellular matrix-associated proteins | 95 | 0.0924 | 3.06E-17 |
REACTOME_OLFACTORY_SIGNALING_PATHWAY | 328 | Genes involved in olfactory signalling pathway | 45 | 0.1372 | 1.06E-14 |
KEGG_PROSTATE_CANCER | 89 | Prostate cancer | 23 | 0.2584 | 4.72E-14 |
REACTOME_HEMOSTASIS | 466 | Genes involved in hemostasis | 54 | 0.1159 | 3.28E-14 |
KEGG_OLFACTORY_TRANSDUCTION | 389 | Olfactory transduction | 48 | 0.1234 | 8.33E-14 |
KEGG_DILATED_CARDIOMYOPATHY | 92 | Dilated cardiomyopathy | 23 | 0.25 | 1.02E-13 |
KEGG_HYPERTROPHIC_CARDIOMYOPATHY_HCM | 85 | Hypertrophic cardiomyopathy (HCM) | 22 | 0.2588 | 1.62E-13 |
KEGG_ARRHYTHMOGENIC_RIGHT_VENTRICULAR_CARDIOMYOPATHY_ARVC | 76 | Arrhythmogenic right ventricular cardiomyopathy (ARVC) | 20 | 0.2632 | 1.45E-12 |
NABA_CORE_MATRISOME | 275 | Ensemble of genes encoding core extracellular matrix including ECM glycoproteins, collagens and proteoglycans | 38 | 0.1382 | 9.75E-13 |
KEGG_WNT_SIGNALING_PATHWAY | 151 | Wnt signalling pathway | 27 | 0.1788 | 4.32E-12 |
KEGG_CHRONIC_MYELOID_LEUKEMIA | 73 | Chronic myeloid leukemia | 19 | 0.2603 | 6.48E-12 |
KEGG_FOCAL_ADHESION | 201 | Focal adhesion | 31 | 0.1542 | 6.60E-12 |
KEGG_MAPK_SIGNALING_PATHWAY | 267 | MAPK signalling pathway | 35 | 0.1311 | 3.43E-11 |
KEGG_CALCIUM_SIGNALING_PATHWAY | 178 | Calcium signalling pathway | 28 | 0.1573 | 4.21E-11 |
KEGG_SMALL_CELL_LUNG_CANCER | 84 | Small cell lung cancer | 19 | 0.2262 | 9.28E-11 |
KEGG_ADHERENS_JUNCTION | 75 | Adherens junction | 18 | 0.24 | 1.01E-10 |
Gene set name . | # Genes in gene set (K) . | Description . | # Genes in overlap (k) . | k/K . | P-value . |
---|---|---|---|---|---|
REACTOME_SIGNALING_BY_GPCR | 920 | Genes involved in signalling by GPCR | 107 | 0.1163 | 4.87E-27 |
REACTOME_GPCR_DOWNSTREAM_SIGNALING | 805 | Genes involved in GPCR downstream signalling | 98 | 0.1217 | 2.32E-26 |
KEGG_PATHWAYS_IN_CANCER | 328 | Pathways in cancer | 51 | 0.1555 | 8.23E-19 |
REACTOME_DEVELOPMENTAL_BIOLOGY | 396 | Genes involved in developmental biology | 54 | 0.1364 | 3.07E-17 |
NABA_MATRISOME | 1028 | Ensemble of genes encoding extracellular matrix and extracellular matrix-associated proteins | 95 | 0.0924 | 3.06E-17 |
REACTOME_OLFACTORY_SIGNALING_PATHWAY | 328 | Genes involved in olfactory signalling pathway | 45 | 0.1372 | 1.06E-14 |
KEGG_PROSTATE_CANCER | 89 | Prostate cancer | 23 | 0.2584 | 4.72E-14 |
REACTOME_HEMOSTASIS | 466 | Genes involved in hemostasis | 54 | 0.1159 | 3.28E-14 |
KEGG_OLFACTORY_TRANSDUCTION | 389 | Olfactory transduction | 48 | 0.1234 | 8.33E-14 |
KEGG_DILATED_CARDIOMYOPATHY | 92 | Dilated cardiomyopathy | 23 | 0.25 | 1.02E-13 |
KEGG_HYPERTROPHIC_CARDIOMYOPATHY_HCM | 85 | Hypertrophic cardiomyopathy (HCM) | 22 | 0.2588 | 1.62E-13 |
KEGG_ARRHYTHMOGENIC_RIGHT_VENTRICULAR_CARDIOMYOPATHY_ARVC | 76 | Arrhythmogenic right ventricular cardiomyopathy (ARVC) | 20 | 0.2632 | 1.45E-12 |
NABA_CORE_MATRISOME | 275 | Ensemble of genes encoding core extracellular matrix including ECM glycoproteins, collagens and proteoglycans | 38 | 0.1382 | 9.75E-13 |
KEGG_WNT_SIGNALING_PATHWAY | 151 | Wnt signalling pathway | 27 | 0.1788 | 4.32E-12 |
KEGG_CHRONIC_MYELOID_LEUKEMIA | 73 | Chronic myeloid leukemia | 19 | 0.2603 | 6.48E-12 |
KEGG_FOCAL_ADHESION | 201 | Focal adhesion | 31 | 0.1542 | 6.60E-12 |
KEGG_MAPK_SIGNALING_PATHWAY | 267 | MAPK signalling pathway | 35 | 0.1311 | 3.43E-11 |
KEGG_CALCIUM_SIGNALING_PATHWAY | 178 | Calcium signalling pathway | 28 | 0.1573 | 4.21E-11 |
KEGG_SMALL_CELL_LUNG_CANCER | 84 | Small cell lung cancer | 19 | 0.2262 | 9.28E-11 |
KEGG_ADHERENS_JUNCTION | 75 | Adherens junction | 18 | 0.24 | 1.01E-10 |
GSEA: gene set enrichment analysis. Results shown for the top 20 enriched gene sets. All sets observed with an at an FDR threshold below or equal to 0.05
Gene set name . | # Genes in gene set (K) . | Description . | # Genes in overlap (k) . | k/K . | P-value . |
---|---|---|---|---|---|
REACTOME_SIGNALING_BY_GPCR | 920 | Genes involved in signalling by GPCR | 107 | 0.1163 | 4.87E-27 |
REACTOME_GPCR_DOWNSTREAM_SIGNALING | 805 | Genes involved in GPCR downstream signalling | 98 | 0.1217 | 2.32E-26 |
KEGG_PATHWAYS_IN_CANCER | 328 | Pathways in cancer | 51 | 0.1555 | 8.23E-19 |
REACTOME_DEVELOPMENTAL_BIOLOGY | 396 | Genes involved in developmental biology | 54 | 0.1364 | 3.07E-17 |
NABA_MATRISOME | 1028 | Ensemble of genes encoding extracellular matrix and extracellular matrix-associated proteins | 95 | 0.0924 | 3.06E-17 |
REACTOME_OLFACTORY_SIGNALING_PATHWAY | 328 | Genes involved in olfactory signalling pathway | 45 | 0.1372 | 1.06E-14 |
KEGG_PROSTATE_CANCER | 89 | Prostate cancer | 23 | 0.2584 | 4.72E-14 |
REACTOME_HEMOSTASIS | 466 | Genes involved in hemostasis | 54 | 0.1159 | 3.28E-14 |
KEGG_OLFACTORY_TRANSDUCTION | 389 | Olfactory transduction | 48 | 0.1234 | 8.33E-14 |
KEGG_DILATED_CARDIOMYOPATHY | 92 | Dilated cardiomyopathy | 23 | 0.25 | 1.02E-13 |
KEGG_HYPERTROPHIC_CARDIOMYOPATHY_HCM | 85 | Hypertrophic cardiomyopathy (HCM) | 22 | 0.2588 | 1.62E-13 |
KEGG_ARRHYTHMOGENIC_RIGHT_VENTRICULAR_CARDIOMYOPATHY_ARVC | 76 | Arrhythmogenic right ventricular cardiomyopathy (ARVC) | 20 | 0.2632 | 1.45E-12 |
NABA_CORE_MATRISOME | 275 | Ensemble of genes encoding core extracellular matrix including ECM glycoproteins, collagens and proteoglycans | 38 | 0.1382 | 9.75E-13 |
KEGG_WNT_SIGNALING_PATHWAY | 151 | Wnt signalling pathway | 27 | 0.1788 | 4.32E-12 |
KEGG_CHRONIC_MYELOID_LEUKEMIA | 73 | Chronic myeloid leukemia | 19 | 0.2603 | 6.48E-12 |
KEGG_FOCAL_ADHESION | 201 | Focal adhesion | 31 | 0.1542 | 6.60E-12 |
KEGG_MAPK_SIGNALING_PATHWAY | 267 | MAPK signalling pathway | 35 | 0.1311 | 3.43E-11 |
KEGG_CALCIUM_SIGNALING_PATHWAY | 178 | Calcium signalling pathway | 28 | 0.1573 | 4.21E-11 |
KEGG_SMALL_CELL_LUNG_CANCER | 84 | Small cell lung cancer | 19 | 0.2262 | 9.28E-11 |
KEGG_ADHERENS_JUNCTION | 75 | Adherens junction | 18 | 0.24 | 1.01E-10 |
Gene set name . | # Genes in gene set (K) . | Description . | # Genes in overlap (k) . | k/K . | P-value . |
---|---|---|---|---|---|
REACTOME_SIGNALING_BY_GPCR | 920 | Genes involved in signalling by GPCR | 107 | 0.1163 | 4.87E-27 |
REACTOME_GPCR_DOWNSTREAM_SIGNALING | 805 | Genes involved in GPCR downstream signalling | 98 | 0.1217 | 2.32E-26 |
KEGG_PATHWAYS_IN_CANCER | 328 | Pathways in cancer | 51 | 0.1555 | 8.23E-19 |
REACTOME_DEVELOPMENTAL_BIOLOGY | 396 | Genes involved in developmental biology | 54 | 0.1364 | 3.07E-17 |
NABA_MATRISOME | 1028 | Ensemble of genes encoding extracellular matrix and extracellular matrix-associated proteins | 95 | 0.0924 | 3.06E-17 |
REACTOME_OLFACTORY_SIGNALING_PATHWAY | 328 | Genes involved in olfactory signalling pathway | 45 | 0.1372 | 1.06E-14 |
KEGG_PROSTATE_CANCER | 89 | Prostate cancer | 23 | 0.2584 | 4.72E-14 |
REACTOME_HEMOSTASIS | 466 | Genes involved in hemostasis | 54 | 0.1159 | 3.28E-14 |
KEGG_OLFACTORY_TRANSDUCTION | 389 | Olfactory transduction | 48 | 0.1234 | 8.33E-14 |
KEGG_DILATED_CARDIOMYOPATHY | 92 | Dilated cardiomyopathy | 23 | 0.25 | 1.02E-13 |
KEGG_HYPERTROPHIC_CARDIOMYOPATHY_HCM | 85 | Hypertrophic cardiomyopathy (HCM) | 22 | 0.2588 | 1.62E-13 |
KEGG_ARRHYTHMOGENIC_RIGHT_VENTRICULAR_CARDIOMYOPATHY_ARVC | 76 | Arrhythmogenic right ventricular cardiomyopathy (ARVC) | 20 | 0.2632 | 1.45E-12 |
NABA_CORE_MATRISOME | 275 | Ensemble of genes encoding core extracellular matrix including ECM glycoproteins, collagens and proteoglycans | 38 | 0.1382 | 9.75E-13 |
KEGG_WNT_SIGNALING_PATHWAY | 151 | Wnt signalling pathway | 27 | 0.1788 | 4.32E-12 |
KEGG_CHRONIC_MYELOID_LEUKEMIA | 73 | Chronic myeloid leukemia | 19 | 0.2603 | 6.48E-12 |
KEGG_FOCAL_ADHESION | 201 | Focal adhesion | 31 | 0.1542 | 6.60E-12 |
KEGG_MAPK_SIGNALING_PATHWAY | 267 | MAPK signalling pathway | 35 | 0.1311 | 3.43E-11 |
KEGG_CALCIUM_SIGNALING_PATHWAY | 178 | Calcium signalling pathway | 28 | 0.1573 | 4.21E-11 |
KEGG_SMALL_CELL_LUNG_CANCER | 84 | Small cell lung cancer | 19 | 0.2262 | 9.28E-11 |
KEGG_ADHERENS_JUNCTION | 75 | Adherens junction | 18 | 0.24 | 1.01E-10 |
The concept of pathways involving G-protein coupled receptors in BP regulation is unsurprising. Among others, this includes receptors for adrenaline, endothelin, cholinergic transmission, serotonin and histamine. However, there is also a notable contribution from gastro-endocrine regulation, such as gastric inhibitory polypeptide (GIP) and its receptor (GIPR), the cholecystokinin B receptor (CCKBR) and peptide YY (PYY) highlighting the interplay between BP and other metabolic processes. From this selection of potential gastro-endocrine therapeutic targets, perhaps GIP/GIPR is the closest to translation to patient care with the recent development of a subcutaneous dual GIP/GLP1 agonist (59), currently intended for use in the management of diabetes. The role of GIP in the vasculature has been reviewed in detail by Pujadas and Drucker (60). In summary, various in vitro studies have shown GIP as pro-proliferative for endothelial cells (61), with increased endothelin-1 secretion (62). In a study of the development of experimental atherosclerosis, GIP infusion reduced lesion formation. Interestingly, GIP infusion did not affect BP in this murine model (63). This is balanced against limited evidence in healthy volunteer studies of GIP influencing blood flow in some vascular beds during hyperglycaemic phases of a two-step euglycaemic-hyperglycaemic clamp study (64).
There is also a collection of candidate genes involving G-protein coupled receptor pathways that emphasizes the key role in neuroregulation of BP, potentially involving gamma-aminobutyric acid (GABBR1), opioids (OPRM1), neuropeptides (NPW) and metabotropic glutamate receptors (GRM4 and GRM7). While there is already some evidence that pharmacological modulation of GABA receptors may influence vascular tone (65), there is no literature thus far on the potential impact of Neuropeptide W and metabotropic glutamate receptors on BP regulation. As G-coupled protein receptors, they represent potentially fruitful avenues for future research.
One other standout result from this pathway analysis is that 48 candidate genes for BP are olfactory receptors. With the caveat that there is hitherto no evidence that genetic variants within olfactory receptors contribute to clinically relevant changes in taste/smell, it has been observed that patients with acquired hyposmia (smell loss) had significant increases in dietary salt intake (66). On a population level, care should be taken when interpreting these results as commonly prescribed antihypertensive and cholesterol-lowering drugs themselves may alter senses of taste and smell (67). Interestingly, there may be a role of olfactory receptors, when ectopically expressed (68), for example in the kidney, where in mouse models Olfr78 responds to short chain fatty acids with renin secretion, and in turn contributes to BP regulation (69). The human ortholog of Olfr78 (OR51E2) is notably situated near its family member OR51E1, which is also BP candidate gene (4).
Another grouping of would-be therapeutic targets may be from the extracellular matrix pathway. This includes gene products within the extracellular matrix itself (e.g. fibronectin, collagen, fibrillin and thrombospondin), hormones (e.g. transforming growth factor, vascular endothelial growth factor, fibroblast growth factor and platelet-derived growth factor) and enzymes that regulate the extracellular matrix (e.g. matrix metallopeptidases and ADAM metallopeptidase with thrombospondin enzymes). The potential for many of these candidate genes as therapeutic targets may stem from also being overlapping GWAS candidate genes for ischaemic heart disease (70,71), allowing for therapeutic agents with the possibility of pleotropic effects.
GO analyses—GOfuncR and REVIGO
GO enrichment analyses were visualized using REVIGO. This tool calculates the semantic similarity between GO terms and aids their visualization (72). There are two main clusters observed in the GO plot for biological processes (Fig. 2). While one cluster (A) is unsurprising, including broad intracellular functions such as regulation of intracellular signal transduction, cell cycle, organelle organization, cellular localization, cellular process and transcription, the second cluster (B) may be more revealing in terms of potential new understanding of the biology underlying BP regulation. Within this cluster, the five major GO terms refer to biosynthesis of nucleobase-containing compounds, organic cyclic compounds, aromatic compounds, cellular nitrogen compounds and heterocyclic compounds. While there are commonly recognized enzymes within these groups contributing to the regulation of BP (e.g. adenylate and guanylate cyclases, and adenosine kinases), there are three other enzymes here which we felt were worth highlighting.

GO enrichment analysis. The REVIGO graph displays the biological process GO enrichment. Each sphere represents a GO term coloured by the GOfuncR enrichment in the –log10(P-value) scale. The semantic similarity of each GO term is represented by the position of each sphere on the graph. Plot size (sphere size) indicates the frequency of each GO term in which is found in the GO database (i.e. the larger the sphere is the more general the term is). Two clusters are highlighted: (A) includes broad intracellular functions such as regulation of intracellular signal transduction, cell cycle, organelle organization, cellular localization, cellular process and transcription; (B) the second cluster revealing terms of potential new understanding of the biology underlying BP regulation. Within this cluster, the five major GO terms refer to biosynthesis of nucleobase-containing compounds, organic cyclic compounds, aromatic compounds, cellular nitrogen compounds and heterocyclic compounds.
Carbamoyl-phosphate synthase 1 (CPS1) is the enzyme that catalyses the rate-limiting step in the urea cycle and L-citrulline production. This reaction allows vascular endothelial cells to recycle the by-product of nitric oxide synthesis, L-citrulline, by using components of the urea cycle. The naturally occurring T1405N variation within CPS1 is already known to vascular dynamics in an experimental setting (73). MTAP (encoding S-methyl-5'-thioadenosine phosphorylase) is also an interesting potential target as an important in the salvage of adenine and methionine. It also resides in the 9p21 region, with the strongest GWAS signal for coronary artery disease and myocardial (74,75). Mice heterozygous for MTAP shows increased predisposition to atherosclerotic lesions (76), which may allow for the development of therapeutic options that has pleotropic effects. It should, however, be recognized that there are also multiple other genes of interest in the 9p21 region, including the tumour suppressor genes CDKN2A and CDKN2, and the long non-coding RNA ANRIL. DBH encodes dopamine β-hydroxylase which catalyses the conversion of dopamine to norepinephrine. It is predominantly expressed in neural and adrenal tissues, and is involved in noradrenergic transmission of central and peripheral nervous systems. More recently, its expression has been detected in endothelial cells, where dopamine β-hydroxylase inhibition reduces in vitro angiogenesis (77). This enzyme also potentially plays a role in vascular wall remodelling, where DBH–/– mice has attenuated vessel injury-induced medial hypertrophy compared with wild-type littermates (78).
Multi-trait BP associations
BP-variants have been reported to have many other genetic associations (4,6,79). Here, we interrogated the GWAS Catalog for all traits associated with the BP-variants (n = 45.9k) (80). After applying a stricter P-value association threshold (–log10 P-value < 10.5), these results were manually curated and summarized on the mapped genes. After excluding BP traits (e.g. SBP, DBP and pulse pressure), CVD, red blood cell counts, body mass index and type 2 diabetes were the traits with increased numbers of BP-variant associations. The co-occurrence observed between BP variants and other traits could be highlighting pleiotropy effects, where the associated variant impacts overlapping causal pathways for each phenotype. However, this co-occurrence could also possibly be due to BP being affected by other traits. Figure 3 shows all BP loci with more than five associations with at least four traits. Two clusters of interest can be observed, with several genes demonstrating potential such as cholesterol levels, diabetes, obesity, and CVD itself. These genes include APOE, LDLR, FGF5, SLC39A8, FUT2, FTO and SH2B3-ATXN2. Our group has previously demonstrated the role of the blood-pressure associated non-synonymous polymorphism at SLC39A8 influencing in vitro intracellular cadmium accumulation and subsequent toxicity (81).

Multi-trait associations of BP loci. BP loci were searched in the GWAS catalog to investigate the pleiotropy of the loci. The heatmap displays a subset of the BP loci with five or more gene associations with at least four traits (excluding BP traits). Ward hierarchical clustering was applied on rows and columns (genes and traits).
The landscape and future of BP genetics
The genetic research community realized at early stages of GWAS that the sample sizes needed to be very large and also dependent on the number of SNPs been tested (82). As the sample sizes increased, substantial number of loci associated to traits and disease started to be uncovered. The ~1477 validated BP associations reported to date have been achieved by analysing over 1 million individuals (4,7–29). Yet, ~73% of the genetic variance remains unaccounted for (4).
The genetics of hypertension and BP prove to be highly complex. The associated BP loci are localized across the whole genome, and their annotations revealed enrichment in some obvious BP pathways, but they also revealed hundreds of other pathways and tissues with no direct known connections to BP. Here, we only observed two pathways previously enriched (the calcium signalling pathway and genes involved in haemostasis) maintaining their status as ‘top’ enriched pathways. These discoveries are contrary to the initial idea that the unravelling of new loci would first fill in the gaps of the already known pathways, and instead we observe the enrichment of systems and tissues with no prior knowledge on BP regulation. However, BP loci have been found to have multiple associations across traits (e.g. type II diabetes, obesity and cholesterol among others) showing strong evidence of pleiotropy (83) and this is challenging for the characterization of loci specific to BP.
The number of genes and variants uncovered by GWAS means there is a great challenge for traditional functional studies. There seems to be only a few successful follow up studies on candidate genes identified through GWAS relative to the numerous loci reported. However, this should be viewed in context of the lag-time between GWAS and publication of mechanistic studies that is often in the range of 3–5 years. With that, it may be reasonable to suggest more mechanistic studies are to follow with the massively expanded candidate gene list. In silico follow-up, the utilization of resources such as ENCODE (84) and GTEx (85), are becoming increasingly more important, where the next task is to prioritize the genes that may be most likely to bear fruitful clinical translation. For example, the application of deep neural networks to prioritize BP genes, converging data from a range of genomic annotation resources on the BP associated genes identified by GWAS; providing model training and testing datasets. Exploring the systems biology of these prioritized genes may offer new mechanistic insights and therapeutic targets.
BP genetic associations have shed light into mechanisms and the architecture of BP. However, there are two main drawbacks on most of these GWAS studies, (1) they are largely based on individuals with European descent and (2) due to most GWAS based on imputed SNP arrays, where imputation performs poorly in regions with low LD (86), they are generally restricted to the analysis of common variants (minor allele frequency > 1%). Expanding research into rare variants proved to be extraordinarily fruitful for Wainshctein and colleagues, who succeeded to recover all the heritability from height using whole-genome sequencing, but there has been limited research to date on the impact of rare variants on BP regulation (27,28). The missing heritability in height was unearthed from rare variants in regions of low LD (87). The availability of whole genomes and initiatives such as Genomics England could pave the way to investigate all variants, including structural and non-coding variants, allowing us to expand our knowledge and discover the hidden genetic variance in BP.
Conflict of Interest statement.
None declared.
Acknowledgements
This works forms part of the research areas contributing to the Cardiovascular Biomedical Research Centre at Barts Health NHS Trust which is supported and funded by the National Institute for Health Research.
References
(
Author notes
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors