Abstract

Mitochondrial DNA copy number (mtDNAcn) variation has been associated with increased risk of several human diseases in epidemiological studies. The quantification of mtDNAcn performed with real-time PCR is currently considered the de facto standard among several techniques. However, the heterogeneity of the laboratory methods (DNA extraction, storage, processing) used could give rise to results that are difficult to compare and reproduce across different studies. Several lines of evidence suggest that mtDNAcn is influenced by nuclear and mitochondrial genetic variability, however this relation is largely unexplored. The aim of this work was to elucidate the genetic basis of mtDNAcn variation. We performed a genome-wide association study (GWAS) of mtDNAcn in 6836 subjects from the ESTHER prospective cohort, and included, as replication set, the summary statistics of a GWAS that used 295 150 participants from the UK Biobank. We observed two novel associations with mtDNAcn variation on chromosome 19 (rs117176661), and 12 (rs7136238) that reached statistical significance at the genome-wide level. A polygenic score that we called mitoscore including all known single nucleotide polymorphisms explained 1.11% of the variation of mtDNAcn (p = 5.93 × 10−7). In conclusion, we performed a GWAS on mtDNAcn, adding to the evidence of the genetic background of this trait.

Introduction

Mitochondria are organelles involved in the regulation of critical cellular functions such as energy production via the oxidative phosphorylation reaction, apoptosis and calcium homeostasis and are responsible for the production of reactive oxygen species. Mitochondria possess their own genome (mtDNA), which is maternally inherited. mtDNA content has been shown to be a proxy for mitochondrial status and function, and is consequently an attractive biomarker due to its relative ease of measurement (1).

In each eukaryotic cell there can be hundreds or thousands of copies of their genomes. Mitochondrial DNA copy number (mtDNAcn) can be readily measured by quantitative real-time PCR (qPCR) using DNA extracted from peripheral blood or other tissues, although in practice most measurements are performed on material from peripheral blood due to ease of accessibility. mtDNAcn varies by cell type, but there is generally a correlation in the amount of mtDNA in different cell types. Therefore, mtDNAcn measured in circulating leukocytes could represent a good indicator of the average amount of mtDNAcn in other tissues (2).

Many studies investigated the possible role of mtDNAcn as a risk factor for several cancer types (3), including breast (2,4), colorectal (5), lung (6), prostate (7), pancreatic cancers (8,9) and B-cell lymphomas (10). Besides cancer, mtDNAcn has been associated with a wide range of phenotypes, such as myocardial function (11), cardiovascular disease (12), male infertility (13), depression (14), posttraumatic stress disorder (15), chronic stress and stress-related health behaviours (16). In general, since mitochondria are at the crossroads of many fundamental cellular processes, it can be expected that mtDNAcn might be associated with a wide range of physiological and pathological phenotypes.

mtDNAcn is influenced by exogenous as well as endogenous factors, including genetic factors since it appears to have a high heritability, estimated at 65% (17,18). Single nucleotide polymorphisms (SNPs) in candidate gene and genome-wide association studies (GWASs) were reported to be associated with mtDNAcn (17,19–24). Curran and colleagues in 2007 performed the first GWAS on mitochondrial content. In their analyses 1259 Mexican American individuals were included, and mtDNAcn was estimated using qPCR (17). López and colleagues performed the second GWAS on mitochondrial content measured through qPCR using 21 Spanish families consisting of 386 individuals spanning across five generations. They observed several suggestive loci that did not reach genome-wide significance (21). Cai and colleagues used low-coverage sequence data from 5224 women with major depressive disorder and 5218 healthy controls to compute the normalised numbers of reads mapping to the mitochondrial genome as a proxy for the amount of mtDNA. They identified two SNPs (rs11006126 and rs445) associated with mtDNAcn (22). Workalemahu and colleagues evaluated, in 471 Peruvian women, the association between mtDNAcn measured by qPCR and 119 629 SNPs (25). A small scale GWAS on mtDNA performed on 301 subjects from the Chinese population using qPCR identified a few suggestive loci, but none reached genome-wide significance (p < 5x10−8) (20). Yamamoto and colleagues analysed mtDNA using the whole genome sequence of 1928 Japanese subjects and did not report any associations (26). In addition, Guyatt and colleagues performed a GWAS on very heterogeneous populations of European descent using different setups of qPCR, suggesting novel candidates which did not reach the genome-wide significance (23). Finally, Hägg and colleagues conducted a study using data from UK Biobank obtaining the mtDNAcn from genotyping microarray probe intensities and reported 50 novel genome-wide significant loci (24).

The majority of these studies analysed together more than one population group, or small groups of subjects collected in different context or included only females, making thus the studies less homogeneous among them. These factors make it difficult to compare different studies and provide a global interpretation of the results. In addition, the application of different mtDNAcn quantification methods contributes to increasing the heterogeneity between studies (27,28). qPCR is the current de facto standard and has been the most widely used method for measuring mtDNAcn. More recently, new technologies such as droplet digital PCR technology, the use of genotyping assay intensity or the numbers of reads obtained from whole exome sequencing and whole-genome sequencing data have become available (28,29).

In addition to the different study designs and quantification techniques, other factors could increase the heterogeneity of the results between different studies, such as the nature of the samples from which the DNA is extracted, the extraction techniques used, or the content of the different cells in the samples (white and red blood cell percentage, and platelet count). (11,28). The absence of uniformity in sample handling, quantification techniques and study design has produced contrasting results (30,31). Indeed, both positive and negative associations have been found for mtDNAcn and aging (32,33), type 2 diabetes (34,35), pancreatic cancer (8,9), and breast cancer (4,6,36). For example, retrospective studies are prone to reverse causation bias if the disease or its treatment influence the mitochondrial count.

To increase our knowledge on the genetics of mtDNAcn, we conducted a study on a homogeneous population using the same measurement technique. We performed a GWAS on 6836 unrelated German individuals from the ‘Epidemiologische Studie zu Chancen der Verhütung, Früherkennung und optimierten THerapie chronischer ERkrankungen in der älteren Bevölkerung’ (ESTHER) prospective cohort study (37), to identify novel genetic variants associated with mtDNAcn and to compute a polygenic score (PGS) which we called ‘mitoscore’.

Results

Associations between SNPs and mtDNA copy number

In the ESTHER population we observed two genome-wide statistically significant associations with mtDNAcn: one on chromosome 13q12.2 (rs58413920) and one on chromosome 9q31.1 (rs10124625). Both minor alleles, A for rs58413920 and T for rs10124625, were associated with increased mtDNAcn: beta = 0.259; 95%CI: 0.176–0.343; p = 1.30 × 10−9 and beta = 0.072; 95% CI: 0.046–0.097; p = 3.36 × 10−8 respectively. rs58413920 is situated near the small nuclear RNA (snRNA) pseudogene RNU6-70P (8496 bp downstream of the SNP), while rs10124625 lies between the 5S ribosomal pseudogene RNA5SP291 (122 831 bp upstream the SNP) and the uncharacterised lncRNA LOC101928523 (187 347 bp downstream of the SNP). All the results, alongside information on the nearest genes of each SNP, chromosome position and allele frequency, are shown in Supplementary Material, Table S1. A Manhattan plot is shown in Figure 1, and Supplementary Material, Figure S1 shows regional plots for the two new genome-wide significant loci. In this study were included as replication set the summary statistic results of the recent GWAS by Hägg and colleagues performed using 295 150 participants from the UK Biobank (24). We performed a meta-analysis of the results obtained in the ESTHER and UK-Biobank studies excluding the SNPs already known to be associated with mtDNAcn (and those associated with them r2 > 0.8) and keeping only the SNPs with p < 0.05 in the ESTHER study. The 13q12.2 (rs58413920) SNP (MAF = 0.01 in Europeans) and its proxies were not present in the summary statistics of the UK-Biobank GWAS (24) due to the MAF threshold (0.05) set by Hägg and colleagues, instead, 9q31.1 (rs10124625) was present and with beta = 0.007 and p = 0.045.

Manhattan plot of the whole genome association analysis of mtDNAcn.
Figure 1

Manhattan plot of the whole genome association analysis of mtDNAcn.

In the meta-analysis we identified two novel SNPs that reached genome-wide p-value threshold: 12q15-rs7136238 (beta = −0.013; p = 3.83 × 10−8), and 19p13.11-rs117176661 (beta = 0.034; p = 2.73 × 10−9). The association results are reported in Table 1.

Table 1

Associations between SNPs and mtDNAcn

ESTHERUK-BiobankMeta-analysis
CHRBPaSNPEAbNEAMAFc  (ESTHER)MAFc  (EUR)BETAdSEdPdNBETASEPNBETASEP
1269 752 509rs7136238TG44%48%−0.0200.0100.0416735−0.0130.0032.68 × 10−7286 381−0.0140.0033.83 × 10−8
1917 445 208rs117176661TC4%6%0.0900.0264.71 × 10−466230.0310.0061.13 × 10−7286 3810.0340.0062.73 × 10−9
ESTHERUK-BiobankMeta-analysis
CHRBPaSNPEAbNEAMAFc  (ESTHER)MAFc  (EUR)BETAdSEdPdNBETASEPNBETASEP
1269 752 509rs7136238TG44%48%−0.0200.0100.0416735−0.0130.0032.68 × 10−7286 381−0.0140.0033.83 × 10−8
1917 445 208rs117176661TC4%6%0.0900.0264.71 × 10−466230.0310.0061.13 × 10−7286 3810.0340.0062.73 × 10−9

avariant position based on Genome Reference Consortium Human Build 37 (GRCh37); bEA: effect allele; cMAF: minor allele frequency; dLinear regression models to examine the association between SNPs and mtDNAcn were adjusted by sex, BMI, fruit intake frequency, pack-years of cigarette smoking.

Table 1

Associations between SNPs and mtDNAcn

ESTHERUK-BiobankMeta-analysis
CHRBPaSNPEAbNEAMAFc  (ESTHER)MAFc  (EUR)BETAdSEdPdNBETASEPNBETASEP
1269 752 509rs7136238TG44%48%−0.0200.0100.0416735−0.0130.0032.68 × 10−7286 381−0.0140.0033.83 × 10−8
1917 445 208rs117176661TC4%6%0.0900.0264.71 × 10−466230.0310.0061.13 × 10−7286 3810.0340.0062.73 × 10−9
ESTHERUK-BiobankMeta-analysis
CHRBPaSNPEAbNEAMAFc  (ESTHER)MAFc  (EUR)BETAdSEdPdNBETASEPNBETASEP
1269 752 509rs7136238TG44%48%−0.0200.0100.0416735−0.0130.0032.68 × 10−7286 381−0.0140.0033.83 × 10−8
1917 445 208rs117176661TC4%6%0.0900.0264.71 × 10−466230.0310.0061.13 × 10−7286 3810.0340.0062.73 × 10−9

avariant position based on Genome Reference Consortium Human Build 37 (GRCh37); bEA: effect allele; cMAF: minor allele frequency; dLinear regression models to examine the association between SNPs and mtDNAcn were adjusted by sex, BMI, fruit intake frequency, pack-years of cigarette smoking.

The two polymorphisms identified in the meta-analysis were independent (r2 < 0.15 in EUR subjects of the 1000 Genomes project) with SNPs reported in other studies. This independence was confirmed from the results obtained from the functional mapping and annotation of genome-wide association studies (FUMA) portal. The results obtained from FUMA reported three independent genome-wide lead SNPs in the 1 000 000 bp region around rs117176661 (rs35006574; rs67397200; rs117176661) on chromosome 19, and one around rs7136238 (rs7136238) on chromosome 12. FUMA results on lead SNPs are reported in Supplementary Material, Table S2. Both variants identified through the meta-analysis are reported in GTEx as expression quantitative trait loci (eQTLs) for six genes (three for each SNP), in a large number of human tissues. The detailed results from bioinformatic annotation tools are reported in Table 2. RegulomeDB assigns a score of 4 and 5 to 19p13.11-rs117176661 and 12q15-rs7136238, respectively, indicating a moderate chance for the SNPs to have a functional role.

Table 2

Results of bioinformatics analyses

CHRBPaSNPGenebRdb-RankeQTLdeQTL-tissueGene NameDistance bpDnIfe
1269 752 509rs7136238LYZ [UG] 44955LYZ33 tissues
YEATS4 [DG] 974RP11-1143G9.514 tissues
YEATS448 tissues
CCT2; MIR3913–1; MIR3913–2225 0003.02
1917 445 208rs117176661ANO8 [OG]4ANKLE16 tissues
MRPL344 tissuesMRPL3430 0003.68
OCEL14 tissues
B3GNT3460 0008.62
INSL3485 0005.47
NR2F690 0004.1
DDA125 0003.97
FAM129C185 0003.78
MAP1S385 0003.77
BST270 0003.76
GLT25D1220 0003.67
F2RL3450 0003.55
CPAMD8310 0003.51
NXNL1125 0003.08
CHRBPaSNPGenebRdb-RankeQTLdeQTL-tissueGene NameDistance bpDnIfe
1269 752 509rs7136238LYZ [UG] 44955LYZ33 tissues
YEATS4 [DG] 974RP11-1143G9.514 tissues
YEATS448 tissues
CCT2; MIR3913–1; MIR3913–2225 0003.02
1917 445 208rs117176661ANO8 [OG]4ANKLE16 tissues
MRPL344 tissuesMRPL3430 0003.68
OCEL14 tissues
B3GNT3460 0008.62
INSL3485 0005.47
NR2F690 0004.1
DDA125 0003.97
FAM129C185 0003.78
MAP1S385 0003.77
BST270 0003.76
GLT25D1220 0003.67
F2RL3450 0003.55
CPAMD8310 0003.51
NXNL1125 0003.08

aPosition: variant position based on Genome Reference Consortium Human Build 37 (GRCh37). b [OG] overlapped gene; [UG] nearest upstream gene: [DG] nearest downstream gene; distance from nearest gene; information obtained from SNPnexus. c RegulomeDB rank. d GTEx database Release V8 (data obtained from the GTEx Portal on 01 March 2021). e DIV3 -Distance-normalised interaction frequency.

Table 2

Results of bioinformatics analyses

CHRBPaSNPGenebRdb-RankeQTLdeQTL-tissueGene NameDistance bpDnIfe
1269 752 509rs7136238LYZ [UG] 44955LYZ33 tissues
YEATS4 [DG] 974RP11-1143G9.514 tissues
YEATS448 tissues
CCT2; MIR3913–1; MIR3913–2225 0003.02
1917 445 208rs117176661ANO8 [OG]4ANKLE16 tissues
MRPL344 tissuesMRPL3430 0003.68
OCEL14 tissues
B3GNT3460 0008.62
INSL3485 0005.47
NR2F690 0004.1
DDA125 0003.97
FAM129C185 0003.78
MAP1S385 0003.77
BST270 0003.76
GLT25D1220 0003.67
F2RL3450 0003.55
CPAMD8310 0003.51
NXNL1125 0003.08
CHRBPaSNPGenebRdb-RankeQTLdeQTL-tissueGene NameDistance bpDnIfe
1269 752 509rs7136238LYZ [UG] 44955LYZ33 tissues
YEATS4 [DG] 974RP11-1143G9.514 tissues
YEATS448 tissues
CCT2; MIR3913–1; MIR3913–2225 0003.02
1917 445 208rs117176661ANO8 [OG]4ANKLE16 tissues
MRPL344 tissuesMRPL3430 0003.68
OCEL14 tissues
B3GNT3460 0008.62
INSL3485 0005.47
NR2F690 0004.1
DDA125 0003.97
FAM129C185 0003.78
MAP1S385 0003.77
BST270 0003.76
GLT25D1220 0003.67
F2RL3450 0003.55
CPAMD8310 0003.51
NXNL1125 0003.08

aPosition: variant position based on Genome Reference Consortium Human Build 37 (GRCh37). b [OG] overlapped gene; [UG] nearest upstream gene: [DG] nearest downstream gene; distance from nearest gene; information obtained from SNPnexus. c RegulomeDB rank. d GTEx database Release V8 (data obtained from the GTEx Portal on 01 March 2021). e DIV3 -Distance-normalised interaction frequency.

On chromosome 19, we reported a novel variant associated with mtDNAcn. Near this locus, Hägg and colleagues reported a variant that reached the genome-wide significance level (rs10419397, 51 955 bp distant from rs117176661), while Guyatt and colleagues identified another variant (rs10424198, 33 612 bp distant from rs117176661) lying between these two SNPs. Although independent of each other (rs10419397 r2 = 0.014 D′ = 0.92; rs10424198 r2 = 0.017 D′ = 1.00 with rs117176661 in EUR), these SNPs are physically very close, indicating the regulatory potential of this genomic region on chromosome 19. We used the 3D Interaction Viewer and Database (3DIV) (38) to obtain epigenetic and spatial chromatin interaction annotation of the region between those three polymorphisms. For all the analyses, we considered a window of 500 000 bp and a distance normalized interaction frequency (DNIF) threshold of 3 using the data obtained from IMR90 fibroblast cell line. 3DIV indicates that rs117176661 belongs to a different regulatory topologically associating domain compared to rs10419397 and rs10424198. 19p13.11-rs117176661 interacts with 12 genes. In contrast, the variant identified in the UK-Biobank study (rs10419397) interacts with just one gene when analysed with the same parameters. 12q15-rs7136238 interacts with chaperonin containing TCP1 subunit 2 (CCT2) gene and two micro-RNA (MIR3913–1, MIR3913–2). The interaction values and the list of genes are reported in Table 2.

We computed in the ESTHER GWAS a PGS that we called mitoscore using 44 SNPs (out of the 50 loci reported by Hägg as genome-wide significant, six were not present in the imputed dataset). The mitoscore explains 1.05% (Nagelkerke pseudo-r2 = 0.01049; p = 3.56 × 10−6) of the variation of mtDNAcn, while the PGS generated using the 44 variants plus the two novel variants identified in this study explains 1.11% (Nagelkerke pseudo-r2 = 0.01107; p = 5.93 × 10−7) of the variation. The mitoscore (44 SNP) was categorized in quintiles, and with a logistic regression the fifth quintile (high mtDNAcn) was compared with the first quintile (low mtDNAcn) showing a statistically significant association (beta = 6.03; 95% CI = 3.87–8.19; p = 4.33 × 10−8) with the mtDNAcn. The results of the analyses of mitoscore are reported in Supplementary Material, Table S3.

Discussion

The variation in mtDNAcn measured in circulating blood has been associated with various human traits and diseases (2,4–11,13–16). In the vast majority of the reports, mtDNAcn has been measured through qPCR quantification, which is the de facto standard for relative quantification of mtDNAcn (28). There is large heterogeneity in the reported results, which could be largely due to the different study designs used and the differences in DNA extraction protocols, storage and processing (30,31,39). We performed an additional GWAS considering mtDNAcn as the outcome and to validate our results, we used the summary statistics results of the study performed on 295 150 participants from the UK Biobank. We have identified two novel independent genome-wide significant associations for mtDNAcn.

The first of these associations is between the minor allele (T) of rs117176661 on chromosome 19 and increased mtDNAcn. This SNP is relatively close to two other SNPs already known to be associated with mtDNAcn. However, the bioinformatic analyses indicate that this SNP is not in LD with the known variants and is located in a different regulatory topologically associating domain. Graphical representation of the locus on chromosome 19 and the chromatin interaction is shown in Supplementary Material, Figure S2. The presence of two different regulatory domains tagged by two SNP identified in independent studies suggests the locus’s potential role and at the same time that there is more than one regulatory region inside the locus that can modulate separately and in synergy mtDNAcn phenotype. 19p13.11-rs117176661 has a RegulomeDB rank of 4, indicating that it lies in a DNAse accessible region and probably alters possible transcription factor’s binding sites. It is an eQTL of three genes in several tissues, and one of these is the mitochondrial ribosomal protein L34 (MRPL34) gene. The potential regulatory role of this SNP on MRPL34 is supported by 3DIV tool, which reports a chromatin interaction with the MRPL34 gene region with a DNIF of 3.8. The data obtained from the bioinformatics analyses are potentially in agreement with our results: we observed that 19p13.11-rs117176661-T is associated with an increase in the relative mtDNAcn, and the same allele is associated with increased expression of the MRPL34 gene, which is involved in the synthesis of mitochondrial ribosomal proteins. The increment of the synthesis of an element essential for the mitochondrial function could be potentially correlated with the number of mitochondria, suggesting a potential new nuclear locus directly involved in the regulation of mitochondrial homeostasis.

12q15-rs7136238 has a RegulomeDB rank of 5 indicating a low regulatory role and is a multi-tissue eQTL for two genes, lysozyme (LYZ), YEATS domain containing 4 (YEATS4) and a long noncoding RNA (RP11-1143G9.5). 3DIV tool reports that this SNP interact with CCT2 gene and two micro-RNAs (MIR3913–1, MIR3913–2) located at a distance of 225 000 bp. Although the annotation tools report some evidence for a functional role of 12q15-rs7136238, the available information does not allow us to hypothesize a possible role in regulating the number of mitochondria.

For leukocyte telomere length, a marker for several human diseases that suffers from the same technical problems as mtDNAcn, a genetic score with telomere length has been identified, validated and applied in several epidemiologic studies (40–45). We sought to do the same for mtDNAcn carrying out a homogeneous study. The mitoscore generated with 44 of the 50 loci identified in UK-Biobank study explains 1.05% (p = 3.56 × 10−6) of the phenotypic variability and 1.11% (p = 5.93 × 10−7) adding the two new SNP to the PGS. Many studies have reported a small r2 value < 5% and in some cases < 1% for various PGS in different traits (46–50), therefore reporting an r2 of 1.11% with the first PGS generated for a complex trait as mtDNAcn can be considered a starting point for future studies to implement new variants to the score. Adding the two new SNPs increased the variance (+0.6%) explained by the score, suggesting that the SNPs we identified are relevant predictors of this phenotype.

A potential limitation of this study is the absence of blood cell counts, a factor associated with mtDNAcn as reported by a recent study (24). One of the strengths of this study is that it has been carried out on a large number of subjects of the same ethnicity and of both sexes, without any kind of internal stratification and including several environmental/lifestyle potential confounders (i.e. dietary behaviours). Our study (in the discovery phase) is extremely homogeneous because we used a measurement technique that is considered the de facto standard for these studies, on all the 6836 samples that were collected, stored and extracted in the same way and analysed at the same time.

In addition, another point of strength is the two-step approach (discovery and a replication phase), the latter performed in one of the largest populations ever tested for this phenotype.

In conclusion, we identified two new genetic variants (19p13.11-rs117176661 and 12q15-rs7136238) involved in the variation of mtDNAcn, and we propose for the first time the mitoscore as a tool to estimate the genetic basis of mtDNAcn variation. We envision, in the future, the use of this score as a proxy of direct measurement of mitochondria, reducing time, cost, and possible bias inherent to the direct measurement of mtDNAcn, thus improving reproducibility of mtDNAcn analyses in epidemiological studies.

Table 3

Characteristics of the study population

TotalMaleFemale
Number of Individuals764034624178
Median age, years (25th–75th percentile)62 (57–67)63 (57–67)62 (57–67)
50–64 years at baseline4700 (62%)2126 (61%)2574 (62%)
65–75 years at baseline2940 (38%)1336 (39%)1604 (38%)
Number of individuals with information on fruit intake frequency738133524029
more than once/day21986871511
once/day241510951320
more than one/week22051220985
once/week29019298
less than once/week22112695
no523220
Number of individuals with information on vegetable intake frequency732333373986
more than once/day26679187
once/day16265311095
more than one/week461422632351
once/week681377304
less than once/week1207347
no16142
Number of individuals with information on BMI763134574174
Underweight (BMI < 18,5)341024
Normal range (BMI < 25)20357571278
Overweight (BMI < 30)361218521760
Obese (BMI > = 30)19508381112
Median BMI (25th–75th percentile)27.71 (25–30)27.86 (25–30)27.58 (24–30)
Number of individuals with information on alcohol consumption764034624178
Abstainer29388542084
Women 0–19.99 g/d or Man 0–39.99 g/d421123591852
Women 20–39.99 g/d or Man 40–59.99 g/d386182204
Women > =40 g/d or Man > =60 g/d1056738
Average g of alcohol consumed for a day (25th–75th percentile)9.00 (0–13)13.88 (1.70–19.71)4.95 (0–6.29)
Number of individuals with information on physical activity764034624178
Inactive: < 1 h of physical activity/week16285371091
Low: 1-2 hrs physical activity/week349615311965
Medium or high: > = 2 h of vigorous and > = 2 h of light physical activity/week251613941122
Number of individuals with information on smoking behavior742434014023
non-smoker370210142688
former smoker24651719746
current smoker1257668589
Pack-years of cigarette smoking (25th–75th percentile)a22.41 (7–34)25.07 (9–38)17.65 (4.8–27.5)
TotalMaleFemale
Number of Individuals764034624178
Median age, years (25th–75th percentile)62 (57–67)63 (57–67)62 (57–67)
50–64 years at baseline4700 (62%)2126 (61%)2574 (62%)
65–75 years at baseline2940 (38%)1336 (39%)1604 (38%)
Number of individuals with information on fruit intake frequency738133524029
more than once/day21986871511
once/day241510951320
more than one/week22051220985
once/week29019298
less than once/week22112695
no523220
Number of individuals with information on vegetable intake frequency732333373986
more than once/day26679187
once/day16265311095
more than one/week461422632351
once/week681377304
less than once/week1207347
no16142
Number of individuals with information on BMI763134574174
Underweight (BMI < 18,5)341024
Normal range (BMI < 25)20357571278
Overweight (BMI < 30)361218521760
Obese (BMI > = 30)19508381112
Median BMI (25th–75th percentile)27.71 (25–30)27.86 (25–30)27.58 (24–30)
Number of individuals with information on alcohol consumption764034624178
Abstainer29388542084
Women 0–19.99 g/d or Man 0–39.99 g/d421123591852
Women 20–39.99 g/d or Man 40–59.99 g/d386182204
Women > =40 g/d or Man > =60 g/d1056738
Average g of alcohol consumed for a day (25th–75th percentile)9.00 (0–13)13.88 (1.70–19.71)4.95 (0–6.29)
Number of individuals with information on physical activity764034624178
Inactive: < 1 h of physical activity/week16285371091
Low: 1-2 hrs physical activity/week349615311965
Medium or high: > = 2 h of vigorous and > = 2 h of light physical activity/week251613941122
Number of individuals with information on smoking behavior742434014023
non-smoker370210142688
former smoker24651719746
current smoker1257668589
Pack-years of cigarette smoking (25th–75th percentile)a22.41 (7–34)25.07 (9–38)17.65 (4.8–27.5)

avalues measured on former and current smokers’ groups

Table 3

Characteristics of the study population

TotalMaleFemale
Number of Individuals764034624178
Median age, years (25th–75th percentile)62 (57–67)63 (57–67)62 (57–67)
50–64 years at baseline4700 (62%)2126 (61%)2574 (62%)
65–75 years at baseline2940 (38%)1336 (39%)1604 (38%)
Number of individuals with information on fruit intake frequency738133524029
more than once/day21986871511
once/day241510951320
more than one/week22051220985
once/week29019298
less than once/week22112695
no523220
Number of individuals with information on vegetable intake frequency732333373986
more than once/day26679187
once/day16265311095
more than one/week461422632351
once/week681377304
less than once/week1207347
no16142
Number of individuals with information on BMI763134574174
Underweight (BMI < 18,5)341024
Normal range (BMI < 25)20357571278
Overweight (BMI < 30)361218521760
Obese (BMI > = 30)19508381112
Median BMI (25th–75th percentile)27.71 (25–30)27.86 (25–30)27.58 (24–30)
Number of individuals with information on alcohol consumption764034624178
Abstainer29388542084
Women 0–19.99 g/d or Man 0–39.99 g/d421123591852
Women 20–39.99 g/d or Man 40–59.99 g/d386182204
Women > =40 g/d or Man > =60 g/d1056738
Average g of alcohol consumed for a day (25th–75th percentile)9.00 (0–13)13.88 (1.70–19.71)4.95 (0–6.29)
Number of individuals with information on physical activity764034624178
Inactive: < 1 h of physical activity/week16285371091
Low: 1-2 hrs physical activity/week349615311965
Medium or high: > = 2 h of vigorous and > = 2 h of light physical activity/week251613941122
Number of individuals with information on smoking behavior742434014023
non-smoker370210142688
former smoker24651719746
current smoker1257668589
Pack-years of cigarette smoking (25th–75th percentile)a22.41 (7–34)25.07 (9–38)17.65 (4.8–27.5)
TotalMaleFemale
Number of Individuals764034624178
Median age, years (25th–75th percentile)62 (57–67)63 (57–67)62 (57–67)
50–64 years at baseline4700 (62%)2126 (61%)2574 (62%)
65–75 years at baseline2940 (38%)1336 (39%)1604 (38%)
Number of individuals with information on fruit intake frequency738133524029
more than once/day21986871511
once/day241510951320
more than one/week22051220985
once/week29019298
less than once/week22112695
no523220
Number of individuals with information on vegetable intake frequency732333373986
more than once/day26679187
once/day16265311095
more than one/week461422632351
once/week681377304
less than once/week1207347
no16142
Number of individuals with information on BMI763134574174
Underweight (BMI < 18,5)341024
Normal range (BMI < 25)20357571278
Overweight (BMI < 30)361218521760
Obese (BMI > = 30)19508381112
Median BMI (25th–75th percentile)27.71 (25–30)27.86 (25–30)27.58 (24–30)
Number of individuals with information on alcohol consumption764034624178
Abstainer29388542084
Women 0–19.99 g/d or Man 0–39.99 g/d421123591852
Women 20–39.99 g/d or Man 40–59.99 g/d386182204
Women > =40 g/d or Man > =60 g/d1056738
Average g of alcohol consumed for a day (25th–75th percentile)9.00 (0–13)13.88 (1.70–19.71)4.95 (0–6.29)
Number of individuals with information on physical activity764034624178
Inactive: < 1 h of physical activity/week16285371091
Low: 1-2 hrs physical activity/week349615311965
Medium or high: > = 2 h of vigorous and > = 2 h of light physical activity/week251613941122
Number of individuals with information on smoking behavior742434014023
non-smoker370210142688
former smoker24651719746
current smoker1257668589
Pack-years of cigarette smoking (25th–75th percentile)a22.41 (7–34)25.07 (9–38)17.65 (4.8–27.5)

avalues measured on former and current smokers’ groups

Materials and Methods

Study population

The study was based on the (ESTHER) cohort study, whose details have been reported elsewhere (37). Briefly, 9940 individuals, aged 50–75 years, have been enrolled between 2000 and 2002, and followed for 17 years. After providing informed consent, information about lifestyle, smoking habits, diet, and sociodemographic characteristics were collected at baseline as well as at follow-ups. The ESTHER study was approved by the ethics committees of University of Heidelberg and the state medical board of Saarland, Germany.

In this study we included the information on age (reported as continuous variable), sex (dichotomous variable), BMI (continuous variable), smoking status, pack-years of cigarette smoking among current and former smokers (continuous variable), physical activity (PA; classified as 0: ‘inactive’ (reference) < 1 hr PA/week; 1: ‘low’ 1-2 hrs PA/week; 2: ‘moderate’/‘high’ ≥2 hrs PA/week), fruit intake frequency (1: more than once/day; 2: once/day; > 3: more than one/week; 4: once/week; 5: less than once/week; 6: no), vegetable intake frequency (1: more than once/day; 2: once/day; > 3: more than one/week; 4: once/week; 5: less than once/week; 6: no), and alcohol intake (continuous variable, number of grams ingested daily). Main characteristics of the study population are reported in Table 3.

DNA extraction and sample preparation

Blood samples were taken during a routine health examination and stored at −80°C until analysis. DNA from whole blood samples was collected using a salting out procedure. DNA concentration of each sample was measured using PicoGreen dsDNA quantitation assay (Invitrogen|ThermoFisher, Waltham, MA, USA) to intercalate the dsDNA and the Tecan GENios Multireader microplate reader to read the UV absorbance. The measured DNA of each individual was diluted with water to the final concentration of 5 ng/μL.

qPCR measurement of mtDNA copy number

mtDNAcn measurement was performed in 7640 study subjects. Real-time multiplex polymerase chain reaction was used to quantify the copy number of the mitochondrial gene NADH dehydrogenase, subunit 1 (ND1), and of the nuclear single copy gene albumin (ALB) as housekeeping reference. All the DNA samples were diluted to a concentration of 2.5 ng/μL in a 96 well plates. The triplicates of each sample were obtained dispensing with an electronic multichannel pipette 2 μL of the sample in 3 wells of a 384-well plate (5 ng DNA of each individual in each well). We previously reported the details of the quantification technique (8). The PCR was performed using a Viia-7 sequence detection system (Applied Biosystems) to acquire the cycle threshold (Ct) values for copy number of ND1 and ALB gene.

Individual Ct values that deviated from the average of the triplicates by more than 5% of the standard deviation were discarded. This affected 467 (6%) samples. Furthermore, samples whose average Ct was not within the standard curve range were excluded and not included in the analyses (N = 337, 4%). Standard curves were generated in each plate with a serial dilution (1:2) from 30 ng to 0.47 ng of genomic DNA pooled from 50 random individuals belonging to the study. The standard pool (STP) was renewed every 4 ~ 5 plates (using however the same subjects), to preserve the quality of the pooled DNA during the laboratory measurement. For each reaction the RT-PCR efficiency (E) was calculated using standard curve points in the exponential phase according to the equation: E = 10[−1/slope].

After exclusion due to quality control (QC) failure, the mtDNAcn was obtained for 6836 individuals as the ratio between ND1/ALB copy numbers, based on the calculation introduced by Pfaffl (51). This approach is suited to qPCR results lacking identical efficiency between the amplification reaction of ND1 and ALB, using the Ct of the standard curve to the equivalent of 5 ng of DNA as a calibrator.

Genotyping and imputation

Genome-wide SNP genotyping was performed on all study subjects using the Illumina Infinium OncoArray and Global Screening Array BeadChips (Illumina, San Diego, CA, USA) as previously described in detail (52). Genotyping quality controls followed the protocols reported by Anderson et al. (53). Genotyping data were phased using SHAPEIT2 and imputed using the Michigan Imputation Server, and MiniMac 4 was used to impute to the haplotype reference consortium (HRC) Version r1.1 2016 reference panel (54,55).

Correlation analyses

We performed a correlation analysis (Pearson correlation) to estimate the linear relation between mtDNAcn and the covariates included in our study (age, sex, BMI, physical activity, alcohol consumption, pack-years of cigarettes smoking, fruit, and vegetable intake frequency). The plates used for each qPCR were classified in group on the base of the STP included in them, and this classification was used and evaluated as a covariate. The covariates that showed a statistically significant correlation (p < 0.05) were included in the association analyses as adjustment variables (Table 4).

Table 4

Correlations between mtDNA copy number and selected covariates

Variablerap-valueb
Age (years)−0.0191.20 × 10−1
Alcohol consumption (g/day)−0.0123.64 × 10−1
BMI−0.0362.77 × 10−3
Cigarette smoking (pack-years)−0.0593.37 × 10−6
Fruit intake frequency−0.0434.81 × 10−4
Physical activity0.0019.28 × 10−1
Sex−0.0656.75 × 10−8
Vegetable intake frequency−0.0142.43 × 10−1
STP (standard pool)0.0132.74 × 10−1
Variablerap-valueb
Age (years)−0.0191.20 × 10−1
Alcohol consumption (g/day)−0.0123.64 × 10−1
BMI−0.0362.77 × 10−3
Cigarette smoking (pack-years)−0.0593.37 × 10−6
Fruit intake frequency−0.0434.81 × 10−4
Physical activity0.0019.28 × 10−1
Sex−0.0656.75 × 10−8
Vegetable intake frequency−0.0142.43 × 10−1
STP (standard pool)0.0132.74 × 10−1

ar: Pearson correlation coefficient; bp-values in bold are statistically significant (p < 0.05).

Table 4

Correlations between mtDNA copy number and selected covariates

Variablerap-valueb
Age (years)−0.0191.20 × 10−1
Alcohol consumption (g/day)−0.0123.64 × 10−1
BMI−0.0362.77 × 10−3
Cigarette smoking (pack-years)−0.0593.37 × 10−6
Fruit intake frequency−0.0434.81 × 10−4
Physical activity0.0019.28 × 10−1
Sex−0.0656.75 × 10−8
Vegetable intake frequency−0.0142.43 × 10−1
STP (standard pool)0.0132.74 × 10−1
Variablerap-valueb
Age (years)−0.0191.20 × 10−1
Alcohol consumption (g/day)−0.0123.64 × 10−1
BMI−0.0362.77 × 10−3
Cigarette smoking (pack-years)−0.0593.37 × 10−6
Fruit intake frequency−0.0434.81 × 10−4
Physical activity0.0019.28 × 10−1
Sex−0.0656.75 × 10−8
Vegetable intake frequency−0.0142.43 × 10−1
STP (standard pool)0.0132.74 × 10−1

ar: Pearson correlation coefficient; bp-values in bold are statistically significant (p < 0.05).

Genome-wide association analysis

PLINK1.9 software was used to assess genotyping QC on the subjects with mtDNAcn available. Population structure was assessed by principal component analysis. SNPs with minor allele frequency (MAF) < 0.01, or evidence for violations of Hardy–Weinberg equilibrium (p < 10−5) were excluded, leaving for the analysis 6 386 351 SNPs of 6836 individuals. The lambda value, used to estimate the inflation rate, was 0.995 and a QQ plot is reported as Figure 2. We used linear regression models to examine the association between SNPs and mtDNAcn as a quantitative trait, adjusting for the variables associated with the mtDNAcn in the correlation analyses (sex, BMI, fruit intake frequency, pack-years of cigarette smoking), the top eight principal components and the STP. For statistical significance, we used a threshold of p < 5 × 10−8.

Quantile-quantile (QQ) plot for the whole genome association analysis of mtDNAcn.
Figure 2

Quantile-quantile (QQ) plot for the whole genome association analysis of mtDNAcn.

Summary statistics of UK biobank GWAS

The summary statistics of the GWAS performed using 295 150 participants from the UK Biobank (24) were used as replication. The mtDNAcn was estimated from the intensities of array genotyping probes on the mitochondrial chromosome. The betas, confidence intervals and p-values were obtained by logistic regression adjusted for the first ten principal components, age at baseline, sex, genotyping batch, genotyping missingness/call rate, and neutrophil percentage lymphocyte percentage and white blood cell count. The details of the study are reported in the original publication (24).

GWAS meta-analysis

A meta-analysis of the results obtained in the ESTHER GWAS and in the UK-Biobank GWAS was performed. The association results (beta, and standard error) for each SNP were combined with the METAL software (v2011-03-25) adopting a fixed effects model. The meta-analysis included 2 775 252 SNPs with genotype data available from the two studies. A p-value of 5 × 10−8 was used to establish a threshold for genome-wide significance.

Polygenic scores

We computed a PGS to estimate the relative mitochondrial content. The score was generated using the effect sizes (betas) reported in the summary statistics of the GWAS performed on the UK-Biobank population and was tested in the ESTHER population.

The score was calculated using PLINK software, and the association with mtDNAcn was tested with a generalised linear model using the quantification values measured in the ESTHER population.

We generated two scores, one including the variants identified by Hägg and colleagues. Of the 50 genome-wide significant loci reported by Hägg, 44 were present in the ESTHER GWAS and were included in the score. A second score was computed, adding to the 44 variants the new variants identified in this study. The PGS for each study subject was generated by adding up the product of the effect (beta) of each variant by the number of effect alleles. To take into account that not all subjects had a 100% call rate, the PGS was divided by the number of genotypes available for each subject (average subject call rate 0.99). The analyses for all PGSs were adjusted by the same covariates used for the genome-wide analyses of individual SNPs.

Bioinformatic analysis

Bioinformatic tools were used to evaluate the functional relevance of each SNP with statistically significant results in the genome-wide association analysis. Independent signals within each locus were defined with a correlation (r2) of less than 0.1 to the variant with the lowest p-value using the FUMA platform (56). FUMA was also used to integrate the functional annotation of associated variants. SNPnexus was used as database for functional annotation of SNPs (57). Regulome DB was used to identify the regulatory potential of each SNP (58), and Genotype-Tissue Expression GTEx Analysis Release V8 (data obtained from the GTEx Portal on 01 March 2021) was used to identify the relationship between the SNPs and the level of expression in the nearby genes (59). Finally, we used the web-tool 3D-genome interaction viewer (60,61) to evaluate the chromatin interaction and the epigenetic annotation in the newly identified risk loci.

Acknowledgements

We thank Drs. Sara Hägg and Felix Grassmann (Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden) for kindly sharing with us the summary statistics of the study reported in Hägg et al, 2021.

This work was supported by intramural funding of German Cancer Research Center (DKFZ) and of the University of Pisa. The ESTHER study was supported by grants from the Baden-Württemberg Ministry of Science, Research and Arts, the German Federal Ministry of Education and Research, the German Federal Ministry of Family, Senior Citizens, Women and Youth, the Saarland Ministry of Social Affairs, Health, Women and Family, and the Network Aging Research at Heidelberg University.

Conflict of Interest statement. The authors declare no competing interests.

References

1.

Castellani
,
C.A.
,
Longchamps
,
R.J.
,
Sun
,
J.
,
Guallar
,
E.
and
Arking
,
D.E.
(
2020
)
Thinking outside the nucleus: mitochondrial DNA copy number in health and disease
.
Mitochondrion
,
53
,
214
223
.

2.

Shen
,
J.
,
Platek
,
M.
,
Mahasneh
,
A.
,
Ambrosone
,
C.B.
and
Zhao
,
H.
(
2010
)
Mitochondrial copy number and risk of breast cancer: a pilot study
.
Mitochondrion
,
10
,
62
68
.

3.

Giaccherini
,
M.
,
Gentiluomo
,
M.
,
Fornili
,
M.
,
Lucenteforte
,
E.
,
Baglietto
,
L.
and
Campa
,
D.
(
2021
)
Association between telomere length and mitochondrial copy number and cancer risk in humans: a meta-analysis on more than 300,000 individuals
.
Crit. Rev. Oncol. Hematol.
,
167
, 103510.

4.

Campa
,
D.
,
Barrdahl
,
M.
,
Santoro
,
A.
,
Severi
,
G.
,
Baglietto
,
L.
,
Omichessan
,
H.
,
Tumino
,
R.
,
Bueno-de-Mesquita
,
H.B.A.
,
Peeters
,
P.H.
,
Weiderpass
,
E.
 et al. (
2018
)
Mitochondrial DNA copy number variation, leukocyte telomere length, and breast cancer risk in the European prospective investigation into cancer and nutrition (EPIC) study
.
Breast Cancer Res.
,
20
,
29
.

5.

Kumar
,
B.
,
Bhat
,
Z.I.
,
Bansal
,
S.
,
Saini
,
S.
,
Naseem
,
A.
,
Wahabi
,
K.
,
Burman
,
A.
,
Kumar
,
G.T.
,
Saluja
,
S.S.
and
Rizvi
,
M.M.A.
(
2017
)
Association of mitochondrial copy number variation and T16189C polymorphism with colorectal cancer in north Indian population
.
Tumor Biol.
,
39
.

6.

Hosgood
,
H.D.
,
Liu
,
C.S.
,
Rothman
,
N.
,
Weinstein
,
S.J.
,
Bonner
,
M.R.
,
Shen
,
M.
,
Lim
,
U.
,
Virtamo
,
J.
,
Cheng
,
W.
,
ling, Albanes
,
D.
 et al. (
2010
)
Mitochondrial DNA copy number and lung cancer risk in a prospective cohort study
.
Carcinogenesis
,
31
,
847
849
.

7.

Moore
,
A.
,
Lan
,
Q.
,
Hofmann
,
J.N.
,
Liu
,
C.-S.S.
,
Cheng
,
W.-L.L.
,
Lin
,
T.-T.T.
and
Berndt
,
S.I.
(
2017
)
A prospective study of mitochondrial DNA copy number and the risk of prostate cancer
.
Cancer Causes Control
,
28
,
529
538
.

8.

Gentiluomo
,
M.
,
Katzke
,
V.A.
,
Kaaks
,
R.
,
Tjønneland
,
A.
,
Severi
,
G.
,
Perduca
,
V.
,
Boutron-Ruault
,
M.-C.
,
Weiderpass
,
E.
,
Ferrari
,
P.
,
Johnson
,
T.
 et al. (
2020
)
Mitochondrial DNA copy-number variation and pancreatic cancer risk in the prospective EPIC cohort
.
Cancer Epidemiol. biomarkers Prev. a Publ. Am. Assoc. Cancer Res. cosponsored by Am. Soc. Prev. Oncol.
,
29
,
681
686
.

9.

Lynch
,
S.M.
,
Weinstein
,
S.J.
,
Virtamo
,
J.
,
Lan
,
Q.
,
Liu
,
C.-S.C.-S.
,
Cheng
,
W.-L.W.-L.
,
Rothman
,
N.
,
Albanes
,
D.
and
Stolzenberg-Solomon
,
R.Z.
(
2011
)
Mitochondrial DNA copy number and pancreatic cancer in the alpha-tocopherol Beta-carotene cancer prevention study
.
Cancer Prev. Res.
,
4
,
1912
1919
.

10.

Hosnijeh
,
F.S.
,
Lan
,
Q.
,
Rothman
,
N.
,
San Liu
,
C.
,
Cheng
,
W.-L.
,
Nieters
,
A.
,
Guldberg
,
P.
,
Tjønneland
,
A.
,
Campa
,
D.
,
Martino
,
A.
 et al. (
2014
)
Mitochondrial DNA copy number and future risk of B-cell lymphoma in a nested case-control study in the prospective EPIC cohort
.
Blood
,
124
,
530
535
.

11.

Kuznetsova
,
T.
and
Knez
,
J.
(
2017
)
Peripheral blood mitochondrial DNA and myocardial function
.
Advances in Experimental Medicine and Biology
,
Springer
 
New York LLC
, Vol.
982
, pp.
347
358
.

12.

Ashar
,
F.N.
,
Zhang
,
Y.
,
Longchamps
,
R.J.
,
Lane
,
J.
,
Moes
,
A.
,
Grove
,
M.L.
,
Mychaleckyj
,
J.C.
,
Taylor
,
K.D.
,
Coresh
,
J.
,
Rotter
,
J.I.
 et al. (
2017
)
Association of Mitochondrial DNA copy number with cardiovascular disease
.
JAMA Cardiol.
,
2
,
1247
1255
.

13.

Jiang
,
M.
,
Kauppila
,
T.E.S.
,
Motori
,
E.
,
Li
,
X.
,
Atanassov
,
I.
,
Folz-Donahue
,
K.
,
Bonekamp
,
N.A.
,
Albarran-Gutierrez
,
S.
,
Stewart
,
J.B.
and
Larsson
,
N.G.
(
2017
)
Increased Total mtDNA copy number cures male infertility despite unaltered mtDNA mutation load
.
Cell Metab.
,
26
,
429
436.e4
.

14.

Edwards
,
A.C.
,
Aggen
,
S.H.
,
Cai
,
N.
,
Bigdeli
,
T.B.
,
Peterson
,
R.E.
,
Docherty
,
A.R.
,
Webb
,
B.T.
,
Bacanu
,
S.-A.
,
Flint
,
J.
and
Kendler
,
K.S.
(
2016
)
Chronicity of depression and molecular markers in a large sample of HAN CHINESE women
.
Depress. Anxiety
,
33
,
1048
1054
.

15.

Bersani
,
F.S.
,
Morley
,
C.
,
Lindqvist
,
D.
,
Epel
,
E.S.
,
Picard
,
M.
,
Yehuda
,
R.
,
Flory
,
J.
,
Bierer
,
L.M.
,
Makotkine
,
I.
,
Abu-Amara
,
D.
 et al. (
2016
)
Mitochondrial DNA copy number is reduced in male combat veterans with PTSD
.
Prog. Neuro-Psychopharmacology Biol. Psychiatry
,
64
,
10
17
.

16.

Picard
,
M.
,
Juster
,
R.-P.
and
McEwen
,
B.S.
(
2014
)
Mitochondrial allostatic load puts the “gluc” back in glucocorticoids
.
Nat. Rev. Endocrinol.
,
10
,
303
310
.

17.

Curran
,
J.E.
,
Johnson
,
M.P.
,
Dyer
,
T.D.
,
Göring
,
H.H.H.
,
Kent
,
J.W.
,
Charlesworth
,
J.C.
,
Borg
,
A.J.
,
Jowett
,
J.B.M.
,
Cole
,
S.A.
,
MacCluer
,
J.W.
 et al. (
2007
)
Genetic determinants of mitochondrial content
.
Hum. Mol. Genet.
,
16
,
1504
1514
.

18.

Xing
,
J.
,
Chen
,
M.
,
Wood
,
C.G.
,
Lin
,
J.
,
Spitz
,
M.R.
,
Ma
,
J.
,
Amos
,
C.I.
,
Shields
,
P.G.
,
Benowitz
,
N.L.
,
Gu
,
J.
 et al. (
2008
)
Mitochondrial DNA content: its genetic heritability and association with renal cell carcinoma
.
J. Natl. Cancer Inst.
,
100
,
1104
1112
.

19.

Curran
,
J.E.
,
Jowett
,
J.B.M.
,
Abraham
,
L.J.
,
Diepeveen
,
L.A.
,
Elliott
,
K.S.
,
Dyer
,
T.D.
,
Kerr-Bayles
,
L.J.
,
Johnson
,
M.P.
,
Comuzzie
,
A.G.
,
Moses
,
E.K.
 et al. (
2010
)
Genetic variation in PARL influences mitochondrial content
.
Hum. Genet.
,
127
,
183
190
.

20.

Li
,
Z.
,
Zhu
,
M.
,
Du
,
J.
,
Ma
,
H.
,
Jin
,
G.
and
Dai
,
J.
(
2018
)
Genetic variants in nuclear DNA along with environmental factors modify mitochondrial DNA copy number: a population-based exome-wide association study
.
BMC Genomics
,
19
,
752
.

21.

López
,
S.
,
Buil
,
A.
,
Souto
,
J.C.
,
Casademont
,
J.
,
Martinez-Perez
,
A.
,
Almasy
,
L.
and
Soria
,
J.M.
(
2014
)
A genome-wide association study in the genetic analysis of idiopathic thrombophilia project suggests sex-specific regulation of mitochondrial DNA levels
.
Mitochondrion
,
18
,
34
40
.

22.

Cai
,
N.
,
Li
,
Y.
,
Chang
,
S.
,
Liang
,
J.
,
Lin
,
C.
,
Zhang
,
X.
,
Liang
,
L.
,
Hu
,
J.
,
Chan
,
W.
,
Kendler
,
K.S.
 et al. (
2015
)
Genetic control over mtDNA and its relationship to major depressive disorder
.
Curr. Biol.
,
25
,
3170
3177
.

23.

Guyatt
,
A.L.
,
Brennan
,
R.R.
,
Burrows
,
K.
,
Guthrie
,
P.A.I.
,
Ascione
,
R.
,
Ring
,
S.M.
,
Gaunt
,
T.R.
,
Pyle
,
A.
,
Cordell
,
H.J.
,
Lawlor
,
D.A.
 et al. (
2019
)
A genome-wide association study of mitochondrial DNA copy number in two population-based cohorts
.
Hum. Genomics
,
13
,
6
.

24.

Hägg
,
S.
,
Jylhävä
,
J.
,
Wang
,
Y.
,
Czene
,
K.
and
Grassmann
,
F.
(
2021
)
Deciphering the genetic and epidemiological landscape of mitochondrial DNA abundance
.
Hum. Genet.
,
140
,
849
861
.

25.

Workalemahu
,
T.
,
Enquobahrie
,
D.A.
,
Tadesse
,
M.G.
,
Hevner
,
K.
,
Gelaye
,
B.
,
Sanchez
,
S.E.
and
Williams
,
M.A.
(
2017
)
Genetic variations related to maternal whole blood mitochondrial DNA copy number: a genome-wide and candidate gene study
.
J. Matern. Neonatal Med.
,
30
,
2433
2439
.

26.

Yamamoto
,
K.
,
Sakaue
,
S.
,
Matsuda
,
K.
,
Murakami
,
Y.
,
Kamatani
,
Y.
,
Ozono
,
K.
,
Momozawa
,
Y.
and
Okada
,
Y.
(
2020
)
Genetic and phenotypic landscape of the mitochondrial genome in the Japanese population
.
Commun. Biol.
,
3
,
104
.

27.

Fazzini
,
F.
,
Schöpf
,
B.
,
Blatzer
,
M.
,
Coassin
,
S.
,
Hicks
,
A.A.
,
Kronenberg
,
F.
and
Fendt
,
L.
(
2018
)
Plasmid-normalized quantification of relative mitochondrial DNA copy number
.
Sci. Rep.
,
8
,
1
11
.

28.

Longchamps
,
R.J.
,
Castellani
,
C.A.
,
Yang
,
S.Y.
,
Newcomb
,
C.E.
,
Sumpter
,
J.A.
,
Lane
,
J.
,
Grove
,
M.L.
,
Guallar
,
E.
,
Pankratz
,
N.
,
Taylor
,
K.D.
 et al. (
2020
)
Evaluation of mitochondrial DNA copy number estimation techniques
.
PLoS One
,
15
, e0228166.

29.

O’Hara
,
R.
,
Tedone
,
E.
,
Ludlow
,
A.
,
Huang
,
E.
,
Arosio
,
B.
,
Mari
,
D.
and
Shay
,
J.W.
(
2019
)
Quantitative mitochondrial DNA copy number determination using droplet digital PCR with single-cell resolution
.
Genome Res.
,
29
,
1878
1888
.

30.

Andreu
,
A.L.
,
Martinez
,
R.
,
Marti
,
R.
and
García-Arumí
,
E.
(
2009
)
Quantification of mitochondrial DNA copy number: pre-analytical factors
.
Mitochondrion
,
9
,
242
246
.

31.

Nacheva
,
E.
,
Mokretar
,
K.
,
Soenmez
,
A.
,
Pittman
,
A.M.
,
Grace
,
C.
,
Valli
,
R.
,
Ejaz
,
A.
,
Vattathil
,
S.
,
Maserati
,
E.
,
Houlden
,
H.
 et al. (
2017
)
DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation
.
PLoS One
,
12
.

32.

Mengel-From
,
J.
,
Thinggaard
,
M.
,
Dalgård
,
C.
,
Kyvik
,
K.O.
,
Christensen
,
K.
and
Christiansen
,
L.
(
2014
)
Mitochondrial DNA copy number in peripheral blood cells declines with age and is associated with general health among elderly
.
Hum. Genet.
,
133
,
1149
1159
.

33.

Barrientos
,
A.
,
Casademont
,
J.
,
Cardellach
,
F.
,
Estivill
,
X.
,
Urbano-Marquez
,
A.
and
Nunes
,
V.
(
1997
)
Reduced steady-state levels of mitochondrial RNA and increased mitochondrial DNA amount in human brain with aging
.
Mol. Brain Res.
,
52
,
284
289
.

34.

Wong
,
J.
,
McLennan
,
S.V.
,
Molyneaux
,
L.
,
Min
,
D.
,
Twigg
,
S.M.
and
Yue
,
D.K.
(
2009
)
Mitochondrial DNA content in peripheral blood monocytes: relationship with age of diabetes onsetand diabetic complications
.
Diabetologia
,
52
,
1953
1961
.

35.

Xu
,
F.X.
,
Zhou
,
X.
,
Shen
,
F.
,
Pang
,
R.
and
Liu
,
S.M.
(
2012
)
Decreased peripheral blood mitochondrial DNA content is related to HbA1c, fasting plasma glucose level and age of onset in type 2 diabetes mellitus
.
Diabet. Med.
,
29
,
e47
e54
.

36.

Xia
,
P.
,
An
,
H.-X.
,
Dang
,
C.-X.
,
Radpour
,
R.
,
Kohler
,
C.
,
Fokas
,
E.
,
Engenhart-Cabillic
,
R.
,
Holzgreve
,
W.
and
Zhong
,
X.Y.
(
2009
)
Decreased mitochondrial DNA content in blood samples of patients with stage I breast cancer
.
BMC Cancer
,
9
,
454
.

37.

Raum
,
E.
,
Rothenbacher
,
D.
,
Löw
,
M.
,
Stegmaier
,
C.
,
Ziegler
,
H.
and
Brenner
,
H.
(
2007
)
Changes of cardiovascular risk factors and their implications in subsequent birth cohorts of older adults in Germany: a life course approach
.
Eur. J. Prev. Cardiol.
,
14
,
809
814
.

38.

Kim
,
K.
,
Jang
,
I.
,
Kim
,
M.
,
Choi
,
J.
,
Kim
,
M.-S.
,
Lee
,
B.
and
Jung
,
I.
(
2021
)
3DIV update for 2021: a comprehensive resource of 3D genome and 3D cancer genome
.
Nucleic Acids Res.
,
49
,
D38
D46
.

39.

Guo
,
W.
,
Jiang
,
L.
,
Bhasin
,
S.
,
Khan
,
S.M.
and
Swerdlow
,
R.H.
(
2009
)
DNA extraction procedures meaningfully influence qPCR-based mtDNA copy number determination
.
Mitochondrion
,
9
,
261
265
.

40.

Haycock
,
P.C.
,
Burgess
,
S.
,
Nounu
,
A.
,
Zheng
,
J.
,
Okoli
,
G.N.
,
Bowden
,
J.
,
Wade
,
K.H.
,
Timpson
,
N.J.
,
Evans
,
D.M.
,
Willeit
,
P.
 et al. (
2017
)
Association between telomere length and risk of cancer and non-neoplastic diseases a mendelian randomization study
.
JAMA Oncol.
,
3
,
636
651
.

41.

Mangino
,
M.
,
Hwang
,
S.-J.
,
Spector
,
T.D.
,
Hunt
,
S.C.
,
Kimura
,
M.
,
Fitzpatrick
,
A.L.
,
Christiansen
,
L.
,
Petersen
,
I.
,
Elbers
,
C.C.
,
Harris
,
T.
 et al. (
2012
)
Genome-wide meta-analysis points to CTC1 and ZNF676 as genes regulating telomere homeostasis in humans
.
Hum. Mol. Genet.
,
21
,
5385
5394
.

42.

Codd
,
V.
,
Nelson
,
C.P.
,
Albrecht
,
E.
,
Mangino
,
M.
,
Deelen
,
J.
,
Buxton
,
J.L.
,
Hottenga
,
J.J.
,
Fischer
,
K.
,
Esko
,
T.
,
Surakka
,
I.
 et al. (
2013
)
Identification of seven loci affecting mean telomere length and their association with disease
.
Nat. Genet.
,
45
,
422
427
.

43.

Campa
,
D.
,
Matarazzi
,
M.
,
Greenhalf
,
W.
,
Bijlsma
,
M.
,
Saum
,
K.-U.
,
Pasquali
,
C.
,
van
 
Laarhoven
,
H.
,
Szentesi
,
A.
,
Federici
,
F.
,
Vodicka
,
P.
 et al. (
2019
)
Genetic determinants of telomere length and risk of pancreatic cancer: a PANDoRA study
.
Int. J. Cancer
,
144
,
1275
1283
.

44.

Iles
,
M.M.
,
Bishop
,
D.T.
,
Taylor
,
J.C.
,
Hayward
,
N.K.
,
Brossard
,
M.
,
Cust
,
A.E.
,
Dunning
,
A.M.
,
Lee
,
J.E.
,
Moses
,
E.K.
,
Akslen
,
L.A.
 et al. (
2014
)
The effect on melanoma risk of genes previously associated with telomere length
.
JNCI J. Natl. Cancer Inst.
,
106
.

45.

Machiela
,
M.J.
,
Lan
,
Q.
,
Slager
,
S.L.
,
Vermeulen
,
R.C.H.
,
Teras
,
L.R.
,
Camp
,
N.J.
,
Cerhan
,
J.R.
,
Spinelli
,
J.J.
,
Wang
,
S.S.
,
Nieters
,
A.
 et al. (
2016
)
Genetically predicted longer telomere length is associated with increased risk of B-cell lymphoma subtypes
.
Hum. Mol. Genet.
,
25
,
1663
1676
.

46.

Zhang
,
Y.D.
,
Hurson
,
A.N.
,
Zhang
,
H.
,
Choudhury
,
P.P.
,
Easton
,
D.F.
,
Milne
,
R.L.
,
Simard
,
J.
,
Hall
,
P.
,
Michailidou
,
K.
,
Dennis
,
J.
 et al. (
2020
)
Assessment of polygenic architecture and risk prediction based on common variants across fourteen cancers
.
Nat. Commun.
,
11
,
3353
.

47.

Purcell
,
S.M.
,
Wray
,
N.R.
,
Stone
,
J.L.
,
Visscher
,
P.M.
,
O’Donovan
,
M.C.
,
Sullivan
,
P.F.
,
Ruderfer
,
D.M.
,
McQuillin
,
A.
,
Morris
,
D.W.
,
Oĝdushlaine
,
C.T.
 et al. (
2009
)
Common polygenic variation contributes to risk of schizophrenia and bipolar disorder
.
Nature
,
460
,
748
752
.

48.

Machiela
,
M.J.
,
Chen
,
C.Y.C.
,
Chen
,
C.Y.C.
,
Chanock
,
S.J.
,
Hunter
,
D.J.
and
Kraft
,
P.
(
2011
)
Evaluation of polygenic risk scores for predicting breast and prostate cancer risk
.
Genet. Epidemiol.
,
35
,
506
514
.

49.

Stahl
,
E.A.
,
Wegmann
,
D.
,
Trynka
,
G.
,
Gutierrez-Achury
,
J.
,
Do
,
R.
,
Voight
,
B.F.
,
Kraft
,
P.
,
Chen
,
R.
,
Kallberg
,
H.J.
,
Kurreeman
,
F.A.S.
 et al. (
2012
)
Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis
.
Nat. Genet.
,
44
,
483
489
.

50.

Choi
,
S.W.
,
Mak
,
T.S.-H.
and
O’Reilly
,
P.F.
(
2020
)
Tutorial: a guide to performing polygenic risk score analyses
.
Nat. Protoc.
,
15
,
2759
2772
.

51.

Pfaffl
,
M.W.
(
2001
)
A new mathematical model for relative quantification in real-time RT-PCR
.
Nucleic Acids Res.
,
29
,
e45
.

52.

Stocker
,
H.
,
Perna
,
L.
,
Weigl
,
K.
,
Möllers
,
T.
,
Schöttker
,
B.
,
Thomsen
,
H.
,
Holleczek
,
B.
,
Rujescu
,
D.
and
Brenner
,
H.
(
2021
)
Correction: prediction of clinical diagnosis of Alzheimer’s disease, vascular, mixed, and all-cause dementia by a polygenic risk score and APOE status in a community-based cohort prospectively followed over 17 years
.
Mol. Psychiatry
,
1
11
.

53.

Anderson
,
C.A.
,
Pettersson
,
F.H.
,
Clarke
,
G.M.
,
Cardon
,
L.R.
,
Morris
,
A.P.
and
Zondervan
,
K.T.
(
2010
)
Data quality control in genetic case-control association studies
.
Nat. Protoc.
,
5
,
1564
1573
.

54.

McCarthy
,
S.
,
Das
,
S.
,
Kretzschmar
,
W.
,
Delaneau
,
O.
,
Wood
,
A.R.
,
Teumer
,
A.
,
Kang
,
H.M.
,
Fuchsberger
,
C.
,
Danecek
,
P.
,
Sharp
,
K.
 et al. (
2016
)
A reference panel of 64,976 haplotypes for genotype imputation
.
Nat. Genet.
,
48
,
1279
1283
.

55.

Das
,
S.
,
Forer
,
L.
,
Schönherr
,
S.
,
Sidore
,
C.
,
Locke
,
A.E.
,
Kwong
,
A.
,
Vrieze
,
S.I.
,
Chew
,
E.Y.
,
Levy
,
S.
,
McGue
,
M.
 et al. (
2016
)
Next-generation genotype imputation service and methods
.
Nat. Genet.
,
48
,
1284
1287
.

56.

Watanabe
,
K.
,
Taskesen
,
E.
,
van
 
Bochoven
,
A.
and
Posthuma
,
D.
(
2017
)
(2017) functional mapping and annotation of genetic associations with FUMA
.
Nat. Commun.
,
81
(
8
),
1
11
.

57.

Dayem Ullah
,
A.Z.
,
Oscanoa
,
J.
,
Wang
,
J.
,
Nagano
,
A.
,
Lemoine
,
N.R.
and
Chelala
,
C.
(
2018
)
SNPnexus: assessing the functional relevance of genetic variation to facilitate the promise of precision medicine
.
Nucleic Acids Res.
,
46
,
W109
W113
.

58.

Boyle
,
A.P.
,
Hong
,
E.L.
,
Hariharan
,
M.
,
Cheng
,
Y.
,
Schaub
,
M.A.
,
Kasowski
,
M.
,
Karczewski
,
K.J.
,
Park
,
J.
,
Hitz
,
B.C.
,
Weng
,
S.
 et al. (
2012
)
Annotation of functional variation in personal genomes using RegulomeDB
.
Genome Res.
,
22
,
1790
1797
.

59.

GTEx Consortium
and
Gte
,
T.
(
2013
)
The genotype-tissue expression (GTEx) project
.
Nat. Genet.
,
45
,
580
585
.

60.

Yang
,
D.
,
Jang
,
I.
,
Choi
,
J.
,
Kim
,
M.S.
,
Lee
,
A.J.
,
Kim
,
H.
,
Eom
,
J.
,
Kim
,
D.
,
Jung
,
I.
and
Lee
,
B.
(
2018
)
3DIV: a 3D-genome interaction viewer and database
.
Nucleic Acids Res.
,
46
,
D52
D57
.

61.

Jung
,
I.
,
Schmitt
,
A.
,
Diao
,
Y.
,
Lee
,
A.J.
,
Liu
,
T.
,
Yang
,
D.
,
Tan
,
C.
,
Eom
,
J.
,
Chan
,
M.
,
Chee
,
S.
 et al. (
2019
)
A compendium of promoter-centered long-range chromatin interactions in the human genome
.
Nat. Genet.
,
51
,
1442
1449
.

Author notes

Manuel Gentiluomo and Matteo Giaccherini authors share the first position

Federico Canzian and Daniele Campa authors share the last position

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)