-
PDF
- Split View
-
Views
-
Cite
Cite
Daniel W. Belsky, Noah Snyder-Mackler, Invited Commentary: Integrating Genomics and Social Epidemiology—Analysis of Late-Life Low Socioeconomic Status and the Conserved Transcriptional Response to Adversity, American Journal of Epidemiology, Volume 186, Issue 5, 1 September 2017, Pages 510–513, https://doi.org/10.1093/aje/kwx145
- Share Icon Share
Abstract
Socially disadvantaged children face increased morbidity and mortality as they age. Understanding mechanisms through which social disadvantage becomes biologically embedded and devising measurements that can track this embedding are critical priorities for research to address social gradients in health. The analysis by Levine et al. (Am J Epidemiol. 2017;186(5):503–509) of genome-wide gene expression in a subsample of US Health and Retirement Study participants suggests important new directions for the field. Specifically, findings suggest promise in integrating gene expression data into population studies and provide further evidence for the conserved transcriptional response to adversity as a marker of biological embedding of social disadvantage. The study also highlights methodological issues related to the analysis of gene expression data and social gradients in health and a need to examine the conserved transcriptional response to adversity alongside other proposed measurements of biological embedding. Looking to the future, advances in genome science are opening new opportunities for sociogenomic epidemiology.
Socially disadvantaged children face excess morbidity and mortality as they age (1, 2). This burden arises at least partly from social exposures “getting under the skin” to influence pathways of disease and disability (3, 4). Advances in genome science offer new opportunities to understand and quantify mechanisms of this biological embedding. New assays allow direct measurement of genomic changes that are hypothesized to mediate effects of social disadvantage on health and aging, such as shortening of telomeres (5–7) and differential methylation of DNA (8–10). In this issue of the Journal, Levine et al. (11) focus their attention on another aspect of gene regulation: gene expression. They report preliminary evidence that older adults with lower socioeconomic status (SES) exhibit a pattern of increased inflammatory and decreased antiviral peripheral-blood gene expression known as “conserved transcriptional response to adversity” (CTRA). Their study highlights challenges and opportunities for the integration of genomics into social epidemiology.
The CTRA is characterized by differential expression of genes related to immune function (12). Broadly, these changes have been interpreted to indicate a down-regulation of antiviral defenses and a corresponding up-regulation of proinflammatory wound-healing and antibacterial defenses (13, 14). CTRA-related changes, in theory, facilitate an adaptive response to acute threats that could result in injury. However, chronic CTRA activation could leave an organism vulnerable to viral infection and subject to collateral damage from systemic inflammation, ultimately leading to disease, disability, and death.
Levine et al. used microarrays to measure genome-wide gene expression in peripheral blood RNA from 120 participants in the US Health and Retirement Study (HRS). They linked gene expression data with longitudinal data on HRS participants’ education and household income. They classified individuals as having low SES if they had lived in households meeting US Census poverty criteria in at least 1 of 5 previous HRS assessments or if they lacked a high school education and their household income fell below 200% of the poverty threshold in at least 1 of 5 previous HRS assessments. The authors used their sociogenomic database to test the hypothesis that older adults with low SES would exhibit stronger CTRA-related patterns of gene expression as compared with higher-SES adults.
The authors conducted 2 analyses. In one analysis, they used a traditional, hypothesis-driven approach to test whether low SES was associated with expression levels of 53 CTRA candidate genes. The authors composed a summary score of CTRA gene expression by applying weightings to array-measured expression levels of the candidate genes. Weights of −1 were applied to genes for which the CTRA model hypothesizes down-regulation in response to stress (antiviral-response genes) and weights of +1 were applied to genes for which the CTRA model hypothesizes up-regulation in response to stress (proinflammatory genes). Weighted gene expression values were then summed to form a CTRA gene expression score for each study participant. Analysis showed that HRS participants with low SES had higher CTRA expression scores than did HRS participants with higher SES.
In their second analysis, the authors conducted hypothesis-free genome-wide screening of the roughly 35,000 transcripts assayed for association with SES. They selected transcripts that exhibited an at least 1.2-fold difference in expression between the low-SES group and those with higher SES. Differentially expressed transcripts were localized to genes, although this procedure was not described. The analysis identified 141 unique genes as being differentially expressed in the low- versus higher-SES groups, a little less than 1% of the genes assayed. The authors then conducted bioinformatic analysis of the identified genes. They scanned the DNA sequences “upstream” of transcription start sites (called “promoters”) for putative binding sites of 4 CTRA-related transcription factors. Transcription factors are proteins that bind to a DNA sequence and increase or decrease expression of nearby genes (although regulation of more distal genes has also been observed (15)). The goal of transcription factor analysis was to identify genomic mechanisms that might mediate SES effects on gene expression. The set of 141 SES-associated genes was enriched for binding sites of 3 of the 4 transcription factors implicated in CTRA gene regulation, including the proinflammatory nuclear factor κ-light-chain-enhancer of activated B cells (NF-κB) family of transcription factors. NF-κB is directly regulated by the glucocorticoid receptor (16), a key regulator of the biological response to stress. Thus, findings suggest that stress-induced proinflammatory gene expression is one mechanism linking socioeconomic disadvantage with increased inflammation and related morbidity.
The work of Levine et al. complements a growing literature identifying dysregulated immune function as a key mediator of social gradients in health. Social disadvantage in childhood and adulthood predict increased systemic inflammation and chronic infection (17–20). The present study by Levine et al., combined with others (21, 22), suggests differential regulation of gene expression as a potential mechanism in this social gradient. These human studies are bolstered by experiments conducted with nonhuman primates (23, 24). Researchers, including one of the authors of this commentary, have conducted experiments to manipulate dominance rank, the primate equivalent of social status, in captive rhesus monkeys through a procedure of staged introduction to new social groups. This design rules out confounding by genetic or personal history factors. It also allows for tests of reversibility. Similar to the low-SES HRS participants studied by Levine et al., the low-status monkeys showed increased proinflammatory gene expression and evidence for a mediating role of NF-κB. Critically, just as experimental assignment to low social status induced these changes, experimental reassignment to higher social status reversed them, suggesting potential for interventions with humans to reverse biological embedding of social stress. One difference between human studies and the primate experiments is that the primate experiments identified large fractions of the genome as being differentially expressed across the social gradient (15%–20% compared with the ~1% reported by Levine et al.). This finding suggests that larger-scale human studies or, possibly, studies of natural experiments (25) could uncover further genomic correlates of social-stress exposure.
The sociogenomic analysis by Levine et al. highlights both the promise and the challenges facing the field. Below, we briefly discuss 5 issues related to the integration of genomics in general and the CTRA specifically into social epidemiology.
Drinking from the firehose with a straw
A challenge in analysis of genomic data is the very large number of potential hypotheses. Genomic assays typically comprise tens or even hundreds of thousands of measurements. An established approach is hypothesis-free discovery analysis, essentially testing of all hypotheses possible and correcting for the number of tests (26). The study by Levine et al. illustrates an alternative approach. The authors formulated hypotheses and then used a genomic database to conduct targeted tests. In their analysis of CTRA expression score (what Levine et al. call contrast score), they drew measurements of 53 candidate genes identified in previous studies from their whole-genome database and used expression levels to compose a single measure for use in a single hypothesis test. This method mirrors “genetic risk score” analysis commonly applied to DNA sequence data (27). In their bioinformatics analysis, the authors integrated hypothesis-driven and hypothesis-free approaches, first conducting a hypothesis-free search and then feeding results from that search into a hypothesis-driven analysis of a targeted set of transcription factors. Both methods provide models for future studies of whole-genome databases that lack adequate power for hypothesis-free discovery research.
Cell type heterogeneity: Confounder? Phenotype?
Levine et al. analyzed gene expression levels in samples of whole blood, which included a mixture of different cell types (e.g., T cells, neutrophils, and others). Cell type heterogeneity between samples can confound genomic analysis because different cell types have characteristic methylation and expression profiles (28, 29). To isolate effects on expression within cell types, researchers often include statistical covariates for counts or proportions of different cell types within samples. Levine et al. lacked information on cell counts or proportions. Instead, following established practice (30), they used their expression data to impute this information based on reference data sets. They then included these imputed values as covariates in their analysis. This approach attempts to estimate average differences in expression across the social gradient as if cell type composition of blood samples is the same for everyone.
Cell type heterogeneity is also a plausible phenotype for analysis of biological embedding of social stress. For example, in the primate studies (23, 24), experimentally induced changes in social status caused not only within-cell-population changes in gene expression but also changes in relative proportions of several cell populations. One implication is that SES-related differences in the relative proportions of different cell types could be a phenotype of biological embedding. More consideration of this possibility is warranted. Levine et al. helpfully conducted their analysis with and without statistical adjustment for imputed cell type composition in the blood samples. Estimates were similar in both analyses.
Future genomic analyses of biological embedding must carefully address the issue of cell composition in samples. Ignoring heterogeneity of cell types across samples risks mistaking genomic phenotypes that reflect differences in cell populations for something else. On the other hand, applying methods to exclude cell type heterogeneity across blood samples without first examining it risks throwing out the proverbial baby with the bathwater.
Chronic disease status: a collider in studies of biological embedding of social stress?
Collider bias arises when statistical adjustment is made for a variable that is caused by both exposure and outcome (see Greenland et al. (31)). In the study by Levine et al., social disadvantage (the exposure) and the CTRA (the outcome) are conceptualized as causes of chronic disease. This conceptual model produces the causal diagram E→C←D, where E is the exposure (here, low SES), C is the collider (here, chronic disease), and D is the outcome (here, the CTRA). In this model, the association between SES and the CTRA will be biased by adjustment for chronic disease status (32). As with cell type heterogeneity, Levine et al. helpfully report results with and without adjustment for chronic disease status. Similarity between the 2 sets of results indicates that bias is small in this case. Future studies should nevertheless consider this issue when selecting covariates in models testing associations between social exposures and genomic phenotypes.
Translating the CTRA into a tool for epidemiology
The gene expression CTRA score reported in the article by Levine et al. represents a potentially reproducible index of the CTRA that can be implemented in other studies. The 53-gene set that composes the score aims to track the multiple changes in immune function that are observed in socially stressed humans and animals and which, in turn, predict increased risk for infection and chronic disease (33). As a multimarker measurement of a multidimensional, cumulative process of biological embedding, the CTRA is substantially aligned with allostatic load (34) and the emerging literature on so-called biological aging (35–37). Studies are needed that can compare these different approaches to quantifying biological embedding. Among the questions to answer are: “Do the different measures measure the same thing?” “Can they be quantified with comparable acuity at different stages in the life course?” and “To what extent can they be modified by either social or clinical intervention?”
Charting a course for sociogenomic epidemiology
Looking beyond the CTRA, large-scale discovery studies are needed to better evaluate the range of genes and gene networks that are differentially regulated across social gradients. In designing these studies, investigators will face tradeoffs between more precise operationalizations of social exposures and sample size. An example from genome-wide association studies may be instructive. In genome-wide association studies to identify genetic correlates of individual differences in human behavior, favoring sample size has paid off. Genome-wide association studies of educational attainment, a very coarse behavioral phenotype, conducted in very large samples has generated replicable discoveries that predict a wide range of human characteristics and outcomes, ranging from early cognitive development to intergenerational socioeconomic mobility (38–40). In parallel, because non-DNA sequence elements of the genome can be changed by the social environment, there may also be a role for carefully conducted experiments. Studies that randomly assign social exposures and conduct repeated assays of genomic phenotypes before and after may offer a complementary path to discovery.
ACKNOWLEDGMENTS
Author affiliations: Department of Population Health Sciences, Department of Medicine, School of Medicine, Duke University, Durham, North Carolina (Daniel W. Belsky); Division of Geriatrics, Department of Medicine, School of Medicine, Duke University, Durham, North Carolina (Daniel W. Belsky); Center for the Study of Aging and Human Development, Duke University, Durham, North Carolina (Daniel W. Belsky, Noah Snyder-Mackler); Duke Population Research Institute, Durham, North Carolina (Daniel W. Belsky); and Department of Evolutionary Anthropology, Duke University, Durham, North Carolina (Noah Snyder-Mackler).
D.W.B. acknowledges support from the National Institute on Aging (grants R01AG032282, P30AG028716, and P30AG034424) and from the Jacobs Foundation. N.S.-M. is supported by the National Institute on Aging (grant K99AG051764).
Conflict of interest: none declared.
REFERENCES
Author notes
Abbreviations: CTRA, conserved transcriptional response to adversity; HRS, Health and Retirement Study; NF-κB, nuclear factor κ-light-chain-enhancer of activated B cells; SES, socioeconomic status.