-
PDF
- Split View
-
Views
-
Cite
Cite
Yanfeng Zhang, Xinrui Li, Andrew Gibson, Jeffrey Edberg, Robert P Kimberly, Devin M Absher, Skewed allelic expression on X chromosome associated with aberrant expression of XIST on systemic lupus erythematosus lymphocytes, Human Molecular Genetics, Volume 29, Issue 15, 1 August 2020, Pages 2523–2534, https://doi.org/10.1093/hmg/ddaa131
- Share Icon Share
Abstract
A common feature of autoimmune diseases, including systemic lupus erythematosus (SLE), is an increased prevalence in women. However, the molecular basis for sex disparity in SLE remains poorly understood. To examine the role of X-linked transcription in SLE adaptive immune cells, we performed RNA-seq in T cell and B cell subsets from either healthy donors or patients with SLE. Analyses of allelic expression (AE) profiles identified a pattern of increased allelic imbalance across the entire X chromosome in SLE lymphocytes. X-linked genes exhibiting AE in SLE had an extensive overlap with genes known to escape X chromosome inactivation (XCI). XIST RNA was overexpressed in SLE patients. Differential XIST expression correlated with AE profiles more positively at X-linked genes than the genome-wide background. Analysis of three independent RNA-seq data verified the XIST-associated skewed AE on X chromosome in SLE. Integrative analyses of DNA methylation profiles showed an increased variability of DNA methylation levels at these AE-related X-linked genes. In cultured lymphoblastic cells, knockdown of XIST specifically altered allelic imbalance patterns between X chromosomes. Our study provides genetic evidence that upregulation of XIST accompanied with more skewed allelic expression on X chromosome is associated with the pathogenesis of SLE and may provide mechanistic insights into the increased incidence of SLE in females.
Introduction
X chromosome inactivation (XCI), occurring specifically in female mammals, represents a sex-specific phenomenon that provides equalized gene expression between the sexes and dosage compensation. One of the X chromosomes is randomly silenced in normal female cells through a process involving long noncoding RNA (lncRNA), nucleic acid binding factors and chromatin regulators (1,2). Once silenced, most genes on the inactive X chromosome (Xi) remain silent through subsequent somatic cell divisions, with few exceptions at genes that can escape XCI (eXCI) to some degree in somatic cells.
Due to the random choice of Xi in each cell, females have varying XCI ratios, defined as the proportions of cells expressing alleles from the maternal or the paternal X chromosome. In the general female population, XCI ratios have a normal distribution, with an average of ∼50:50. Skewed XCI is a bias of X-inactivation ratios towards cells overwhelmingly expressing one allele from two X chromosomes, e.g. a skewed ratio of >80:20 (3). Skewed XCI can occur in both normal females and those with X-linked diseases and can result from either a primary nonrandom inactivation occurring early in development or a secondary, acquired nonrandom inactivation in somatic tissues (4). More recently, skewed XCI has been reported in association with several autoimmune disorders (5,6) and cancers (7,8). Thanks to high-throughput sequencing techniques, determining skewed XCI is more readily accessible in various ways. For example, the presence of a heterozygous polymorphism on the two X chromosomes enables allelic expression (AE) mapping from deep RNA-sequencing (RNA-seq) data and has been a direct approach to examine allelic imbalance (AI) and XCI (9,10). Studies focusing on the epigenome have also uncovered the differences of DNA methylation and chromatin modification profiles on active and inactive X chromosomes (11,12).
X-inactive-specific transcript (XIST), a lncRNA that enables most, if not all, X-linked gene silencing by recruiting multiple proteins to the Xi (13), plays an essential role in the initiation and maintenance of XCI in female cells. Due to its role in XCI, XIST RNA is also one of the genes with a strong sex-specific expression pattern (14). This makes XIST lncRNA itself a strong candidate gene for sex-biased diseases.
Systemic lupus erythematosus (SLE) is an autoimmune disease characterized by the dysfunction of immune cells and the production of pathogenic autoantibodies. Like many other autoimmune disorders, there is a strong female bias in the incidence of SLE (15). However, the molecular basis for the sex disparity in SLE remains poorly understood, although both hormonal and genetic factors have been proposed (16–20). Recently, the role of XIST lncRNA itself in pathogenesis and sex bias of autoimmune disorders was explored. For example, using cellular imaging, Wang et al. reported different XIST RNA localization patterns but no alteration of XIST RNA levels, in B cells from pediatric patients with SLE (21). Syrett et al. revealed that dynamic maintenance of XCI is dependent on XIST RNA localization during murine B cell development (22). To pursue the role of XIST and XCI in the sex disparity observed in autoimmune diseases, we have conducted deep RNA-seq in lymphocytes and performed AE analyses in both CD4+ T and CD19+ B cells isolated from female adult SLE patients and healthy controls. Seventy-two RNA-seq datasets from three independent studies in SLE patients were included for validation. Our study reveals an overwhelming allele-specific expression pattern on X chromosome on SLE lymphocytes, which is positively correlated with the upregulation of XIST RNA in SLE lymphocytes. Functional studies further demonstrate that aberrant expression of XIST is responsible for the skewed AE between two X chromosomes in cultured lymphoblastic cells.
Results
Isolation and RNA-sequencing of T cell subsets
In the present study, 17 female participants, including five SLE patients and 12 controls, were recruited for the isolation of T cell subtypes. All participants are adult with the average age of ~30 years (Supplementary Material, Table S1). In total, 66 cell populations across five types of CD4+ T cells were collected from the 17 individuals (Fig. 1A).

Allelic expression profiles on X chromosome in T cell subtypes from SLE patients. (A) Summary of 66 T cell samples collected from five SLE patients and 12 healthy controls (CNTL). 1 and 0 denote the collected or not collected T cell subtypes from participants, respectively. (B) Distribution of ARs for all heterozygotes identified on SLE and control T cells. Data on autosomes and X chromosome were plotted separately. (C) Plot of the proportion of SLE-AE sites on all chromosomes. The proportion (y-axis) was calculated by counting the number of SLE-AE SNPs divided by all heterozygous SNPs counted on each chromosome. (D) Dot plot showing the ARs of SLE-AE sites on the X chromosome for all informative T cells. (E) Plot of the adjusted ARs (left panel) and the allelic fold changes (right panel) across all T cell subsets in a comparison between control and SLE groups.
We performed RNA-sequencing on these cells. On average, nearly 130 million raw reads with ~90% mapping rate to reference human genome were obtained for each sample (Supplementary Material, Table S2). Such ultra-deep coverage data is favorable for AE analysis.
Allelic expression signals on the X chromosome across T cells
In a search for heterozygous loci that would be useful for measuring allelic expression levels, we identified 48 760 heterozygous single nucleotide polymorphisms (SNPs) occurring at least once in both case and control T cells (see Materials and Methods). Using two alignment strategies, the sequence reads were mapped to either the standard reference genome or the modified reference genome (see Materials and Methods); our results showed that the distribution of allelic ratios (ARs) for those heterozygotes was close to 0.5 for both mapping strategies (Supplementary Material, Fig. S1), indicating that there was no significant bias of variant versus reference alleles for either alignment strategy. It is important to note that the number of reads carrying the reference allele in the actual dataset may not necessarily be 50% (23). The presence of inherent biases, such as allele-specific loci due to uniparental imprinting, can tilt the numbers (23,24).
We then compared the global distribution of ARs at 48 760 heterozygous SNPs between SLE and controls. We found that there was an SLE-specific distribution of allelic ratios on the X chromosome but no significant difference on autosomes, in a comparison between the two groups (Fig. 1B). The result also showed a greater skewing pattern towards the two allelic ratio extremes (close to 0 or 1) for X-linked loci in female lupus T cells, relative to the pattern in controls. These results indicate that lupus T cells may exist as skewed XCI.
Next, based on those heterozygous SNPs (n = 48 760) shared between cases and controls, we used a beta-binomial Bayesian model to identify loci with allelic bias in gene expression (see Materials and Methods). We defined heterozygous SNPs showing statistically significant (P < 0.05) allelic RNA expression imbalance in at least one type of T cell within SLE patients but not significant across any T cell subtypes among controls, as SLE-associated AE sites (SLE-AEs). On the basis of this definition, we identified 1718 SLE-AE sites across the genome. By calculating the proportion distributed on chromosomes for these SLE-AEs shown in Figure 1C, we found that there was a higher proportion of SLE-AEs (n = 144) on the X chromosome, relative to the proportions on other chromosomes. Additionally, we ruled out the possibility that overrepresentation of SLE-AE signals on the X chromosome was due to the biased distribution of 48 760 SNPs analyzed in this study. Together, these results suggest a nonrandom excess of SLE-AE signals on the X chromosome in female lupus patients.
To address whether SLE-AE signals are distributed regionally or chromosome widely, we plotted the X chromosome wide view of 144 SLE-AE signals (Fig. 1D). The result showed that these X-linked SLE-AE signals were broadly distributed on the entire X chromosome. We then used two metrics (see Materials and Methods) to quantify the AE difference between case and control T cells. As shown in Figure 1E, a 2- to 3-fold increasing on average of AE signals in all 5 T cell subtypes was detected for 144 SLE-AEs in patients, compared with control counterparts. In addition, we did not find a remarkable difference (more than 1.5-fold change) of AE profiles among T cell subtypes within SLE patients. Based on these results, we infer that changes in the skewed AE pattern on the X chromosome may exist prior to T cell lineage commitment in females with lupus.
Positive correlation between gene-based allelic RNA expression and XIST expression levels in T cells
By comparing allelic fold change between two sets of X-linked genes in normal T cells (see Materials and Methods), we evaluated the effectiveness of the gene-based allelic analysis. As expected, we observed that the set of genes with skewed XCI had significantly higher allelic expression levels than the other set of genes with eXCI (P = 1.2 × 10−7, two-sided Wilcoxon rank sum test, Supplementary Material, Fig. S2A). We also presented the XCI status for five genes with known skewed XCI and three genes with known eXCI (Supplementary Material, Fig. S2B). Interestingly, these five genes showed high variability of XCI skewing in normal T cells, which is concordant with reports in other studies (25,26). Collectively, the method of gene-based allelic analysis is effective to determine the allelic difference of X-linked genes.
Next, we performed the gene-based allelic analysis for 144 SLE-AE sites that map onto 88 genes. Concordant with SNP-based allelic analysis, these 88 X-linked genes had a dramatic increase in allelic RNA expression (Supplementary Material, Fig. S3A and B) but didn’t remarkably change the overall expression levels (Supplementary Material, Fig. S4), in T cells from SLE patients, relative to controls.
Interestingly, of these 88 genes, 64 (72.7%) were enriched (shown in blue, Fig. 2C) and had a significant overlap (P = 2.4 × 10−5, two-sided binomial test) with genes showing escape (or variable escape) from XCI reported by previous studies in normal human tissues and cells, including lymphocytes (25). For example, two genes (MSL3 and ZFX, Fig. 2C) known to eXCI previously identified in lymphoblastoid cells (27) also showed escapee in T cell subsets from healthy individuals.

Association of allelic expression profiles on X chromosome with XIST expression levels across T cell subtypes. (A) Comparison of XIST expression levels in T cell subtypes from SLE patients and controls. (B) Distribution of PCCs between gene-based allelic fold changes and XIST expression levels for the observation and the background. (C) Barplot of the Pearson correlations between gene-based allelic fold changes and XIST expression levels for those X-linked genes harboring SLE-AE signals. The resulting PCCs are also seen as the observation that is used below. Genes blue highlighted are the previously reported genes showing eXCI.
Gene expression analysis identified a sparse but significant upregulation of XIST RNA in SLE T cell populations (P = 0.01, Student’s t-test, Fig. 2A). When stratified by T cell subsets, the XIST RNA was mainly upregulated in activated T cell subsets from SLE patients, relative to the corresponding T cells from controls. By calculating the Pearson’s correlation coefficients (PCCs) between XIST expression levels and gene-wise allelic fold changes for these 88 genes across all T cell subsets, we detected that allelic fold changes on these 88 genes show much higher, although apparently weak (Fig. 2C), positive correlation with the XIST expression levels (P = 0.02, two-sided Wilcoxon rank sum test), relative to the background sampling in same bin size (Fig. 2B). Taken together, our results suggest that elevated expression of XIST is associated with X-linked genes showing SLE-AE profiles among T cell subsets.
RNA-sequencing of naïve B cells
To further determine whether the XCI defect is present in other lymphocytes from patients with SLE, seven female donors (five SLE patients and two healthy controls) were recruited and their naïve B cells were sorted. By RNA-sequencing (RNA-seq) on these cells, we also obtained sequencing data with deep coverage. On average, approximately 73 million raw reads with ~90% mapping rate were acquired for these naïve B cells (Supplementary Material, Table S2).
Allelic expression signals in B lymphocytes
Using the same approaches shown above, we conducted single SNP-based AE analysis in B cells. Analysis of the chromosomal distribution of heterozygous SNPs with allelic ratios showed that the X chromosome, but not autosomes, harbors much denser AE in naïve B cells from SLE patients, compared with controls (Fig. 3A), a pattern that was also seen in lupus T cells.

Allelic expression profiles on SLE naïve B cells. (A) Distribution of ARs for heterozygotes identified on autosomes and X chromosome in Naïve B cells from patients with SLE and controls. (B) Barplot showing the proportional distribution of SLE-AE sites on chromosomes. (C) Dotplot showing ARs for SLE-AE signals on the whole X chromosome. (D and E) Violin plot showing the adjusted ARs and the allelic fold change in a comparison between controls and SLE B naïve cells.
Of 17 248 heterozygous SNPs analyzed in this study, we identified 933 SLE-AE sites in naïve B cells from lupus patients. We found a substantial excess of these SLE-AE signals on the X chromosome (Fig. 3B). These X-linked SLE-AE signals (n = 51) were scattered across the X chromosome (Fig. 3C). Based on the same quantitative analyses, our results confirmed a significant difference with 2- to 3-fold increase in SLE B cells for these X-linked SLE-AE signals, relative to controls (Fig. 3D and E). These results are consistent with findings in the T cell data.
Allelic expression patterns are positively correlated with the XIST expression levels in B cells
By comparing the RNA levels of XIST between SLE and control B cells, we observed higher expression of XIST in four of five SLE B cell samples, relative to controls (Fig. 4A). RT-qPCR analysis in four additional naïve B cells further confirmed the elevated expression of XIST RNA in SLE B cells (P = 0.016, Student’s t-test, Fig. 4B).

Association of allelic expression profiles on X chromosome with XIST expression levels across B cell subtypes. (A) Comparison of XIST expression levels (represented with FPKM values) in naïve B cells from SLE and controls. Arrow points to a SLE sample with the lowest XIST expression level. (B) Quantitative RT-PCR assay shows an upregulation of XIST on SLE naïve B cells. Error bars indicate the standard deviations from three replicates. (C) Heatmap showing gene-based AE profiles for seven B cell samples, where five patients with SLE were grouped and shown in rectangle. Genes overlapped with the genes detected in T cell subtypes (Fig. 2C) are highlighted in blue. The PCCs between gene-based allelic fold changes and XIST expression levels were shown on the top in a range of −1 (negative correlation) to 1 (positive correlation). (D) Distribution of PCCs between gene-based allelic fold changes and XIST expression levels for the observation and the background.
Gene-wise AE analysis aggregating heterozygous SNPs at 36 genes showing SLE-AE signals confirmed a tremendous difference of AE profiles between SLE and controls (Fig. 4C). It should be noted that the AE patterns were variable within patient B cells, particularly for the subject with SLE, which was the SLE patient with the lowest XIST expression levels (arrow shown in Fig. 4A). Similar to T cells, we did not observe large total expression changes (greater than 2-fold change) at these 36 genes in lupus B cells (Supplementary Material, Fig. S5), except for XIST RNA itself. Thus, the allelic ratio changes we observe are not due to elevated overall expression.
Consistent with findings in the study of T cells, we observed a considerably higher positive correlation (P = 1.4 × 10−7, two-sided Kolmogorov–Smirnov test) between XIST RNA levels and allelic fold changes for 36 genes, relative to genome-wide background (Fig. 4D).
To further determine whether these genes showing SLE-AE signals observed in naïve B cells could be related to B cell maturation, we compared the AE profiles of 36 genes between naïve and transitional B cells within patients of SLE. We observed a slight difference of AE signals between naïve and transitional B cells (Supplementary Material, Fig. S6A and B). Meanwhile, the AE pattern also showed a greater positive correlation with the XIST RNA levels, relative to genome-wide background (P = 0.004, two-sided Kolmogorov–Smirnov test, Supplementary Material, Fig. S6C–E). Similarly, among these 36 genes, no individual showed differential overall expression between B cell subsets (Supplementary Material, Fig. S6F). Taken together, our data indicate that XIST expression is directly related to AE on the X chromosome and is a common feature across lymphocyte populations.
Validation and refinement of results from independent RNA-seq data
To further validate the XIST-associated increase of AE on the X chromosome in lupus lymphocytes, we collected and analyzed a series of 83 RNA-seq data from lupus B or T cell types in three independent studies (Supplementary Material, Table S3), where 11 RNA-seq data were excluded due to the presence of genome-wide AE pattern with unknown reason (see an example in Supplementary Material, Fig. S7), resulting in 72 RNA-seq data for AE analysis.
Analyzing 56 published RNA-seq data on two independent cohorts of SLE B cells (n = 38 for one study with ID, GSE118254; n = 18 for the other study with ID, GSE110999, Supplementary Material, Table S3), we observed a significant enhanced expression of XIST in lupus B cells, compared with healthy controls (Supplementary Material, Figs S8A and S9A). In line with our RNA-seq study, we found that there was a trend showing the gradual increasing of XIST RNA levels in B cell differentiation from transitional to naive stage within lupus patients (Supplementary Material, Fig. S8B). Based on AE profiling analyses from six SLE patients across three B cell subsets and three SLE patients across two B cell subtypes, we discovered that there were present of two different AE patterns on X chromosomes during lupus B cell maturation. One pattern was increased AE between two X chromosomes in association with XIST RNA levels (Supplementary Material, Figs S8C and S9B), which was consistent with the results from our RNA-seq data. The other pattern appeared to be XIST expression-independent AE on the X chromosome (Supplementary Material, Figs S8D and S9C).
We further analyzed 16 RNA-seq data in pediatric SLE from 2 T cell types (accession ID, GSE109843, Supplementary Material, Table S3) and observed that XIST expression levels were significantly increased in follicular helper T (Tfh) cells, relative to memory T cells (Tmem) in 8 SLE patients (Supplementary Material, Fig. S10A). Comparative analysis of AE patterns on X chromosome indicated that there was a significant increase of X-linked AE signals in Tfh cells, relative to the pattern in Tmem cells (Supplementary Material, Fig. S10B). In contrast, there was no difference of AE profiles on autosomes between those two T cell populations. We also observed a significantly higher positive correlation (P = 8.9 × 10−8, two-sided Kolmogorov–Smirnov test) between XIST RNA levels and allelic fold changes for X-linked genes, relative to genome-wide background (Supplementary Material, Fig. S10C). Taken together, these results confirm the observation of increased AE on X chromosome in lymphocyte differentiation for lupus patients.
Reduced allelic imbalance on two X chromosomes when blocking XIST function
To further demonstrate the causal relationship between XIST expression and allelic imbalance of X-linked genes, we performed experiments to modulate levels of XIST RNA. We inhibited XIST activity in REH lymphocytic cells via knockdown of XIST expression by short hairpin (sh) RNA. We confirmed that the RNA level of XIST was greatly reduced in shRNA-XIST cells by both RNA-seq (Fig. 5A) and qPCR analyses (Fig. 5B), relative to cells transfected with a scrambled shRNA.

Reduced allelic expression pattern when blocking the XIST function in REH cells. (A) Comparison of expression levels of XIST between scrambled shRNA and shRNA-XIST REH lymphoblastic cells. Two tracks are the read coverage in 20-bp bins across the XIST and its adjacent TSIX locus. (B) Quantitative RT-PCR assay quantifies the expression levels of XIST in shRNA-XIST cells, relative to control cells. (C) Empirical cumulative distribution of allelic fold change between scrambled shRNA and shRNA-XIST for heterozygous SNPs mapped onto autosomes and X chromosome.
A comparative analysis showed that there was a remarkable decline of allelic fold changes (P = 0.004, two-sided Wilcoxon rank sum test) for these X-linked polymorphisms when XIST lncRNA expression was inhibited (Fig. 5C). In contrast, we did not observe a significant change of AE on autosomes when XIST expression was blocked (Fig. 5C). Collectively, our data provide evidence that aberrant expression of XIST could specifically alter the AE pattern between X chromosomes.
Greater variability of DNA methylation levels on X-linked genes exhibiting AE signals in lupus patients
Finally, by integrating the independent dataset from four case-control association studies (n = 287, including 148 cases and 139 controls, Supplementary Material, Table S4) of the DNA methylation profiling in patients with lupus, we explored whether DNA methylation levels were associated with XIST-induced AE patterns on Xs. For those AE-harboring X-linked genes detected in patients with lupus, we did not find a remarkable difference of DNA methylation levels between cases and healthy controls in all four datasets (data not shown). The result is in agreement with the findings that the overall expression levels for those genes did not differ between lupus and control lymphocytes in this study.
However, under disease states like SLE, in which the XCI fidelity has been compromised, X-linked genes may have greater variation in DNA methylation levels among patients. We tested this hypothesis by comparing the variability of DNA methylation levels between SLE and controls by comparing the coefficient of variation (CV) of DNA methylation levels on those X-linked genes exhibiting AE signals detected in lupus T and B cells. We observed increased CVs in SLE patients compared with controls in independent datasets, where DNA methylation profiles were measured in CD4+ T cells and CD19+ B cells (Fig. 6A and B). However, we did not observe increased variability of DNA methylation levels in another two datasets (Supplementary Material, Fig. S11) in which myeloid cells (monocytes and neutrophils) were analyzed. These results together suggest a skewing towards greater variability of DNA methylation levels on those SLE-associated AE-harboring X-linked genes in SLE lymphoid but not myeloid cells, compared to the controls.

Variability of DNA methylation levels on promoters of X-linked genes exhibiting AE. (A and B) Comparison of the coefficient of variation (CV) of DNA methylation levels on promoters of SLE-AE X-linked genes discovered in T cell subsets (A) and naïve B cells (B) between patients with SLE and controls from two independent studies (head shown).
Discussion
Allelic expression analysis is a powerful approach to identify cis-regulatory variations and skewed XCI patterns on the X chromosome. By analyzing AE patterns from deep RNA-sequencing data, we have observed allele-specific expression patterns in lupus T and B cells. We demonstrate that aberrant expression of XIST RNA is associated with and likely drives the observed AE pattern. Given the crucial role of XIST in XCI, and the concentration of lupus-specific AE on the X chromosome, our results likely indicate a dysfunctional XCI process in lupus lymphocytes. Our results are consistent with previous findings in mouse models that demonstrate that altered expression levels of XIST RNA can trigger X-linked gene silencing (28). There have also been studies indicating skewed XCI in several autoimmune disorders, including autoimmune thyroid diseases (6) and scleroderma (5). In this regard, these data contribute to a growing body of literature supporting the importance of altered XCI feature in the pathogenesis of many autoimmune diseases. Given that XCI is limited to female cells, these finding may explain the female predisposition to autoimmune diseases, including SLE, if a defective XCI process increases the propensity to generate autoreactive lymphocytes or reduces the ability of patients to suppress or eliminate autoreactive lymphocytes.
Since not all of X-linked genes showed an AE pattern in SLE lymphocytes, it indicates that skewed XCI is unlikely to have occurred early in embryogenesis, when X-inactivation patterns are first established. Furthermore, the observation that AE was detected across all T cell subsets within SLE, including in naïve T cells, suggests that XCI defects arise at an early hematopoietic stage, potentially through the aberrant expression of XIST.
Two X-linked regions, spanning the PRPS2-TLR7 and IRAK1-MECP2 genes, have been reported to be associated with SLE (29,30). Our result indicated that these two risk loci escaped the XCI in healthy controls but are subjected to XCI in SLE patients. Another interesting X-linked gene, CCDC22, showing a strong skewed XCI in SLE lymphocytes in the present study, is associated with SLE susceptibility (31). The CCDC22 protein functions in the regulation of NF-κB signaling (32), a well-known regulator being aberrantly activated in lupus disease (33,34). Therefore, X-linked SLE-related genes may serve as a selective target in patients as a failure to maintain XCI fidelity that could create a population of dysfunctional lymphocytes carrying aberrant expression of specific risk alleles at these genes.
In the study of Wang et al., they reported that altered distribution of XIST RNA patterns, rather than expression levels, could contribute to silenced X chromosome reactivation in pediatric SLE B cells (21). On the one hand, we observed a comparable result in a SLE patient where reduced AE pattern on X chromosome but no change of XIST expression was detected in memory B cells, relative to naïve B cells (Supplementary Material, Fig. S9C). Thus, our analyses partially support their findings. On the other hand, distinct from their findings, we discovered skewed AE associated with aberrant expression of XIST in lupus patients. Some studies have suggested a pronounced difference of clinical manifestations between pediatric SLE and adult SLE (35–37). For example, childhood-onset SLE has a more severe clinical course than that seen in adults, with a higher prevalence of lupus nephritis, hematologic and immunologic anomalies. Meanwhile, the gender bias towards female is commonly observed in adults with SLE, while the gender ratio shows even in childhood-onset SLE (36). This difference may provide a possible explanation of the difference of our analysis of adult-onset SLE with Wang et al. findings in pediatric SLE.
Yildirim et al. reported that XIST RNA is a potential tumor suppressor of hematologic cancer in mouse model (38). In contrast, Yao et al. found that XIST expression was upregulated in human glioblastoma stem cells and its downregulation suppressed tumor growth (39). These results suggest that XIST not only plays a role in the XCI but also has roles in cell growth and proliferation. Engreitz et al. proposed a model that XIST RNA coats the X chromosome via chromatin conformation interaction (40). This means that aberrant expression of XIST might also alter the DNA topological architecture on the X chromosome in SLE lymphocytes. Thus, we speculated that the up-expression of XIST might have some additional influences to SLE patient cells and may alter the proliferative potential of clonal populations that carry aberrant XIST levels.
One caveat to our study is that expression levels of XIST RNA within female lupus lymphocytes and skewed AE profiles were highly variable. This result implies that some additional factors linked with SLE pathogenesis and gender bias are present. However, the statistical power in the present study is insufficient to link the AE with SLE clinical features. Indeed, besides genetic components, several environmental factors, including infections, endogenous and exogenous hormones (19,20,41), have been proposed to explain the sex difference in SLE. Thus, we anticipate that AE mapping across more hematopoietic cell states and larger samples, as well as systematic analyses by integrating genetic, epigenetic and disease-linked phenotypic data, will yield new insight into the variable XCI and the potential factors (or cofactors) associated with sex bias in autoimmune diseases.
In conclusion, our study provides evidence that XIST-altered AE on X chromosome may be implicated in the pathogenesis of female patients with SLE and may also help explain the significant female predisposition to the autoimmunity.
Materials and Methods
Sorting of lymphocyte subtypes
Peripheral blood mononuclear cells (PBMC) and neutrophils were isolated from blood following fractionation by Ficoll density centrifugation. CD14+ monocytes were isolated from the PBMC using anti-CD14 coated beads (Invitrogen/Thermo Fisher). CD4+ cells were isolated using anti-CD4 coated beads, detached from the beads and stained with a panel of fluorochrome-conjugated antibodies (CD4, CD56, CD45RA, CD45RO, CD25, CD127, CXCR3, CCR4 and CCR6 from BD Pharmingen and eBioscience). Cells were then sorted by FACS analysis to obtain CD56− populations representing naïve (CD45RO−, CD45RA+), memory (CD45RO+, CD45RA−) and Treg (CD25+, CD127−) T cells. Memory T cells were further sorted to obtain Th1 (CXCR3+, CCR4−, CCR6−), Th1Th17 (CXCR3+, CCR4−, CCR6+) and Th17 (CXCR3−, CCR4+, CCR6+) subsets.
CD19+ B cells were isolated from the PBMC using anti-CD19-coated beads. CD19+ cells were detached from the beads and then incubated with anti-CD27-coated beads to separate memory and plasma cells (CD19+, CD27+) from immature, transitional and naïve B cells (CD19+, CD27−). Following detachment from the beads, memory and plasma cells were then stained with fluorochrome-conjugated anti-CD19, anti-CD27 and anti-CD24 and sorted to obtain memory cells (CD19+, CD27+, CD24+) and plasma cells (CD19+, CD27+, CD24−). The mixed population of immature, transitional and naïve cells was further stained with anti-IgD and sorted to isolate immature cells (CD19+, CD27−, IgD−) from the transitional and naïve cells (CD19+, CD27−, IgD+). This latter mixed population was then stained with anti-CD38 and anti-CD24 and sorted to separate transitional cells (CD19+, CD27−, IgD+, CD38hi, CD24hi) from naïve cells (CD19+, CD27−, IgD+, CD38lo, CD24lo). The purity of the cell subsets was ascertained by FACS analysis of a portion of the sorted cells to determine the percentage that fell within the gate initially used to target that population, and typically purity was 91% or greater.
Plasmid constructs
pSIREN-RetroQ scrambled construct (5′-GCAAGCTGCCCGTGCCCTG-3′, as control (42)) was a gift from Dr Gilmore (Boston University, USA). Short hairpin RNAs (shRNAs) for XIST sequences were designed using the shRNA Sequence Designer (Clontech) and were subcloned into the pSIREN-RetroQ retroviral vector (Clontech) according to the manufacturer’s protocol.
Cell culture, RNA isolation and real-time RT-PCR
REH (purchased from ATCC, cat. no. CRL-8286) pre-B lymphoblastic cells were cultured in RPMI-1640 medium, supplemented with 10% FBS (Thermo Fisher Scientific, Waltham, MA, USA), 2 mM L-glutamine and 1% penicillin–streptomycin at 37°C with 5% CO2. Transfections were carried out using Lipofectamine 2000 Transfection Reagent (Thermo Fisher) according to the manufacturer’s instruction. 72 h after transfection, the cells were washed once with PBS. Then, RNA was isolated from cells using TRIzol Reagent (Invitrogen) according to the manufacturer’s protocol. 1 μg total RNA was reverse transcribed using SuperScript III (Thermo Fisher Scientific) and Oligo-dT primer. One-tenth of the RT reaction was used as a template for real-time PCR using PowerUp SYBR Green Master Mix (Thermo Fisher Scientific) on a QuantStudi 6 system. Relative expression was calculated with 2−ΔΔCt using the average value of housekeeping gene GAPDH. Data are presented as mean ± SD of three replicates. All qPCR primers are listed in Supplementary Material, Table S5.
RNA-seq library construction
For RNA-sequencing, indexed libraries were constructed from ~500 ng total RNA using the NEBNext Ultra RNA Library Prep Kit for Illumina with the Poly(A) mRNA Magnetic Isolation Module (NEB). We confirmed the final cDNA library size range by gel electrophoresis on a 1.7% agarose gel. Samples were barcoded during library preparation and sequenced on a HiSeq 2500 or NextSeq (Illumina) instrument as 2 × 100 or 2 × 75 bp paired-end libraries at the HudsonAlpha Institute Sequencing Facility.
RNA-seq data processing
RNA-seq data were analyzed as described elsewhere (43). Briefly, reads were demuxed based on their index sequence at the Genomic Services Laboratory at HudsonAlpha. After removal of adapter sequences, low-quality reads and trimmed reads that were shorter than 20 bp by using cutadapt (v1.3.1), filtered reads were aligned with Tophat2 with default parameters. Mapped reads in BAM format were assembled with Cufflinks (v2.2.1). We used FPKM values for representing the expression levels of genes. We retained genes with mean FPKM ≥1 for fold change analysis.
Identification of allelic expression polymorphic sites
We used a similar approach described in previous study to call genetic variants (44). Briefly, mapped reads in BAM format were first subject to the removal of PCR duplicates and then were realigned and recalibrated, and genetic variants were called in a multiple-sample joint manner implemented in the GATK toolkit (version 3.3). We next filtered out variants as follows: (1) mapping quality score <20, (2) ≥3 SNPs detected within 10 bp distance, (3) variant confidence/quality by depth <2, (4) strand bias score >50, (5) genotype score <15 and (6) read depth <10. Then, we extracted SNPs annotated from dbSNP (Build 142) that were called as heterozygotes for each sample. For a reasonable comparison, those heterozygous SNPs identified at least once in both case and control cells were retained for allelic analysis.
We also applied alternative alignment strategies described in the study of Smith et al. to identify heterozygous SNPs and their allelic ratios (9). Briefly, we built a modified hg19 reference genome by replacing the reference allele with the variant SNP allele annotated in dbSNP 142 using the GATK FastaAlternateReference tool. We remapped all reads to this modified reference genome using the same parameters as used for the standard reference genome. Then, ARs and Bayesian model-derived significance for allelic bias was calculated as above. Finally, we defined those heterozygous polymorphisms with the posterior probability for allelic ratio with P < 0.05 estimated from both the standard alignment and the modified genome alignment as AE sites.
By collapsing heterozygous SNPs mapped to Refseq genes by averaging the allelic fold changes, we calculated the gene-wise allelic fold change. We collected a dataset from Tukiainen et al. study (10), where they identified two sets of X-linked genes being subject to XCI or eXCI across various human normal tissues. By comparing gene-wise allelic fold change between two sets of genes in T cells from healthy controls, we evaluated the effectiveness of this approach.
Correlation analysis
We calculated the Pearson correlations between gene-based allelic fold changes and XIST RNA levels using PCC. The background Pearson correlations were calculated between gene-based allelic fold changes, and XIST RNA levels for all other genes and the same size were randomly selected for histogram.
Public data collection and analysis
We collected RNA-seq data (Accession ID, GSE110999, GSE118254 and GSE109843) from three independent studies in SLE patients of B and T cells (47–49). We downloaded the raw reads data from the Sequence Read Archive (SRA) database. We then used the same method described above to process the RNA-seq data and AE analysis.
The Infinium Human Methylation450 (HM450) microarray dataset used in this study was acquired from the Gene Expression Omnibus (GEO) database (50). We used the processed beta values to represent the DNA methylation levels. We first annotated CpG probes and collected those probes mapped to promoter regions (transcription start sites ±1 kb) of these X-linked genes exhibiting ASE sites. We then calculated the coefficient of variation (CV), a measurement for the relative variability, for each CpG probe within gene promoters in cases and controls for each of five independent studies. For statistical analysis, we employed one-sided Wilcoxon rank sum test to compare the difference of CV distribution between lupus and controls.
Availability of data and materials
The RNA-seq data generated during the current study are available in the GEO database under accession number [GSE152767].
Acknowledgements
We thank the members of the Genomic Services Laboratory at HudsonAlpha for providing sequence data. We thank Dr Le Su, Dr Jun Song and Dr Kenneth Day for critical reading and suggestive discussions. This study was supported by the HudsonAlpha institutional funds. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
Conflict of Interest statement The authors declare that they have no competing interests.
Authors’ contributions
Y.Z. conceived and designed the allelic expression study, performed the experiments, collected and analyzed the data, prepared figures and wrote and revised the manuscript. X.L., A.G., J.E., R.K. and D.A. designed and collected the lymphocytes. D.A. conceived and designed the RNA-seq study involved in lymphocyte subsets and edited the manuscript. All authors read, interpreted and approved the final manuscript.