-
PDF
- Split View
-
Views
-
Cite
Cite
Valentina Casa, Valeria Runfola, Stefano Micheloni, Arif Aziz, F. Jeffrey Dilworth, Davide Gabellini, Polycomb repressive complex 1 provides a molecular explanation for repeat copy number dependency in FSHD muscular dystrophy, Human Molecular Genetics, Volume 26, Issue 4, 15 February 2017, Pages 753–767, https://doi.org/10.1093/hmg/ddw426
- Share Icon Share
Abstract
Repression of repetitive elements is crucial to preserve genome integrity and has been traditionally ascribed to constitutive heterochromatin pathways. FacioScapuloHumeral Muscular Dystrophy (FSHD), one of the most common myopathies, is characterized by a complex interplay of genetic and epigenetic events. The main FSHD form is linked to a reduced copy number of the D4Z4 macrosatellite repeat on 4q35, causing loss of silencing and aberrant expression of the D4Z4-embedded DUX4 gene leading to disease. By an unknown mechanism, D4Z4 copy-number correlates with FSHD phenotype. Here we show that the DUX4 proximal promoter (DUX4p) is sufficient to nucleate the enrichment of both constitutive and facultative heterochromatin components and to mediate a copy-number dependent gene silencing. We found that both the CpG/GC dense DNA content and the repetitive nature of DUX4p arrays are important for their repressive ability. We showed that DUX4p mediates a copy number-dependent Polycomb Repressive Complex 1 (PRC1) recruitment, which is responsible for the copy-number dependent gene repression. Overall, we directly link genetic and epigenetic defects in FSHD by proposing a novel molecular explanation for the copy number-dependency in FSHD pathogenesis, and offer insight into the molecular functions of repeats in chromatin regulation.
Introduction
Despite repetitive elements representing the largest portion of the human genome (1–4), they are poorly characterized because of their intrinsic complexity. Nevertheless, increasing evidence suggests that DNA repeats play key roles in the regulation of gene expression at multiple levels (5–9). For example, mammalian genomic repeats have been shown to harbor the greatest proportion of histone modifications (1,10,11) and to provide binding sites for regulatory factors modulating the activity of tissue-specific promoters or enhancers (6,12–18).
Tandem repeats represent more than 20% of the human genome (1) and account for a significant source of genomic variation, since their copy-number is usually highly polymorphic among individuals (1,2,19,20). They can span from a few base pairs in microsatellites to several kilobases in macrosatellites (2,21,22), and they play crucial roles in key cellular processes. With centromeric and telomeric satellites, tandem repeats provide an essential contribution to genome integrity (23–27) and chromosome segregation (28). In addition, tandem repeats play a role in heterochromatin formation (27), X-chromosome inactivation (XCI) (14,29,30), but also in the evolution and generation of genetic diversity (31–33). Finally, with their intrinsic instability, they can contribute to chromosome rearrangements and diseases (34–36). Accordingly, to preserve genome integrity (11,37,38), tandem repeats can be targeted by multiple repressive pathways leading to the formation of constitutive or facultative heterochromatin (39). This unveiled an unexpected cross-talk between two apparently distinct mechanisms of repression, which can coordinate to regulate fundamental processes (40).
FacioScapuloHumeral muscular Dystrophy (FSHD) (MIM 158900) is one of the most prevalent neuromuscular disorders (41). In its major form, accounting for 95% of cases, the disease is linked to deletions reducing the copy-number of a macrosatellite repeat called D4Z4, located in 4q35 (42). While healthy subjects display 11-100 D4Z4 units, FSHD patients usually present with only 1-10 D4Z4 units (43–45). Interestingly, the residual D4Z4 copy number at the deleted 4q35 allele correlates with disease onset and progression. FSHD patients carrying 1-3 residual D4Z4 units tend to develop the disease earlier and with a more severe outcome than patients displaying 9-10 units (46–53).
Several epigenetic alterations have been described on the affected FSHD locus, including DNA hypomethylation (54) and reduction of heterochromatic histone marks H3K9me3 and H3K27me3 (tri-methylation of histone H3 Lysine 9 and Lysine 27) (55–58). Ultimately, this leads to aberrant expression at 4q35 (58–63) and in particular to the toxic overexpression of the D4Z4-embedded DUX4 retrogene, which is considered the major gene responsible for the disease (reviewed in (64,65)). Nevertheless, the molecular bases for the FSHD copy-number dependency are poorly known.
Given the importance of DUX4 in disease, we developed a reporter system to monitor the influence of repeat copy-number on gene expression driven by the DUX4 promoter. We found that high GC content and repeat copy number lead to efficient gene repression. We analyzed the main epigenetic pathways playing a role in FSHD and found that the Polycomb Repressive Complex 1 (PRC1) plays a major role in the repeat copy-number dependent regulation of gene expression. Our results provide a novel molecular explanation for the peculiar link between genetic and epigenetic alterations taking place in one of the most important neuromuscular disease.
Results
DUX4p mediates a copy-number dependent repression
The molecular characterisation of FSHD muscular dystrophy has revealed a highly complex interaction between genetic and epigenetic components (65). One of the prominent aspects of this disease is that patients carrying few copies of D4Z4 (1–3) tend to develop a more severe form of the disease and with earlier onset than patients with 8-10 residual copies (46–53,66). Consistent with this, the aberrant expression of DUX4, the main FSHD gene, inversely correlates with repeat copy number (67). In order to explore the molecular mechanism at the basis of the copy-number dependency in DUX4 gene regulation, we generated three different reporter gene constructs carrying an EGFP reporter gene under the control of the DUX4 proximal promoter (DUX4p) and displaying a copy number typically found either in healthy subjects (12), or in mild (5) and severe (1) FSHD forms (Supplementary Material, Fig. S1A–F). We focused on a 180 bp D4Z4 fragment containing the regulatory elements of the DUX4 proximal promoter (61) (Supplementary Material, Fig. S1G). To evaluate the effect of DUX4p copy-number on gene expression, the terminal DUX4p unit was prolonged at the 3’ end until the ATG of DUX4 ORF (68), driving the expression of the EGFP reporter (Supplementary Material, Fig. S1G).
Initially, we tested the constructs in transient transfection (Supplementary Material, Fig. S2, ). To this aim, we transfected equimolar quantities of the DUX4p constructs together with a plasmid constitutively expressing ECFP, as transfection control. The median EGFP fluorescence intensity (Supplementary Material, Fig. S2D) and the percentages of EGFP positive cells (Supplementary Material, Fig. S2E) were analysed in the total population of ECFP positive cells through flow cytometry and expressed as relative to 1-DUX4p. Intriguingly, we observed a copy-number dependent effect of DUX4p repeats on both EGFP intensity and percentage of positive cells (Supplementary Material, Fig. S2). In particular, cells transfected with the 12-DUX4p construct showed a significantly reduced relative EGFP intensity, when compared to both 1-DUX4p and 5-DUX4p constructs. Interestingly, the EGFP signal from the 5-DUX4p repeat construct stands on an intermediate level, significantly higher and significantly lower than those of 12-DUX4p and 1-DUX4p copies, respectively (Supplementary Material, Fig. S2D). Similarly, the number of DUX4p units has an effect on the relative number of EGFP expressing cells, since a significantly smaller percentage of EGFP positive cells were present in the 12-DUX4p and 5-DUX4p populations when compared to 1-DUX4p. Again, the effect increases as a function of the number of DUX4p units (Supplementary Material, Fig. S2E). Overall, an increase in the DUX4p copy-number correlates with a decreased reporter gene signal.

DUX4p repeats mediate copy-number dependent repression in isogenic muscle cells. A–C. Top, schematic view of constructs used for Flp-FRT-mediated recombination and carrying 1 (A), 5 (B), or 12 (C) DUX4p repeats and an EGFP reporter gene under the control of the DUX4 promoter. Upon site-specific integration in C2C12-frt muscle cells, the isogenic cells differing for DUX4p copy number (A–C) were tested for EGFP expression. For relative quantification of EGFP protein levels among samples, flow cytometry analyses of EGFP Median Fluorescence Intensity (MFI) (D) and of percentage of EGFP+ events (E) are shown. Examples of histogram analyses with EGFP (x axis) versus counts (y axis) are reported (A–C, bottom), compared to a non-fluorescent (wild type) cell line. For relative quantification of EGFP at the RNA level, RT-qPCR analyses of EGFP (F) and Gapdh negative control (G) are presented over β-Actin. Results are expressed over 1 DUX4p (D–G). The mean of the signals obtained from six (D,E) or four (F,G) independent experiments is shown. The error bars represent SEM of independent biological replicates. Asterisks indicate statistical significance (p value) as evaluated by One way ANOVA Bonferroni’s multiple comparison test. ****=P < 0.0001; **= P < 0.01.
A high CpG/GC density and the repetitive nature contribute to the repressive function of DUX4p

Contribution of CpG/GC-rich and repetitive nature to copy-number dependent repression. A-C. Schematic view of constructs named: 12 DUX4p (A), CGI B3gnt6 (B), 11-, 5-, 1-HbG (C). The constructs carry: 11 copies of DUX4p repeats (overall, 110 CpG and 72% GC, A); a non-repetitive sequence containing a CpG island (CGI) from B3gnt6 locus (110 CpG and 60% GC, B); 11, 5 or 1 copies of HbG repeats (each with 0 CpG and 38% GC, C). Additionally, all constructs retain a DUX4p unit prolonged to include DUX4 ATG sequence, and an EGFP reporter gene (A-C). Upon site-specific integration of the constructs in C2C12-frt muscle cells, the isogenic cells (A–C) were tested for EGFP expression. For relative quantification of EGFP protein levels among samples, flow cytometry analyses of EGFP Median Fluorescence Intensity (MFI) (D) and of percentage of EGFP+ events (E) are shown. For relative quantification of EGFP at the RNA level, RT-qPCR analyses of EGFP (F) and Gapdh negative control (G) are presented over β-Actin. Results are expressed over 1 DUX4p (D-G). The mean of the signals obtained from five (D,E) or four (F,G) independent experiments is shown. The error bars represent SEM of independent biological replicates. Asterisks indicate statistical significance (p value) as evaluated by One way ANOVA Bonferroni’s multiple comparison test. ****= P < 0.0001; **= P < 0.01.
Significantly, flow cytometry analyses expressed as fold over 1-DUX4p showed that 12-DUX4p retains the strongest ability to silence EGFP expression (Fig. 2), since the 12-DUX4p cells display the least intense median EGFP signal (Fig. 2D). Moreover, 12-DUX4p showed a significantly lower percentage of EGFP positive cells compared to CGI B3gnt6 and 1-5-11 HbG (Fig. 2E). Similar results were observed at RNA level, since EGFP transcript levels were lower in 12-DUX4p than in all other cells lines (Fig. 2F), while mRNA levels of the Gapdh control remained constant in all the samples (Fig. 2G). Overall, these results indicate that a high CpG/GC DNA content combined with the presence of repeats results in the strongest silencing effect, suggesting that both features are important in the repression of DUX4.
DNA methylation and H3K9me3 constitutive heterochromatin marks are not copy-number dependent

DNA methylation and histone H3K9me3 enrichment does not correlate with DUX4p copy number. A-C. Schematic view of isogenic muscle cells carrying 1 (A), 5 (B), or 12 (C) DUX4p repeats and an EGFP reporter gene, used for MeDIP and ChIP analyses. D-I. Isogenic muscle cells differing for DUX4p copy-number were analyzed by MeDIP for DNA methylation (D-F) and by ChIP for H3K9me3 (G-I), on DUX4p (D, G), on negative (E, H) or positive (F, I) control regions, respectively, Zeocine gene (E, H) and Major Satellite repeats (F, I). ChIP-qPCR data are shown relative to total H3. The mean of the signals obtained from three independent experiments is shown (D-F, G-I). The error bars represent SEM of independent biological replicates.
DUX4p drives copy-number dependent PRC1 recruitment

DUX4p arrays recruit Bmi1 (PRC1) and Suz12 (PRC2), but only Bmi1 shows copy-number dependent recruitment. A–C. Schematic view of isogenic muscle cells carrying 1 (A), 5 (B), or 12 (C) DUX4p repeats and an EGFP reporter gene, used for ChIP assays. D-I. Isogenic muscle cells differing for DUX4p copy-number were analyzed by ChIP for the recruitment of Bmi1 (D–F) and Suz12 (G–I), on DUX4p (D, G), and on genomic regions used for negative (E,H) or positive (F,I) controls, respectively: a non-target genomic region located 5 Kb upstream of the TSS of the Sfrp1 gene (87) (E,H) and the bona fide PcG target gene Hoxc13 (F, I). All ChIP-qPCR data are shown relative to the bona fide PcG target gene Evx1 (79). The mean of the signals obtained from three independent experiments is shown (D–F, G–I). The error bars represent SEM of independent biological replicates. Asterisk indicates statistical significance (p value) as evaluated by One way ANOVA analysis. *= P < 0.05.
PRC1 is required for DUX4p-induced gene silencing
In order to test whether Polycomb is directly involved in DUX4p-mediated silencing, we performed RNAi-depletion of PRC1 and PRC2 core components Bmi1 (siBmi1) and Suz12 (siSuz12) in muscle cells carrying 1-5-12 DUX4p copies (Supplementary Material, Fig. S3A–C). RT-qPCR and immunoblotting analyses showed that we obtained efficient and comparable knockdowns for both Bmi1 and Suz12 in all cells (Supplementary Material, Fig. S3D–K). Moreover, ChIP-qPCR experiments indicated that siRNA-treatments were effective in depleting Bmi1 and Suz12 from DUX4p (Supplementary Material, Fig. S4).

Bmi1 (PRC1) but not Suz12 (PRC2) is required for copy-number dependent gene repression. A–J. Muscle cells carrying 1, 5, or 12 DUX4p repeats and an EGFP reporter, knockdown for the PRC1 core subunit Bmi1 (siBmi, A–E), for the PRC2 core subunit Suz12 (siSuz12, F–J) or expressing a non-silencing control (Ctrl, A–J) were tested for EGFP expression. For relative quantification of EGFP protein levels among samples, flow cytometry results (A–B, F–G) of EGFP Median Fluorescence Intensity (MFI) (A,F) and of percentage of EGFP+ events (B,G) are shown. For relative quantification of EGFP at the RNA level, RT-qPCR analyses of EGFP (C, H), Gapdh negative control (D, I) and Hoxc13 positive control (E,J) are presented over β-Actin. Results are expressed over Ctrls (A–J). The mean of the signals obtained from five (A,B, F–J) or four (C–E) independent experiments is shown. The error bars represent SEM of independent biological replicates. Asterisks indicate statistical significance (p value) as evaluated by One way ANOVA Bonferroni’s multiple comparison test. ****= P < 0.0001; ***= P < 0.001; **= P < 0.01; ns = non significant.
Although Suz12 enrichment on DUX4p is not copy number-dependent (Fig. 4G), we investigated whether it plays a role in DUX4p-mediated gene silencing. To this aim, we analyzed EGFP expression upon treatment with siRNAs targeting Suz12. Differently to RNAi of the PRC1 subunit Bmi1, the EGFP reporter expression remained unchanged upon treatment with siRNAs against Suz12 compared with non-silencing controls (Fig. 5F–J). This was verified at the protein level through flow cytometry analyses of EGFP median intensity (Fig. 5F) and percentage of positive cells (Fig. 5G), but also at the transcriptional level by RT-qPCR quantification of EGFP mRNA (Fig. 5H). Importantly, while Gapdh transcript levels remain unchanged (Fig. 5I), Suz12 knockdown caused a good de-repression of the bona fide PcG target Hoxc13 (Fig. 5J), indicating that the Suz12 knockdown was effective in causing specific transcriptional de-repression of PRC2 target genes. Collectively, these results indicate that DUX4p-mediated copy-number dependent repression of gene expression requires PRC1 but it is independent from PRC2.
PRC1 is required to maintain repression of the endogenous DUX4 gene

PRC1 is required to maintain the endogenous DUX4 gene repressed in human primary muscle cells. A. RT-qPCR of BMI1 expression in human primary myotubes from healthy donor, transfected with siRNA targeting BMI1 (siBMI1) or a non-targeting control (siNT). B. RT-qPCR showing selective DUX4 de-repression upon BMI1 depletion (siBMI1), in the same cells in (A). C. RT-qPCR of DYSTROPHIN (DYS) expression, showing no significant alteration of muscle differentiation, in the same cells in (A). BMI1, DUX4 and DYS levels are relative to GAPDH, used as normalizator. Error bars represent SEM of four independent experiments. Asterisks indicate statistical significance (P value) as evaluated by Student’s t-test. ***= P < 0.001; * P < 0.05; ns = non significant.
Discussion
Among genetic disorders, FSHD is one of the most peculiar for the major role played in its pathogenesis by epigenetic components. While a number of epigenetic alterations of the disease locus have been described (64), the molecular bases of D4Z4 copy-number dependency and incomplete penetrance of FSHD phenotype are not fully understood (65). Our results show how the size of DUX4 promoter tandem repeat arrays directly correlates with recruitment of and repression by the Polycomb Repression Complex 1, thus contributing to the comprehension of the molecular mechanisms responsible of the copy-number dependency in FSHD (65).
For practical reasons, we focused on a 180 bp region inside D4Z4 containing the DUX4 proximal promoter and displaying a CpG/GC density comparable to that of the entire D4Z4 repeat. We generated isogenic reporter muscle cell lines with DUX4p arrays of 12, 5 and 1 units, which would respectively approximate the status of the FSHD locus in healthy, mildly and severely affected patients. Remarkably, while with the 5 copies array we obtained only a partial repression, with the 12 copies array we obtained an almost complete shutdown of DUX4 promoter reporter expression, which is reminiscent of the threshold effect observed in DUX4 expression and FSHD pathogenesis (65,67).
D4Z4 is highly GC and CpG rich and it can be repeated in as many as 100 copies at each 4q35 allele. Since in mammals both features can harbor repressive machineries (88–91), it was important to address if the GC/CpG-dense or the repetitive nature of D4Z4 play a role in its ability to regulate gene expression. To dissect this, we generated reporter cell lines carrying either a high GC/CpG density, but in a single copy sequence, or a high number of repeats, but in a low GC/CpG context. By comparing gene expression in these reporters to 12 DUX4p array, we determined that the repetitive and GC/CpG-dense natures of DUX4p arrays are both important for establishing efficient repression of gene expression.
To determine the molecular mechanism responsible for the copy number-dependent repression by DUX4p, we analyzed the most relevant epigenetic repressive traits described on the FSHD locus (54,55,57,58). Despite the fact that DNA methylation, H3K9me3, PRC1 and PRC2 complexes are all enriched on DUX4p reporters at levels comparable to those of the endogenous FSHD locus in human cells, only PRC1 emerged as a good candidate to explain copy-number dependency in gene repression. In fact DNA methylation, H3K9me3 and the PRC2 core component Suz12 showed similar enrichments irrespectively of DUX4p copy number. On the contrary, the enrichment of the PRC1 core component Bmi1 is a function of DUX4p copy-number. Notably, we demonstrated that PRC1 is required for the silencing of the endogenous DUX4 in muscle cells of healthy subjects. Moreover, PRC1 is also required to maintain copy-number dependent gene silencing, since Bmi1 depletion caused a de-repression of the reporter that was significantly stronger with high DUX4p copy number. On the contrary, we found that repression by DUX4p arrays is PRC2-independent.
That PRC2 is enriched on but does not play a significant role in repression mediated by DUX4p arrays was somewhat unexpected. Nevertheless, at the FSHD locus, we previously showed that PRC2 and the associated H3K27me3 histone mark peak at the TSS of the lncRNA DBE-T to repress its transcription (58). Hence, it is tempting to speculate that PRC1 and PRC2 could play multiple and diverse roles at the FSHD locus.
In Drosophila, PcG proteins are recruited to genetic elements called Polycomb Response Elements (PREs). Much less is known about PcG recruitment to target genes in vertebrates (92). Cardinal features of Drosophila PREs are their ability to recruit PcG complexes and mediate PcG-dependent repression when inserted to ectopic sites. Accordingly, we found that DUX4p is capable to mediate robust ectopic recruitment of PRC1, PRC2 and mediate PcG-dependent gene silencing. Based on this, it is tempting to speculate that D4Z4 could be the first PRE involved in a human genetic disease. Besides our previous work on D4Z4 (58), there are several reports of Polycomb binding to repetitive elements (3,10,29,30,93–97) suggesting an important role of Polycomb proteins in genome stability through the stabilization of the repetitive genome (95). Moreover, recent data suggest that Polycomb might use repeats to achieve efficient chromatin compaction. Indeed, super-resolution images indicate that the chromatin packing density of Polycomb-repressed domains increases with domain length (98). Therefore, it has been suggested that a concatenation of PcG domains would lead to a more compact chromatin packaging and a more efficient repression of gene expression (98). Importantly, this function depends on PRC1 (98). This recent work (98) and the observation that chromatin compaction inversely correlates with FSHD disease (67,99) suggest that a copy number-dependent DUX4 promoter chromatin packing mediated by PRC1 contributes to DUX4 silencing in healthy subjects.
Recently, a correlation between DNA methylation levels and cumulative D4Z4 repeat copy number (including D4Z4 sequences from different human chromosomes) has been described (85). However, while an inverse correlation between D4Z4 copy number and DUX4 expression has been reported in FSHD1 (67), DNA methylation does not correlate with DUX4 expression (67), suggesting that additional epigenetic differences could account for DUX4 expression levels relative to D4Z4 copy number. Nevertheless, it is well established that DNA methylation of D4Z4 plays a protective role, since low levels of DNA methylation are required for the manifestation of FSHD symptoms independently from D4Z4 copy number (67,83,85).
To conclude, FSHD muscular dystrophy represents an extraordinary example of how copy-number variations in genomic elements can affect chromatin structure and gene expression (64). Our work, by exploring the role of repetitive elements in recruiting different heterochromatin regulators to repress gene expression, is relevant for understanding the contribution of repeats to organism complexity. Indeed, besides FSHD, there are a number of copy-number variations associated with diseases. In particular, similarly to FSHD, repeat expansion in neurological disorders leads to epigenetic changes and a correlation between repeat copy-number and disease phenotype has been documented (100–102). Since the molecular mechanism underlying these phenomena has remained elusive (101), the present work could shed new light on these disorders, especially when involving GC-rich regions like in Fragile X syndrome (102). In this context, the possibility to modulate PcG binding to repeats could be explored in the search of new epigenetic therapies.
Materials and Methods
Plasmid design and construction
For the generation of the 12-DUX4p-pMA construct, a sequence was designed in order to contain: an array of 11-DUX4p units with a size of 180 bp per unit, flanked by AvrII restriction sites; a terminal DUX4p unit prolonged to include DUX4 promoter until the ATG of DUX4 ORF; the EGFP gene sequence; the SV40 polyA site. This construct was synthesized by GeneArt (Life Technologies) according to the above plasmid design, in pMA backbone.
By digestion of the 12-DUX4p-pMA construct with AvrII (NEB), 11-DUX4p copies were excised, thus producing the 1-DUX4p-pMA construct.
By PCR amplification from 12-DUX4p-pMA construct, with primers DUX4p-F and DUX4p-R (in Supplementary Material, Table S1), a fragment containing 4-DUX4p units was isolated and subcloned into 1-DUX4p-pMA construct exploiting AvrII restriction site, thus generating the 5-DUX4p-pMA construct.
For the generation of FRT-constructs, the pcDNA5/FRT (Flp-In System, Life Technologies) backbone was modified as follows: the CMV promoter was excised by digestion with MfeI- NheI enzymes (NEB); a SV40 polyA site was inserted into the HindIII restriction site of the MCS of pcDNA5/FRT.
For the generation of 12-DUX4p-FRT, 5-DUX4p-FRT and 1-DUX4p-FRT constructs, SacI – NotI restriction fragments containing DUX4p arrays and EGFP reporter were excised respectively from 12-DUX4p-pMA, 5-DUX4p-pMA and 1-DUX4p-pMA constructs and subcloned into the modified backbone of pcDNA5/FRT.
For the generation of the CGI-B3gnt6-FRT construct, a 2 Kb fragment containing 110 CpGs flanked by SacI restriction sites was PCR amplified from C2C12 genomic (g) DNA with primers CGI-B3gnt6-F and CGI-B3gnt6-R (Supplementary Material, Table S1). The isolated fragment was subcloned into the 1-DUX4p-FRT plasmid in a 3’-5’ orientation, thus generating the CGI-B3gnt6-FRT construct carrying 110 non-repetitive CpGs, the DUX4 promoter unit and the EGFP reporter gene cassette.
For the generation of the 11-HbG-FRT construct, an array of 11 HbG copies of 180 pb per unit, flanked by 5’-SacI- 3'-AvrII restriction sites, was synthesized by GeneArt (Life Technologies) and subcloned into the 1-DUX4p-FRT construct, thus generating the 11-HbG-FRT construct carrying 11 HbG copies with 0 CpGs, the DUX4 promoter unit and the EGFP reporter gene cassette.
By digestion with BamHI (NEB) of the 11-HbG-FRT construct, 6-HbG copies were excised, thus producing the 5-HbG-FRT construct.
By digestion with HindIII (NEB) of the 11-HbG-FRT construct, 10-HbG copies were excised, thus producing the 1-HbG-FRT construct.
For screening and propagation purposes, Top10F’ (Life technologies) competent E.coli cells were employed. Bacteria were transformed by heat shock (42 °C – 1 min) with 1 ng of purified plasmids or 1-3 μl of ligation products. Cells were grown in LB medium plus ampicillin (100 μg/ml; Sigma) or kanamycin (50 μg/ml; Sigma) at 37 °C in a shaking incubator and plated in LB Agar + ampicillin (100 μg/ml) or + kanamycin (50 μg/ml) plates. When transformed with pGEM-T (Promega), cells were plated in LB + Amp plates supplemented with 20 μl of 50 mg/ml X-Gal (Sigma) to allow blue/white colony screening.
Mammalian cell lines culture
HEK293T cells were obtained by ATCC. The C2C12-frt cell line was derived using the Flp-In System (Life Technologies), where clones were screened for random integration of a single FRT site into the genome as previously described (103). Cells were maintained in DMEM-HIGH (Dulbecco’s Modified Eagle’s Medium, High Glucose with Sodium Pyruvate and L-Glutamine; EuroClone) supplemented with 10% FBS (Fetal Bovine Serum; EuroClone) and 1% Penicillin/Streptomycin (100 U/ml final concentration; EuroClone). Zeocine (100 µg/ml final concentration, Life Tecnologies) was added to the C2C12-frt medium.
For cell transfection, Lipofectamine LTX (Life Technologies) was used. For transient assays, HEK293T cells were transfected with equimolar quantities of either 1-DUX4p-pMA, or 5-DUX4p-pMA or 12-DUX4p-pMA, and co-transfected with pECFP-C1 (Clontech) in a ratio 5:1, where pMA constructs where 5 times more concentrated than pECFP-C1. Cells were collected after 48h for gene expression and flow cytometry analyses.
For the generation of 1-DUX4p-C2C12-frt, 5-DUX4p-C2C12-frt, 12-DUX4p-C2C12-frt, CGI-B3gnt6-C2C12-frt, 1-HbG-C2C12-frt, 5-HbG-C2C12-frt and 11-HbG-C2C12-frt stable cell lines, site-specific integration with corresponding constructs was performed via a Flp-FRT-mediated recombination (Flip In System, Life Technologies). C2C12-frt myoblasts were co-transfected with FRT constructs and the Flp recombinase expression vector, pOG44, in a ratio 9:1, where pOG44 vector was 9 times more concentrated than FRT construct. Positively transfected cells were selected with Hygromycin (100 µg/ml final concentration; InVivoGen). The Flp recombinase-mediated integration confers hygromycin resistance and zeocine sensitivity to correctly integrated cells. A minimal of two independent stable cell lines were generated for each construct.
For the generation of 1-DUX4p-C2C12-frt, 5-DUX4p-C2C12-frt and 12-DUX4p-C2C12-frt knock-down for Bmi1 or Suz12, cells were transfected with 25 nM siRNAs against Bmi1 (siRNA Mm_Bmi1_2, SI00167090 and Mm_Bmi1_7, SI02707642 in 1:1 ratio, Qiagen) or non- silencing control (All Stars Negative Control, 1027280, Qiagen) or Suz12 (L-040180-00-0005, ON-TARGETplus SMARTpool, Mouse, Thermo Scientific) or non-silencing control (D-001810-10, ONTARGETplus Non targeting pool, Thermo Scientific) following manufacturer’s instructions. Transfections were performed 72h prior to collection for gene expression and flow cytometry analyses.
For freezing, cell pellets were resuspended in FBS and transferred to cryovials. DMSO (dimethyl sulfoxide; Sigma) was added as cryoprotector at a final concentration of 10%. Cryovials were frozen into isopropanol-filled cryoboxes, stored at −80 °C overnight and, then, long-term stored in liquid nitrogen.
Culture and transfection of human primary muscle cells
All procedures involving human samples were approved by Fondazione San Raffaele del Monte Tabor Ethical Committee.
Human primary myoblasts were obtained from the University of Massachusetts Medical School Senator Paul D. Wellstone Muscular Dystrophy Cooperative Research Center for FSHD (http://www.umassmed.edu/wellstone; date last accessed December 30, 2016) in Worcester, MA, USA.
Human primary myoblasts (07Ubic) were derived from the biceps of a healthy donor. They were cultured in a humidified atmosphere at 37 °C with 5% O2 and 5% CO2.
Myoblasts were seeded in 0.1% gelatin-coated dishes and cultured in Ham’s F-10 medium (Nutrient Mixture F10 Ham, with L-glutamine and sodium bicarbonate, Sigma) supplemented with 20% FBS (Fetal Bovine Serum; EuroClone), 0.5% chicken embryo extract, 1.2 mM CaCl2 (Sigma) and 1% penicillin/streptomycin (100 U/ml final concentration, EuroClone). When cells reached confluence of 90%, differentiation was induced by replacing growth medium with 4:1 D-MEM:199 Medium (Dulbecco’s Modified Eagle’s Medium, High Glucose with Sodium Pyruvate and L-Glutamine, EuroClone; Medium 199, Gibco), supplemented with 2% horse serum (Euroclone), 1% ITS (Insulin, transferrin and sodium selenite solution, Sigma) and 1% penicillin/streptomycin (Euroclone) for 4 days.
For siRNA transfection, Lipofectamine 3000 in Opti-MEM (Thermo Fisher Scientific) was used, following manufacturer’s instructions. siRNA for BMI1 and the non-silencing control were purchased from Thermo Fisher Scientific (SMARTpool: ON-TARGETplus Human BMI1 siRNA L-005230-01; non-targeting control D-001810-10) and used at a final concentration of 25nM. 48h after transfection, differentiation was induced, and after 4 days myotubes were collected.
Flow cytometry
For flow cytometry analysis of fluorescence intensity in HEK293T, cells transiently co-transfected with ECGP and either 1-DUX4p-pMA, 5-DUX4p-pMA, or 12-DUX4p-pMA constructs were collected with trypsinization followed by centrifugation 48h after transfection.
For flow cytometry analysis of fluorescence intensity in C2C12-frt cells, 1-DUX4p-C2C12-frt, 5-DUX4p-C2C12-frt, 12-DUX4p-C2C12-frt, CGI-B3gnt6-C2C12-frt, 1-HbG-C2C12-frt, 5-HbG-C2C12-frt or 11-HbG-C2C12-frt muscle cells expressing EGFP were collected with trypsinization followed by centrifugation at different passages from cell line generations.
For flow cytometry analysis of fluorescence intensity upon Bmi1 or Suz12 knockdown, 1-DUX4p-C2C12-frt, 5-DUX4p-C2C12-frt or 12-DUX4p-C2C12-frt myoblasts were collected with trypsinization followed by centrifugation 72h after siRNA transfection.
Aliquot of 0.5–1x106 cells were transferred into assay tubes, rinsed by centrifugation with growth medium and resuspended in 100 µl normal growth medium per assay tube. Analysis on flow cytometers was performed within 1h from trypsinization. LSR Fortessa (Becton Dickinson) was used for acquisition of EGFP, ECFP and morphological parameters, while Accuri C6 (BD Biosciences) for acquisition of EGFP signal and morphological parameters.
Fluorochrome spectral overlap of EGFP and ECFP signals was corrected with set up of automatic compensation with FACSDIVA software (BD Biosciences). Analyses were performed with FCSExpress 4.0 (De Novo), and EGFP intensity and percentage of EGFP positive cells were quantified in the subpopulation of ECFP positive events in the case of HEK293T co-transfected cells, or in the total population of living cells (singlet events) in the case of stable cell lines expressing EGFP.
RNA extraction, RT-PCR, and RT-qPCR analysis
Reporter gene expression and gene down-regulation upon siRNA treatment were evaluated by RT-qPCR analysis.
Total RNA from cultured cells was extracted with Trizol, treated with DNaseI and purified using PureLink RNA Mini Kit (Ambion).
Total RNA from human primary muscle cells was extracted with Lysis buffer supplemented with 0.01% β-mercaptoethanol and purified with RNA spin columns (PureLink RNA Mini kit, Ambion), including a DNAseI treatment step, following the manufacturer's instructions.
cDNA (using up to 1 µg of RNA) was synthesized using SuperScript III First-Strand Synthesis Super-Mix (Life Technologies). For DUX4 expression analyses in human primary cells, 2 µg of RNA were retrotranscribed using SuperScript III First-Strand Synthesis System for RT-PCR (Life Technologies). RNA was mixed with oligo(dT) and the RT-PCR reaction was performed following the manufacturer's protocol.
qPCRs were performed with SYBR GreenER qPCR SuperMix Universal (Life Technologies) using CFX96 or CFX384 Real-time System (Biorad). Validated primers are listed in Supplementary Material, Table S1. Expression analyses were performed using the ΔΔCT method (ΔCT method for human primary muscle cells). β-actin or GAPDH (for human primary muscle cells) were used as housekeeping genes for sample normalization.
Real-time PCR analysis
Biorad’s CFX96 or CFX384 Real-time System with CFX Manager Software V.1.6 were used. Real-time PCRs were conducted as follows: initial denaturation: 95 °C, 10 min; 40 cycles of denaturation (95 °C, 30 sec), annealing (temperature according to primers, 30 sec), amplification (72 °C, 30 sec). For Major Satellite analysis only, amplification time was 15 sec.
The specificity of the amplified products was monitored by performing melting curves at the end of each amplification reaction. The efficiency of each primer was assessed by performing primer validation qPCRs with serial dilutions of reference material (cDNA for qRT-PCR, gDNA for MeDIP, Input DNA for ChIP).
MeDIP
MeDIP analysis of 1-DUX4p-C2C12-frt, 5-DUX4p-C2C12-frt and 12-DUX4p-C2C12-frt muscle cells was carried out mainly as described in (104).
Briefly, cell pellets were resuspended in 300 μl of TE per 1x106 of cells and lysed with one volume of Lysis buffer (20 mM Tris pH 8.0, 4 mM EDTA, 20 mM NaCl and 1% SDS) containing 20 μl of proteinase K (10mg/ml, Promega). Samples were incubated at 55 °C for at least 5 h.
gDNA was extracted with phenol (Fluka) and chloroform (Fluka) extractions, followed by 10 min centrifugation at maximum speed at RT. Supernatant was precipitated using 2 volumes of 100% ethanol and 75 mM of Sodium Acetate, and DNA was resuspended in TE containing 20 μg/ml of RNAse A (Sigma) for 30 min. Then DNA was fragmented (200–400 bp) per 5 cycles 30 sec ON 30 second OFF at low intensity with Bioruptor sonication device (Diagenode). DNA was denatured for 10 min in boiling water and immediately cool down on ice for 10 min.
500-1000 ng of sonicated gDNA was precipitated using 5 μg of α-5mC Clone 33D3 (Diagenode) or normal mouse IgG (015-000-003, Jackson ImmunoResearch) in IP buffer (10 mM Na-Phospate pH 7.0, 140 mM NaCl, 0.005% Triton x-100) overnight at 4ºC with overhead incubation. Next day, 50 μl of Protein G Dynabeads (Life Technologies), previously washed with PBS-BSA 0.1%, were added to the mix and incubated for 2 h at 4ºC with overhead mixing. Sonicanted gDNA/antibody/beads complex were washed three times with IP buffer.
Immunoprecipitated DNA was extracted in proteinase K buffer (50 mM Tris pH 8.0, 10 mM EDTA and 0.5% SDS) with 7 μl of proteinase K (10 mg/ml, Promega) followed by phenol chloroform extraction. DNA was precipitated using 400 mM NaCl, glycogen (1 μl) and 2 volumes of 100% ethanol. Then DNA was resuspended in 60 μl of TE. 100% of the initial amount of DNA was used as Input.
DNA was analyzed by qPCR with SYBR GreenER qPCR SuperMix Universal (Life technologies) using Biorad’s CFX96 or CFX384 Real-time System with an amount of template equivalent to 1 µl of the original elution. For Major Satellite analysis only, the template was further diluted 1:1000. Validated primers are listed in Supplementary Material, Table S1. Analyses were performed using the ΔCT method, normalizing MeDIP-qPCR data for input DNA (reported as Relative enrichment to Input in the figures).
Chromatin immunoprecipitation
ChIP-qPCR was carried out mainly as described in (58) on 1-DUX4p-C2C12-frt, 5-DUX4p-C2C12-frt and 12-DUX4p-C2C12-frt muscle cells for ChIP evaluation of histone mark and protein enrichment, and on 1-DUX4p-C2C12-frt, 5-DUX4p-C2C12-frt and 12-DUX4p-C2C12-frt muscle cells treated with siBmi1 or siSuz12 for evaluation of protein downregulation on target sites.
Briefly, cells were washed once in PBS and fixed for 10 min in 1% formaldehyde in PBS (from a 37.5% formaldehyde/10% methanol stock, Sigma). After formaldehyde quenching with Glycine (final concentration 125 mM) for 5 min, cells were washed with PBS, harvested by scraping and pelleted at 1350 g for 5 min at 4 °C.
The pellet was lysed in a solution containing 50 mM Hepes-KOH pH 7.5, 140 mM NaCl, 1 mM EDTA, 10% glycerol, 0.5% NP40 and 0.25% Triton X100 for 10 min in ice. Nuclei were pelleted at 1350 g for 5 min at 4 °C and washed from detergents in a solution containing 10 mM Tris-HCl pH 8.0, 200 mM NaCl, 1 mM EDTA and 0.5 mM EGTA with gentle swirl for 10 min. Next, samples were centrifuged and the nuclei pellet was lysed in a solution with 10 mM Tris-HCl pH 8.0, 100 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 0.1% Na-Deoxycholate and 0.5% N-laurylsarcosine. Chromatin was sheared by sonication using Bioruptor (Diagenode), typically 15 min (high intensity 30 sec ON 30 sec OFF) or as much as needed to reach an optimal size of the fragments between 200 bp and 700 bp. Triton X-100 was added to lysates at a final concentration of 1%, and a clarification step of 10 min at 16,360 g at 4 °C followed.
100 μg of chromatin were used for each immunoprecipitation (10 μg for ChIP to histones and histone modifications). Incubations were carried out at 4 °C overnight with 10 μg (5 μg for ChIP to histones and histone modifications) of the following antibodies: α-Bmi1 (ab85688, Abcam), α-Suz12 (ab12073, Abcam), α-H3K9me3 (Ab8898, Abcam), α-H3 (ab1791, Abcam) and whole molecule rabbit IgG (011-000-003, Jackson ImmunoResearch). Immunoprecipitation with 50 µl of Protein G Dynabeads (Life Technologies), previously washed with BSA 0.1% in PBS, was carried out for 3 h at 4 °C. Before starting the washes, 5% of the total ChIP volume was taken from the control IgG supernatant as Input fraction.
Immunoprecipitated chromatin was washed extensively with a solution containing 50 mM Hepes-KOH pH 7.6, 500 mM LiCl, 1 mM EDTA, 1% NP125 40 and 0.7% Na-Deoxycholate, and protein–DNA cross-links were reverted by heating at 65 °C overnight in TE buffer 2% SDS.
DNA was purified with QIAquick PCR Purification Kit (Qiagen) and eluted in 50 µl TE. qPCRs were performed with SYBR GreenER qPCR SuperMix Universal (Life Technologies), in CFX96 or CFX384 Real-time System (Biorad) with an amount of template equivalent to 1 µl of the original elution. For Major Satellite analysis only, the template was further diluted 1:1000. Validated primers are listed in Supplementary Material, Table S1. Analyses were performed using the ΔCT method, normalizing ChIP-qPCR data for input chromatin. For histone modification antibodies, the enrichment is expressed as percentage of total histone (reported in the figures as: Relative enrichment to H3). For non-histone antibodies, ChIP-qPCR data are shown relative to the bona fide PcG target gene Evx1 (reported in the figures as: Relative enrichment to Input – fold over Evx1), as described (79). ΔΔCT method was used for analyses versus non-silencing control in ChIP of RNAi samples.
SDS-PAGE and immunoblotting
Protein down-regulation was evaluated by immunoblotting with mouse α-Bmi1 (ab85688, Abcam) or α-Suz12 (ab12073, Abcam), using α-Vinculin (V9131, Sigma) as loading control. To this aim, SDS-polyacrylamide gels were prepared following the recipes described by Sambrook and Russel in Molecular cloning: a laboratory manual (105) for 10% or 12% gels.
Electrophoresis was performed at 100-120V at RT in 1X Tris-Glycine SDS (TGS) running buffer pH 8.3 [25mM Trizma (Sigma), 192 mM Glycine (Sigma), 0.1% SDS (National Diagnostics)] prepared in milliQ water.
For immunoblotting, transfers were performed using the iBlot dry blotting system (Life technologies), following manufacturer recommended conditions (Life technologies) with P3 program.
For immunohybridization, filters were saturated in 5% non-fat milk (Sigma) TBS-Tween 0.1% solution (Tris Buffered Saline: 10 mM TrisHCl pH 7.4, 140 mM NaCl, Tween-20; National Diagnostics) for 1h at RT or overnight at 4 °C. Incubation with the primary antibody was performed in 5% non-fat milk TBS-Tween 0.1% solution for 2h at RT or overnight at 4 °C. Filters were then washed 3 times (10 min each) with TBS-Tween 0.1% at RT. HRP-conjugated secondary antibodies α-mouse (715-035-150 Jackson ImmunoResearch) or α-rabbit (711-035-152, Jackson ImmunoResearch) were incubated 5% non-fat milk TBS-Tween 0.1% solution for 1-2 h at RT. Filters were washed again as described.
For detection, filters were incubated for 5 min with horseradish peroxidase (HRP) chemiluminescent substrate (SuperSignal West Pico or Dura Chemiluminescent Substrate; Thermo scientific).
Statistical analyses
All statistical analyses were performed using GraphPad Prism version 5.0a (GraphPad Software, San Diego, USA). The type of statistical test, P value, the number of independent experiments, mean and standard error of the mean are provided for each data set in the corresponding figure legends.
Supplementary Material
Supplementary Material are available at HMG online.
Acknowledgements
We thank the Flow cytometry Resource and Advanced Cytometry Technical Applications Laboratory. This work is a partial fulfillment of V.C. PhD in Neuroscience, Università Vita-Salute San Raffaele, Milan, Italy. D.G. is a Dulbecco Telethon Institute Senior Scientist.
Conflict of Interest statement. None declared.
Funding
This work was supported by the Italian Telethon Foundation [TCR13001 to D.G.]; the ERA-Net for Research on Rare Diseases [E-Rare-2 to D.G., E-Rare-2 through Canadian Institutes of Health Research and Muscular Dystrophy Canada to F.J.D.]; the Italian Epigenomics Flagship Project [to D.G.]; the Italian Ministry of Health [to D.G.]; and the FSHD Global Research Foundation [to D.G.]. Funding bodies had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript should be declared. Funding to pay the Open Access publication charges for this article was provided by Fondazione Telethon.