Abstract

The presence of the methylated nucleobase 5Me dC in CpG islands is a key factor that determines gene silencing. False methylation patterns are responsible for deteriorated cellular development and are a hallmark of many cancers. Today genes can be sequenced for the content of 5Me dC only with the help of the bisulfite reagent, which is based exclusively on chemical reactivity differences established by the additional methyl group. Despite intensive optimization of the bisulfite protocol, the method still has specificity problems. Most importantly ∼95% of the DNA analyte is degraded during the analysis procedure. We discovered that the reagent O-allylhydroxylamine is able to discriminate between dC and 5Me dC. The reagent, in contrast to bisulfite, does not exploit reactivity differences but gives directly different reaction products. The reagent forms a stable mutagenic adduct with dC, which can exist in two states (E versus Z). In case of dC the allylhydroxylamine adduct switches into the E-isomeric form, which generates dC to dT transition mutations that can easily be detected by established methods. Significantly, the 5Me dC-adduct adopts exclusively the Z-isomeric form, which causes the polymerase to stop. O-allylhydroxylamine does allow differentiation between dC and 5Me dC with high accuracy, leading towards a novel and mild chemistry for methylation analysis.

INTRODUCTION

DNA methylation is an epigenetic mechanism for transcriptional regulation ( 1–2 ). The process controls cellular differentiation and is defective in many diseases including cancer ( 3–5 ). DNA methylation involves substitution of the H atom at C5 of the cytosine base (dC) by a methyl group to give 5-methylcytosine ( 5Me dC) ( 6 ). These methylated 5Me dC bases trigger processes that finally lead to gene silencing. Today, the detection of 5Me dC bases in a given DNA sequence is possible with restriction enzymes, which cleave selectively unmethylated DNA ( 7 ), or with antibodies, which recognize methylated DNA ( 8–10 ). The current gold standard, however, is bisulfite sequencing. This reagent selectively deaminates dC to dU but does not affect 5Me dC ( 11 , 12 ). The position of the newly formed dU base is then detected with standard sequencing methods after PCR amplification of the treated DNA material ( 13 ). Bisulfite sequencing is currently in widespread use ( 14–18 ). However, during bisulfite analysis ∼95% of the DNA material is degraded, which limits 5Me dC detection if only a small amount of sample is available ( 19–21 ). This drawback fuels current research to find alternative 5Me dC detection and sequencing methods ( 22 ). Research in this direction has furnished the result that Br + ions add selectively to 5Me dC to give cleavable DNA sites ( 23 ). OsO 4 and reagents derived thereof were found to cis- hydroxylate dC bases with some selectivity ( 24 , 25 ). Furthermore photochemical methods were reported ( 26 , 27 ) that allow some discrimination and finally even compounds were reported that are added during bisulfite sequencing to trap intermediates of the bisulfite reaction ( 28 ). All reported approaches target the reactivity differences between dC and 5Me dC imposed by the additional methyl group. However, as this difference is small, all new procedures are currently unable to distinguish between the dC and 5Me dC efficiently or their general approach interferes with current sequencing techniques.

Here we report a novel concept that does not rely on reactivity differences but in contrast allows detection of 5Me dC directly based on the reaction product. Incubation of dC and 5Me dC with hydroxylamine derivatives gives E - and Z -configured oxime-type adducts ( Z / E - 1 and Z / E - 2 , see Scheme 1 ) ( 29 , 30 ). The E - and Z -configured products are in equilibrium via the amino isomeric forms A- 1 and A- 2 . Each of these isomers ( Z , A and E ) features different H-bonding characteristics. Whereas the amino tautomers (A) base pair like dC, the E -isomers are expected to code like dT. The Z- isomers likely cause stalling of polymerases during replication ( Scheme 1 ) because they interfere with normal base pairing. NMR studies have shown that the pyrimidine oximes like 1 and 2 are present mainly in the imino tautomer and that the Z -isomers are the most stable forms ( 31 ). In the case of 5Me dC, we expected that the isomer E - 2 would be disfavored because of the steric strain imposed between the methyl group and the O -allyl chain. The 5Me dC-adduct 2 should therefore prefer Z-configuration ( Z - 2 ) or exist in the amino tautomeric form (A- 2 ). As such, we expected that this product would prefer base pairing with dG. In the case of the dC adduct 1 steric strain with the hydrogen atom at C5 is much lower and we hoped that this adduct would in contrast adopt at least to some extent the E –configuration, which would allow base pairing with dATP. The resulting dC to dT transition mutation we thought could be exploited as the desired signal that allows dC to be distinguished from 5Me dC. As our approach is based on mutagenicity, it is sequence independent and can also detect methylated cytosines that are not in a CpG context as found, for instance, in plant genomes. This is an advantage over restriction enzyme based methods that are limited by the specificity of the enzyme.

Scheme 1.

Reaction of dC and 5Me dC with hydroxylamine derivatives. Depiction of the tautomeric equilibrium and the interconversion process between the Z -isomers and the E -isomers. 1: dC- O -allylhydroxylamine—adduct; 2:5Med C– O –allylhydroxylamine—adduct.

Scheme 1.

Reaction of dC and 5Me dC with hydroxylamine derivatives. Depiction of the tautomeric equilibrium and the interconversion process between the Z -isomers and the E -isomers. 1: dC- O -allylhydroxylamine—adduct; 2:5Med C– O –allylhydroxylamine—adduct.

MATERIALS AND METHODS

Incubation of DNA strands

In a thermocycler (Eppendorf Mastercycler personal) with a heatable lid a single-stranded oligodeoxynucleotide ODN (10 µM) (Metabion, Martinsried, Germany) was incubated with NH 2 OAllyl (1 M, pH 5.2 with HNEt 2 , Fluka, Buchs, Switzerland) for 4 h at 60°C. For incubation kinetics 200 pmol of the DNA were directly injected into a HPLC without prior desalting (for detailed incubation kinetics please refer to the Supplementary Data ). If the sample was to be sequenced it was desalted using BioSpin 6 Tris columns (BioRad, Hercules, CA, USA) that were previously equilibrated three times with 500 µl ddH 2 O. The filtrate was concentrated in a SpeedVac (Christ RVC-2-33 IR), re-dissolved in NH 3 (28% in H 2 O, Fluka, Buchs, Switzerland) and heated to 60°C for 90 min in a thermocycler with a heatable lid. The solution was again desalted using BioSpin 6 Tris columns. The concentration of the solution was determined with a NanoDrop photometer (PeqLab ND-1000). The treatment with ammoniumhydroxide is optional but proved to be beneficial for the subsequent sequencing process because it converts the bis-hydroxylamineadduct ( Supplementary Data ) which is to a small extent formed into the desired compound 1 in base promoted elimination reaction.

Hybridization of DNA

An amount of 1.2 µM ODN, 1 µM primer and a suitable buffer (NEBuffer 2, NEB, Frankfurt am Main, Germany) were mixed and heated to 95°C for 4 min in a thermocycler with a heatable lid. The solution was allowed to cool at a rate of 1°C per minute and stored at 4°C in the dark.

Direct sequencing

An amount of 10 pmol dsDNA in 25 µl Annealing Buffer (Qiagen, Hilden, Germany) were sequenced in a PyroMark Q24 Pyrosequencer. All bases except the lesion generated by incubation were sequenced using standard conditions. Because the incorporation of nucleotides opposite the incubated dC or 5Me dC is slower than opposite canonical bases, addition of the nucleotide was repeated 10 times. The data was analyzed by the software provided by the manufacturer. The software sums up the peak heights and determines the ratio of dATPαS/dGTP incorporation. The last methylation site cannot be detected, because the last position in a strand generally yields no results in pyrosequencing. Please note that due to the special pyrosequencing procedure dATPαS instead of dATP was used.

Primer extension based sequencing

An amount of 10 pmol of the incubated sample were hybridized with a biotinylated primer. 0.1 U/µl Klenow(Exo-) polymerase, 500 µM dNTPs and 1 µM dsDNA in 20 µl 1× NEB Buffer 2 were incubated for 10–60 min at 60°C. To the solution were added 2 µl Streptavidin Sepharose beads (GE Healthcare, Uppsala, Sweden), 40 µl Binding Buffer (Qiagen) and 18 µl ddH 2 O. After agitation at 1400 rpm for 15 min the beads were captured with a Vacuum Prep Tool (Qiagen), washed with 70% EtOH, 0.1 M NaOH and Washing Buffer (Qiagen). The beads were dissolved in 25 µl Annealing Buffer (Qiagen) containing 10 pmol sequencing primer (Metabion). Pyrosequencing was performed on a PyroMark Q24 Pyrosequencer using standard conditions (Qiagen). The data was analyzed by the software provided by the manufacturer. The average dTTP incorporation of all experiments was calculated and the data plotted relative to this value. Negative deviations from average correspond to 5Me dC, positive values to dC. The data displayed are the average of three measurements.

Real-time primer extension based quantification

ODN1C and ODN1M were mixed at different ratios, incubated with NH 2 Oallyl and hybridized with ODN2 as described earlier. In a 96-well plate (NUNC, Roskilde, Denmark) were mixed per well: 10 pmol dsDNA (10 µl), 0.15 U ATP sulfurylase (0.5 µl), 1 U Klenow(exo–) polymerase (0.2 µl), 500 pmol APS (1 µl), 78.3 µl ENLITEN luciferase, luciferin reagent (Promega, Madison, WI, USA). In a 96-well reader (TECAN Genios Pro) at 30°C 10 µl dATPαS (20 µM) were injected at 200 µl/s. The plate was shaken for 5 s (amplitude 5 mm) and the luminescence measured every 250 ms for 400 cycles. The data was collected by the Microsoft Excel Makro provided by the manufacturer. The average of four blank measurements (enzymes without DNA) was subtracted from the average of four measurements of the sample. The data for dATPαS incorporation opposite undamaged dC was taken as baseline. In the first 15–20 s the curves are unstable due to mixing effects, so the data is only shown from second 20 onwards. The slope of the curves shows the speed of incorporation of dATPαS which corresponds to the fraction of dC in the sample.

Protein purification

The polymerase subunit of DNA polymerase I from Geobacillus stearothermophilus (DSM No. 22) was amplified via PCR and subcloned in pENTR-TEV-D-TOPO (Invitrogen, Karlsbad, USA) to introduce an N-terminal TEV-protease recognition site. The fragment was then transferred to the expression plasmid pDEST007 ( 32 ), yielding a cleavable N-terminal Strep-tag II. The protein was expressed in Escherichia coli BL21 at 37°C by induction at OD 600  = 1 with 0.2 mg/l anhydrotetracycline for 2 h. Cells were resuspended in 100 mM Tris–Cl pH 7.5, 150 mM NaCl, 10 mM ß-ME and Complete protease inhibitor (Roche), lysed by French-press and heated to 50°C for 10 min to denature E. coli proteins. 0.1% NP-40 and Tween-20 were added to the lysate prior to heat-treatment and maintained during Strep-tag purification. The cleared lysate was loaded on a Streptactin column (IBA) and eluted with desthiobiotin. Subsequently, the tag was cleaved by incubation on ice with 10 U/mg AcTEV protease (Invitrogen) over night. Further purification via Heparin affinity chromatography and crystallization was performed as described in ( 33 ).

Co-crystallization

For co-crystallization the template containing lesion 1 was annealed to the corresponding primer (for sequences see Supplementary Data ) in the protein storage buffer (10 mM Na–cacodylate pH 7, 50 mM NaCl, 0.5 mM EDTA 10 mM MgSO 4 ). Prior to crystallization protein and DNA were mixed in a 1 to 3 molar ratio. The final concentration of Pol I and dsDNA were 5 mg/ml and 0.5 mM, respectively. Crystals were grown by mixing an equal volume of protein-DNA complex with 47.5–51.0% (NH 4 ) 2 SO 4 , 3.0–3.5% MPD and 100 mM MES pH 5.8, using the hanging-drop vapor diffusion method. The crystallization plates were incubated at 18°C and crystals appeared after 1–2 days. Crystals were frozen in 24% sucrose, 55% (NH 4 ) 2 SO 4 , 3.0–3.5% MPD, 100 mM MES and stored in liquid nitrogen for data collection. Best crystals were obtained with 49.5% (NH 4 ) 2 SO 4 and 3% MPD.

Collection and processing of X-ray diffraction data, phase determination and structure refinement

Data were collected at the beamline PXIII [Swiss Light Source (SLS), Villigen, Switzerland] The data were processed with the programs XDS ( 34 ) and SCALA ( 35 , 36 ). Structure solution was carried out by molecular replacement with Phaser ( 37 ) using the coordinates of 1U45 ( 38 ). In order to reduce model bias, the temperature factors were reset to 20 for main chain and 40 for side chain and DNA atoms, respectively. Prior to model building in COOT ( 39 ) a simulated annealing omit map, removing the area around the lesion, was calculated with PHENIX ( 40 ). Restrained refinement was carried out on REFMAC5 ( 41 ). Data processing and refinement statistics are summarized in Supplementary Table S1 .

RESULTS

What are the coding properties of the DNA lesions 1 and 2?

In order to investigate whether the dC-adduct 1 and 5Me dC-adduct 2 have different coding properties we reacted the oligodeoxynucleotide ODN 1 ( Figure 1 A) containing either a dC or a 5Me dC (ODN 1C and ODN 1M ) with various hydroxylamine derivatives (10 µM DNA, 1 M NH 2 OR, pH = 5.2, 4 h, 60°C). The best results were finally obtained with O –allylhydroxylamine, which furnished lesions 1 and 2 cleanly and in high yields ( Supplementary Data ). We observed a small reactivity difference between dC and 5Me dC, with the latter reacting three to five times slower ( 42 ). Most importantly, no DNA degradation or reaction with other DNA bases was observed.

Figure 1.

( A ) Primer extension reactions of incubated (but not purified) ODN 1 with primer ODN 2 [1 U Klenow (exo–) , 5 µM dNTP, 10 min, 30°C]. ODN 1C : X = dC, ODN 1M : X =  5Me dC, ODN 1T : X = dT. (A1) With all triphosphates present all strands are fully elongated before and after treatment. (A2) With ddGTP instead of GTP present, all strands are elongated by one base. After incubation with O –allylhydroxylamine (red) only ODN 1C but not ODN 1M shows the dC→dT transition mutation and is fully elongated. (A3) Without dGTP again only ODN 1C is elongated after O –allylhydroxylamine treatment. ( B ) Primer extension reactions of analytically pure ODN 3 with Primer ODN 4 [1 U Klenow (exo–) , 50 µM dNTP, 30°C, 10 min] of ODN 3 with X = dC and X =  1 . While dC can only base pair with dGTP, 1 can induce incorporation of dATP, thus generating a C→ T transversion mutation. ( C ) Primer extension of analytically pure ODN 3 with X =  1 and X =  2 and primer ODN 4 using dATP and dGTP. [1 U Klenow (exo–) , 1.5 µM dNTP, 30°C, 10 min]. ODN 3 with X =  1 can be elongated with dATP and dGTP, while elongation of the strand containing 2 is not possible.

Figure 1.

( A ) Primer extension reactions of incubated (but not purified) ODN 1 with primer ODN 2 [1 U Klenow (exo–) , 5 µM dNTP, 10 min, 30°C]. ODN 1C : X = dC, ODN 1M : X =  5Me dC, ODN 1T : X = dT. (A1) With all triphosphates present all strands are fully elongated before and after treatment. (A2) With ddGTP instead of GTP present, all strands are elongated by one base. After incubation with O –allylhydroxylamine (red) only ODN 1C but not ODN 1M shows the dC→dT transition mutation and is fully elongated. (A3) Without dGTP again only ODN 1C is elongated after O –allylhydroxylamine treatment. ( B ) Primer extension reactions of analytically pure ODN 3 with Primer ODN 4 [1 U Klenow (exo–) , 50 µM dNTP, 30°C, 10 min] of ODN 3 with X = dC and X =  1 . While dC can only base pair with dGTP, 1 can induce incorporation of dATP, thus generating a C→ T transversion mutation. ( C ) Primer extension of analytically pure ODN 3 with X =  1 and X =  2 and primer ODN 4 using dATP and dGTP. [1 U Klenow (exo–) , 1.5 µM dNTP, 30°C, 10 min]. ODN 3 with X =  1 can be elongated with dATP and dGTP, while elongation of the strand containing 2 is not possible.

To investigate the coding properties of the reaction products 1 and 2 we performed primer extension reactions with the primer ODN 2 , Klenow(exo-) polymerase and ODN 1C or ODN 1M ( Figure 1 A1–3). For both oligonucleotides we observed full extension before and after treatment with O -allylhydroxylamine. This proves that the polymerase can indeed read through the hydroxylamine adduct 1 . When dGTP was removed from the primer extension assay ( Figure 1 A3), we observed that after incubation with O –allylhydroxylamine only unmethylated ODN 1C but not methylated ODN 1M was copied. This shows that adduct 1 is replicatively bypassed and that it is base-paired with a triphosphate other than dGTP. Further studies, depicted in Figure 1 B, showed that the polymerase is able to pair 1 with dATP. In order to investigate the relative incorporation efficiency of dATP and dGTP opposite 1 the primer extension studies were repeated with dGTP replaced by ddGTP ( Figure 1 A2). The data show that Klenow(exo-) pairs 1 with dGTP and dATP with similar efficiencies. Other polymerases tested including low fidelity polymerases such as Pol η and Pol κ showed the same behavior. In the case of ODN 1M we observed full extension only when all four triphosphates are present ( Figure 1 A1). No elongation was detected in the absence of dGTP, showing that adduct 2 either only allows incorporation of dGTP or blocks the polymerase. Further studies, shown in Figure 1 C, with ODN 3 in which we synthetically inserted the 5Me dC adduct 2 , confirmed that bypass of adduct 2 is indeed difficult and not possible with dATP.

In summary, the results show that oxime 1 instructs a polymerase to introduce dG and dA into the primer. Adduct 2 in contrast hinders replication and instructs the polymerase to incorporate a dG. Both adducts 1 and 2 consequently differ dramatically regarding their coding characteristics, and only the adduct derived from dC gives rise to dC to dT transition mutations, providing the requested readout for epigenetic sequencing.

What is the structural basis of the different coding properties?

In order to prove that the formation of a E - 1 :dA base pair is the reason for the mutational bypass we crystallized a duplex containing the base pair with the protein B.st Pol I ( 38 ). The full structure is depicted Figure 2 . It shows the E - 1 :dA base pair determined at a resolution of 2.9 Å. The O–allyl–hydroxylamine-dC adduct 1 is indeed present in the E –configuration and it base pairs with dA via two perfect Watson–Crick type H–bonds as expected.

Figure 2.

Crystal structure of BstPolI in complex with a DNA double strand containing a 1 :dATP base pair (PDB: 2xo7). 2Fo–Fc electron density for the base pair is depicted at the 1σ level.

Figure 2.

Crystal structure of BstPolI in complex with a DNA double strand containing a 1 :dATP base pair (PDB: 2xo7). 2Fo–Fc electron density for the base pair is depicted at the 1σ level.

Can the coding properties be exploited for a direct readout of the methylation status?

In order to study if the coding difference of the dC and 5Me dC oxime adducts 1 and 2 can be exploited for epigenetic sequencing, we analyzed the promoter sequence of the cyclin-dependent kinase inhibitor gene p15 , known to be aberrantly methylated in acute myeloid leukemia patients ( 43–45 ). For the study we used ODN 5 ( Figure 3 A) containing two either methylated or unmethylated CpG sites and one dC site that does not neighbor a dG. After incubation of ODN 5 with O –allylhydroxylamine followed by hybridization with a sequencing primer ODN 6 the three DNA constructs were directly subjected to pyrosequencing, which visualizes elongation of the primer strand (ODN 6 ) by a luminescence signal ( 46 ). The results for position 23 and 26 are depicted in Figure 3 B. As expected, both dC and 5Me dC in untreated oligonucleotides direct the addition of dGTP to the primer. The treated DNA strands, in contrast, showed significant incorporation of dATP opposite the dC. Opposite 5Me dC this misincorporation was in all cases not detected (<5% which is the error range of the instrument, Figure 3 B). Important is the finding that the O –allylhydroxylamine modified dC base 1 instructs incorporation of ∼50% dATPαS regardless of the position of the dC base, showing that no sequence effects occur.

Figure 3.

( A ) Fragment of the p15 promoter used for direct sequencing. ( B ) Direct sequencing of ODN5 with primer ODN6 ; Workflow: Incubation with NH 2 Oallyl converts dC to 1 , 5Me dC is to a small extent converted to 2 . 1 mispairs with dATPαS, while 5Me dC pairs with dGTP and 2 blocks. Note that position 31 cannot be sequenced because pyrosequencing does not allow sequencing of the last position.

Figure 3.

( A ) Fragment of the p15 promoter used for direct sequencing. ( B ) Direct sequencing of ODN5 with primer ODN6 ; Workflow: Incubation with NH 2 Oallyl converts dC to 1 , 5Me dC is to a small extent converted to 2 . 1 mispairs with dATPαS, while 5Me dC pairs with dGTP and 2 blocks. Note that position 31 cannot be sequenced because pyrosequencing does not allow sequencing of the last position.

Does the novel method allow sequencing of DNA fragments usually employed in pyrosequencing?

We extended the new epigenetic sequencing possibility to the analysis of a longer promoter sequence. For this experiment we choose an 84-bp fragment with three either methylated or unmethylated CpG sites (ODN 7 , CpG at positions 23, 34 and 57, for sequence see Supplementary Data ) of the p15 promoter. Such a promoter sequence is a typical target of bisulfite sequencing. For the experiment ( Figure 4 A) we first incubated the different 84–mer oligonucleotides with O –allylhydroxylamine and subsequently hybridized the treated strands with the biotinylated primer ODN 8 , (for sequence see Supplementary Data ). We next performed the primer extension reaction with the Klenow(exo-) polymerase and isolated the product DNA strand via streptavidin-coated sepharose beads. Following hybridization with the sequencing primer (ODN 9 , for sequence see Supplementary Data ) the obtained oligonucleotides were finally analyzed via pyrosequencing. The results are depicted in Figure 4 B.

Figure 4.

Primer extension based sequencing of ODN 7 in four different methylation states, ( A ) Workflow: Incubation with NH 2 Oallyl converts dC to 1 , 5Me dC is to a small extent converted to 2 . 1 mispairs with dATP, while 5Me dC pairs with dGTP and 2 blocks. After primer extension original dC is thus detected by a dT signal, while original 5Me dC still only shows a dC signal. ( B ) Results of the measurements, the deviation of dTTP incorporation from average is shown. Positive values show umethylated, negative values show methylated cytosines. For sequences of ODN 7–9 see Supplementary Data .

Figure 4.

Primer extension based sequencing of ODN 7 in four different methylation states, ( A ) Workflow: Incubation with NH 2 Oallyl converts dC to 1 , 5Me dC is to a small extent converted to 2 . 1 mispairs with dATP, while 5Me dC pairs with dGTP and 2 blocks. After primer extension original dC is thus detected by a dT signal, while original 5Me dC still only shows a dC signal. ( B ) Results of the measurements, the deviation of dTTP incorporation from average is shown. Positive values show umethylated, negative values show methylated cytosines. For sequences of ODN 7–9 see Supplementary Data .

The sequencing data clearly furnished a dT signal at all original dC positions in the promoter. The signal was in all cases at least 15–20% above background. Again, the same signal intensities are obtained at all dC positions showing again that sequence effects are negligible. The O –allylhydroxylamine based epigenetic sequencing method in the reported form is consequently only limited by the pyrosequencing method, which currently limits sequencing to <100- to 150-base long oligonucleotides. As the result provides the same final readout as bisulfite sequencing it is fully compatible with current epigenetic sequencing equipment.

Can the ratio of dC to 5Me dC be determined in a sample?

Generally, the extent of the methylation can be determined by comparing the relative heights of the dT and dC signals as done in bisulfite sequencing. Additionally, we aimed to develop an alternative approach that is based on a real time primer extension reaction. By coupling a polymerase to a luciferase, elongation of a DNA strand can be monitored by light. In the case of 5Me dC no light upon addition of dATPαS can be detected. If a mixture of dC and 5Me dC is present the amount of luminescence represents the fraction of dC in the sample. As shown in Figure 5 the extent of methylation of a specific CpG site can be readily determined after calibration.

Figure 5.

Real time primer extension based quantification of the extent of the methylation in ODN1. The slope of the curves shows the speed of incorporation of dATPαS which corresponds to the fraction of dC in the sample.

Figure 5.

Real time primer extension based quantification of the extent of the methylation in ODN1. The slope of the curves shows the speed of incorporation of dATPαS which corresponds to the fraction of dC in the sample.

Can the method distinguish between 5Me dC and 5HOMe dC?

Recently, 5-hydroxymethylcytosine ( 5HOMe dC) has been found as a second post-replicatively formed DNA base ( 47 , 48 ). Newly developed methods allow for precise quantification of 5HOMe dC levels in tissues, but do not yield sequence-specific information ( 49 ). It is highly desired to have a method that can distinguish 5HOMe dC and 5HOMe dC at a single base resolution. Unfortunately, as the detection principle presented in this work is based on sterics and the new base 5HOMe dC even imposes a larger steric strain it is not possible to distinguish 5Me dC and 5HOMe dC after incubation with O –allylhydroxylamine.

CONCLUSION

In conclusion, we show that the E / Z equilibrium of O –allylhydroxylamine dC/ 5Me dC adducts is influenced by the presence of a methyl group at C5. The lesion generated from dC after O –allylhydroxylamine treatment is bypassed by base pairing with dA and dG. In contrast, after incubation of 5Me dC only incorporation of dG can be observed. This allows for the first time detection of the methylation status of a cytosine not only based on reaction rates but based on different reaction products. We used this observation to establish the first bisulfite independent 5Me dC sequencing principle. The selectivity of the method is controlled by two chemical filters. First, dC reacts faster with NH 2 Oallyl and secondly, only the dC adduct 1 but not the 5Me dC adduct 2 can induce dC to dT transition mutations, which can be detected by standard sequencing methods. As this is the same readout as for the bisulfite method our system is fully compatible with current sequencing equipment. The next focus in our research will be to couple our novel detection principle to modern sequencing-by-synthesis methods that could allow our method to be routinely applied for the methylation analysis of small amounts of DNA. Alternatively one could imagine PCR based approaches that are employed in bisulfite sequencing or to exploit the mutagenic potential of hydroxylamines by subcloning in bacteria.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online. The crystal structure can be accessed under pdb code 2xo7.

FUNDING

SFB 646 and the Center for Integrated Protein Science, Munich (CIPSM); Verband der chemischen Industrie (VCI) for a Kekulé Fellowship. Funding for open access charge: Center for Integrated Protein Science Munich (CIPSM).

Conflict of interest statement . None declared.

ACKNOWLEDGEMENTS

The authors thank Claudia Szeibert and Andrea Kneuttinger for their help during the project, Sabine Schneider for her help solving the structure and the beamline staff at the SLS for technical support. M. Münzel likes to thank the Verband der chemischen Industrie (VCI) for a Kekulé Fellowship.

REFERENCES

1
Bird
A
Perceptions of epigenetics
Nature
 , 
2007
, vol. 
447
 (pg. 
396
-
398
)
2
Suzuki
MM
Bird
A
DNA methylation landscapes: provocative insights from epigenomics
Nat. Rev. Genet.
 , 
2008
, vol. 
9
 (pg. 
465
-
476
)
3
Egger
G
Liang
G
Aparicio
A
Jones
PA
Epigenetics in human disease and prospects for epigenetic therapy
Nature
 , 
2004
, vol. 
429
 (pg. 
457
-
463
)
4
Esteller
M
Epigenetics in cancer
N. Engl. J. Med.
 , 
2008
, vol. 
358
 (pg. 
1148
-
1159
)
5
Jones
PA
Baylin
SB
The fundamental role of epigenetic events in cancer
Nat. Rev. Genet.
 , 
2002
, vol. 
3
 (pg. 
415
-
428
)
6
Goll
MG
Bestor
TH
Eukaryotic cytosine methyltransferases
Annu. Rev. Biochem.
 , 
2005
, vol. 
74
 (pg. 
481
-
514
)
7
Bird
AP
Southern
EM
Use of restriction enzymes to study eukaryotic DNA methylation
J. Mol. Biol.
 , 
1978
, vol. 
118
 (pg. 
27
-
47
)
8
Cross
SH
Charlton
JA
Nan
X
Bird
AP
Purification of CpG islands using a methylated DNA binding column
Nat. Genet.
 , 
1994
, vol. 
6
 (pg. 
236
-
244
)
9
Irizarry
RA
Ladd-Acosta
C
Carvalho
B
Wu
H
Brandenburg
SA
Jeddeloh
JA
Wen
B
Feinberg
AP
Comprehensive high-throughput arrays for relative methylation (CHARM)
Genome Res.
 , 
2008
, vol. 
18
 (pg. 
780
-
790
)
10
Weber
M
Davies
JJ
Wittig
D
Oakeley
EJ
Haase
M
Lam
WL
Schübeler
D
Chromosome-wide and promoter-specific analyses identify sites of differential DNA methylation in normal and transformed human cells
Nat. Genet.
 , 
2005
, vol. 
37
 (pg. 
853
-
862
)
11
Hayatsu
H
Wataya
Y
Kai
K
Iida
S
Reaction of sodium bisulfite with uracil, cytosine, and their derivatives
Biochemistry
 , 
1970
, vol. 
9
 (pg. 
2858
-
2865
)
12
Shapiro
R
Servis
R
Welcher
M
Reactions of uracil and cytosine derivatives with sodium bisulfite
J. Am. Chem. Soc.
 , 
1970
, vol. 
92
 (pg. 
422
-
424
)
13
Frommer
M
McDonald
LE
Millar
DS
Collis
CM
Watt
F
Grigg
GW
Molloy
PL
Paul
CL
A genomic sequencing protocol that yields a positive display of 5-methylcytosine residues in individual DNA strands
Proc. Natl Acad. Sci. USA
 , 
1992
, vol. 
89
 (pg. 
1827
-
1831
)
14
Ball
MP
Li
JB
Gao
Y
Lee
JH
LeProust
EM
Park
IH
Xie
B
Daley
GQ
Church
GM
Targeted and genome-scale strategies reveal gene-body methylation signatures in human cells
Nat. Biotechnol.
 , 
2009
, vol. 
27
 (pg. 
361
-
368
)
15
Cokus
SJ
Feng
S
Zhang
X
Chen
Z
Merriman
B
Haudenschild
CD
Pradhan
S
Nelson
SF
Pellegrini
M
Jacobsen
SE
Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning
Nature
 , 
2008
, vol. 
452
 (pg. 
215
-
219
)
16
Deng
J
Shoemaker
R
Xie
B
Gore
A
LeProust
EM
Antosiewicz-Bourget
J
Egli
D
Maherali
N
Park
I-H
Yu
J
, et al.  . 
Targeted bisulfite sequencing reveals changes in DNA methylation associated with nuclear reprogramming
Nat. Biotechnol.
 , 
2009
, vol. 
27
 (pg. 
353
-
360
)
17
Lister
R
Pelizzola
M
Dowen
RH
Hawkins
RD
Hon
G
Tonti-Filippini
J
Nery
JR
Lee
L
Ye
Z
Ngo
Q-M
, et al.  . 
Human DNA methylomes at base resolution show widespread epigenomic differences
Nature
 , 
2009
, vol. 
462
 (pg. 
315
-
322
)
18
Lister
R
O'Malley
RC
Tonti-Filippini
J
Gregory
BD
Berry
CC
Millar
AH
Ecker
JR
Highly integrated single-base resolution maps of the epigenome in Arabidopsis
Cell
 , 
2008
, vol. 
133
 (pg. 
523
-
536
)
19
Grunau
C
Clark
SJ
Rosenthal
A
Bisulfite genomic sequencing: systematic investigation of critical experimental parameters
Nucleic Acids Res.
 , 
2001
, vol. 
29
 pg. 
e65
 
20
Wang
RY
Gehrke
CW
Ehrlich
M
Comparison of bisulfite modification of 5-methyldeoxycytidine and deoxycytidine residues
Nucleic Acids Res.
 , 
1980
, vol. 
8
 (pg. 
4777
-
4790
)
21
Warnecke
PM
Stirzaker
C
Song
J
Grunau
C
Melki
JR
Clark
SJ
Identifcation and resolution of artifacts in bisulfite sequencing
Methods
 , 
2002
, vol. 
27
 (pg. 
101
-
107
)
22
Okamoto
A
Chemical approach toward efficient DNA methylation analysis
Org. Biomol. Chem.
 , 
2009
, vol. 
7
 (pg. 
21
-
26
)
23
Bareyt
S
Carell
T
Selective detection of 5-methylcytosine sites in DNA
Angew. Chem. Int. Ed.
 , 
2008
, vol. 
47
 (pg. 
181
-
184
)
24
Tanaka
K
Tainaka
K
Kamei
T
Okamoto
A
Direct labeling of 5-methylcytosine and its applications
J. Am. Chem. Soc.
 , 
2007
, vol. 
129
 (pg. 
5612
-
5620
)
25
Tanaka
K
Tainaka
K
Umemoto
T
Nomura
A
Okamoto
A
An osmium-DNA interstrand complex: application to facile DNA methylation analysis
J. Am. Chem. Soc.
 , 
2007
, vol. 
129
 (pg. 
14511
-
14517
)
26
Ogino
M
Taya
Y
Fujimoto
K
Highly selective detection of 5-methylcytosine using photochemical ligation
Chem. Commun.
 , 
2008
, vol. 
45
 (pg. 
5996
-
5998
)
27
Tanabe
K
Yamada
H
Nishimoto
S-i
One-electron photooxidation and site-selective strand cleavage at 5-methylcytosine in DNA by sensitization with 2-methyl-1,4-naphthoquinone-tethered oligonucleotides
J. Am. Chem. Soc.
 , 
2007
, vol. 
129
 (pg. 
8034
-
8040
)
28
Oka
Y
Peng
T
Takei
F
Nakatani
K
Synthesis and reaction of DNA oligomers containing modified cytosines related to bisulfite sequencing
Org. Lett.
 , 
2009
, vol. 
11
 (pg. 
1377
-
1379
)
29
Brown
DM
Schell
P
The reaction of hydroxylamine with cytosine and related compounds
J. Chem. Soc.
 , 
1965
, vol. 
47
 (pg. 
208
-
215
)
30
Freese
E
Bautz
E
Freese
EB
The chemical and mutagenic specificity of hydroxylamine
Proc. Natl Acad. Sci. USA
 , 
1961
, vol. 
47
 (pg. 
845
-
855
)
31
Niedzwiecka-Kornas
A
Kierdaszuk
B
Stolarski
R
Shugar
D
Tautomerism, acid-base properties and conformation of methylated analogues of the promutagenic N4-hydroxycytosine
Biophys. Chem.
 , 
1998
, vol. 
71
 (pg. 
87
-
98
)
32
Ober
M
Müller
H
Pieck
C
Gierlich
J
Carell
T
Base pairing and replicative processing of the formamidopyrimidine-dG DNA lesion
J. Am. Chem. Soc.
 , 
2005
, vol. 
127
 (pg. 
18143
-
18149
)
33
Kiefer
JR
Mao
C
Hansen
CJ
Basehore
SL
Hogrefe
HH
Braman
JC
Beese
LS
Crystal structure of a thermostable Bacillus DNA polymerase I large fragment at 2.1 A resolution
Structure
 , 
1997
, vol. 
5
 (pg. 
95
-
108
)
34
Kabsch
W
Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants
J. Appl. Crystallogr.
 , 
1993
, vol. 
26
 (pg. 
795
-
800
)
35
CCP4
The CCP4 suite: programs for protein crystallography
Acta Crystallogr. D Biol. Crystallogr.
 , 
1994
, vol. 
50
 (pg. 
760
-
763
)
36
Evans
P
SCALA
Joint CCP4 and ESF-EACMB Newsletter on Protein Crystallography
 , 
1997
, vol. 
33
 (pg. 
22
-
24
)
37
McCoy
AJ
Grosse-Kunstleve
RW
Storoni
LC
Read
RJ
Likelihood-enhanced fast translation functions
Acta Crystallogr. D Biol. Crystallogr.
 , 
2005
, vol. 
61
 (pg. 
458
-
464
)
38
Hsu
GW
Ober
M
Carell
T
Beese
LS
Error-prone replication of oxidatively damaged DNA by a high-fidelity DNA polymerase
Nature
 , 
2004
, vol. 
431
 (pg. 
217
-
221
)
39
Emsley
P
Cowtan
K
Coot: model-building tools for molecular graphics
Acta Crystallogr. D Biol. Crystallogr.
 , 
2004
, vol. 
60
 (pg. 
2126
-
2132
)
40
Adams
PD
Grosse-Kunstleve
RW
Hung
L-W
Ioerger
TR
McCoy
AJ
Moriarty
NW
Read
RJ
Sacchettini
JC
Sauter
NK
Terwilliger
TC
PHENIX: building new software for automated crystallographic structure determination
Acta Crystallogr. D Biol. Crystallogr.
 , 
2002
, vol. 
58
 (pg. 
1948
-
1954
)
41
Murshudov
GN
Vagin
AA
Dodson
EJ
Refinement of macromolecular structures by the maximum-likelihood method
Acta Crystallogr. D Biol. Crystallogr.
 , 
1997
, vol. 
53
 (pg. 
240
-
255
)
42
Janion
C
Shugar
D
Mutagenicity of hydroxylamine: reaction with analogues of cytosine, 5(6)-substituted cytosines and some 2-keto-4-ethoxypyrimidines
Acta Biochim. Pol.
 , 
1965
, vol. 
12
 (pg. 
337
-
355
)
43
Dodge
JE
List
AF
Futscher
BW
Selective variegated methylation of the p15 CpG island in acute myeloid leukemia
Int. J. Cancer
 , 
1998
, vol. 
78
 (pg. 
561
-
567
)
44
Herman
JG
Civin
CI
Issa
JP
Collector
MI
Sharkis
SJ
Baylin
SB
Distinct patterns of inactivation of p15INK4B and p16INK4A characterize the major types of hematological malignancies
Cancer Res.
 , 
1997
, vol. 
57
 (pg. 
837
-
841
)
45
Melki
JR
Vincent
PC
Clark
SJ
Concurrent DNA hypermethylation of multiple genes in acute myeloid leukemia
Cancer Res.
 , 
1999
, vol. 
59
 (pg. 
3730
-
3740
)
46
Ronaghi
M
Uhlen
M
Nyren
P
A sequencing method based on real-time pyrophosphate
Science
 , 
1998
, vol. 
281
 (pg. 
363
-
365
)
47
Kriaucionis
S
Heintz
N
The nuclear DNA base 5-hydroxymethylcytosine is present in Purkinje neurons and the brain
Science
 , 
2009
, vol. 
324
 (pg. 
929
-
930
)
48
Tahiliani
M
Koh
KP
Shen
Y
Pastor
WA
Bandukwala
H
Brudno
Y
Agarwal
S
Iyer
LM
Liu
DR
Aravind
L
, et al.  . 
Conversion of 5-methylcytosine to 5-hydroxymethylcytosine in mammalian DNA by MLL partner TET1
Science
 , 
2009
, vol. 
324
 (pg. 
930
-
935
)
49
Münzel
M
Globisch
D
Brückl
T
Wagner
M
Welzmiller
V
Michalakis
S
Müller
M
Biel
M
Carell
T
Quantification of the sixth DNA base hydroxymethylcytosine in the brain
Angew. Chem. Int. Ed.
 , 
2010
, vol. 
49
 (pg. 
5375
-
5377
)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

Comments

0 Comments