Abstract

Tumor classifiers based on molecular patterns promise to define and reliably classify tumor entities. The high tissue- and cell type-specificity of DNA methylation, as well as its high stability, makes DNA methylation an ideal choice for the development of tumor classifiers. Herein, we review existing tumor classifiers using DNA methylome analysis and will provide an overview on their emerging impact on cancer classification, the detection of novel cancer subentities and patient stratification with a focus on brain tumors, sarcomas and hematopoietic malignancies. Furthermore, we provide an outlook on the enormous potential of DNA methylome analysis to complement classical histopathological and genetic diagnostics, including the emerging field of epigenomic analysis in liquid biopsies.

Background and Underlying Biology

Epigenetic modifications act in concert with transcription factors to regulate and fine-tune tissue and cell type-specific gene expression patterns (1). During development and in tissue regeneration, epigenetic patterns control the differentiation process of individual cells from immature stem and progenitor cells to terminally differentiated cell types with highly specialized functions (2). While epigenetic patterns are faithfully transmitted to daughter cells during mitosis, it is well documented that environmental stimuli and the simple passing of time confer specific changes to the epigenetic landscape of each individual cell. As a result, the epigenetic landscape reflects the cell’s developmental stage, the cell type, the cell’s history of exposure to environmental stimuli and its age (3,4).

Enzymes that establish, detect and modify epigenetic marks (the so-called epigenetic writers, readers and editors) are frequent and recurrent targets for genetic mutations in cancer (5,6). As a consequence of these mutations, the epigenomic landscape of the affected cells is altered, which contributes to the process of malignant transformation, although the detailed underlying molecular mechanisms are largely unknown. Of note, cancer-specific epigenomic alterations occur in the context of normal development- and differentiation-associated epigenomic changes (7–9). This explains why the same mutation can have different effects depending on the tissue context or the differentiation stage where it is acquired.

DNA methylation is an epigenetic mark that regulates genome interpretation during development, differentiation and disease (10). DNA methyltransferases catalyze the covalent attachment of a methyl group to the 5′ carbon of a cytosine base. In eukaryotes, the resulting 5-methylcytosine (5mC) is mainly found in the context of CpG dinucleotides, although a minor fraction of non-CpG methylation has been reported in some cell types (11,12). DNA methylation patterns of tumor cells or tissues can serve as powerful biomarkers for diagnosis, disease sub-classification, patient stratification and for the prediction of response to treatment (8,13–19).

Here, we review the use of DNA methylome analysis for cancer diagnosis, disease sub-classification and patient stratification. We will further discuss how DNA methylome analysis of cell-free tumor DNA derived from blood might serve as a novel universal biomarker for early diagnosis and monitoring of treatment response in the near future.

Development of Genomic Classifier Models for Potential Diagnostic Purposes from the Statisticians’ Perspective

Most published examples using DNA methylation patterns for the diagnostic classification of malignant tumors are based on Infinium methylation arrays. While this technology allows simultaneous, robust, reproducible and rapid measurements of DNA methylation of hundreds of thousands of CpG positions across the human genome, it is not straightforward to use this information for classification purposes. In the following, we will discuss the most important aspects to consider in this context from a statistics perspective.

The minimal requirement for a classification model that is applied diagnostically to guide clinicians in decision-making is that the classifier, in addition to a predicted class, also generates a ‘well-calibrated’ probability as confidence measure for the prediction (20–23). Besides giving confidence in the prediction, such a probabilistic output allows for easy comparison of different classifier models and might also be of importance for standardization of clinical trials that apply classifiers for stratification. A probabilistic classifier score is called ‘well-calibrated’ if for cases predicted to be in a class with the probability P, the actual proportion of samples truly belonging to this class is equal to P. Like other classifier performance-metrics, the calibration of probability outputs can be assessed on independent test data. Many classifier algorithms provide some score for the classification. For example, for Random Forests (RFs), the number of trees in the ensemble voting for a class is already a score that measures the confidence in the prediction. Like a probability score, it also sums up to one when taking the sum over all classes. However, this score is likely not ‘well-calibrated’, and often the observed score distributions between different classes are not comparable (14). ‘Well-calibrated’ scores can be achieved through various model-updating/post-processing strategies as well as fitting logistic models like the famous Platt scaling (24–26) to map raw classifier scores to a more ‘well-calibrated’ probability scale (27). Measures to evaluate the performance of a probability output are for example the Brier score, the logarithmic loss function or by visualization using calibration plots (21,22).

Further typical statistical problems faced when training classification models on high throughput molecular data for cancer classification are the high-dimensionality of the data, the imbalanced class sizes due to common and rare cancer entities and the high number of classes that is expected to increase as research in personalized oncology progresses (21). When training a classification model on high-dimensional data, it is expected that this model is somehow overfitting the data and may not generalize to independent data. This is why a proper validation is needed to assess if the classifier generalizes to new, unseen data and to estimate classifier performance metrics (28). Typical validation strategies are either to simply split the data into training and test data sets or to use more complex approaches for unbiased prediction error estimation like, e.g. cross-validation or other resampling methods (28). The most common pitfall when using cross-validation and resampling methods is that not all of the model building steps are performed within in each fold of the cross-validation, which might lead to an over optimistic prediction error estimate (29).

The extent to which classification models suffer from imbalanced classes depends on the classification algorithm chosen. For example, a RF is known to perform poorly for the minority classes when trained on extremely imbalanced data (30). To deal with such imbalanced class sizes, many different methods such as random under- or oversampling have been developed; however, these techniques are often limited to binary classification tasks (31,32). It is important to keep this potential problem in mind, when training classifiers for personalized oncology where a high number of imbalanced classes are to be expected. Finally, to tackle the replication crisis, many scientific journals already demand that authors share their data as well as their analysis code on public online platforms like GitHub.com or Bitbucket.org. This is even more important when developing genomic classifier models for clinical decision-making, especially, as most machine learning algorithms train black-box models for which a direct biological interpretation of resulting predictions is often not possible. Sharing code used to train and validate such classifiers fosters the acceptance by the community and allows to easily train competing models that might achieve better prediction performance (17,10).

Tissue and Tumor Entity-Focused Approaches

In recent years, tissue and tumor entity-focused DNA methylation classifiers have been developed, and these classifiers are now starting to be incorporated into diagnostic work-up and into clinical decision processes. Here, we discuss successful examples that showcase the clinical utility of DNA methylome analysis for diagnosis, disease sub-classification and patient stratification.

Brain tumor classifier

DNA methylation profiling is emerging as the state of the art for molecular classification of central nervous system (CNS) tumors (14,21,33). The complexity and diversity of brain tissues is reflected by the approximately 80 biologically and clinically distinct tumor entities listed in the current World Health Organization (WHO) classification of CNS tumors (34).

Over the last decade, many studies showed robust and reproducible stratification of previously considered homogenous brain tumor entities into clinically relevant DNA methylation classes (Fig. 1). Among others, these entities comprise medulloblastoma (35), glioblastoma (36), astrocytoma (37,38), ependymoma (39,40), choroid plexus tumors (41), atypical teratoid/rhabdoid tumors (42), craniopharyngiomas (43) and adult-type anaplastic glioma (44). These findings ultimately lead to the development of a large reference cohort of 2801 DNA methylation array profiles covering 82 CNS tumor methylation classes mostly reflecting entities or sub-entities defined by the WHO classification and 9 control tissue classes on which a RF classifier was trained and validated (14,21,33). Statistical cross-validation estimated a high prediction performance. The classifier predictions and the accompanying prediction scores are in strong agreement between samples processed by different laboratories, between different DNA methylation array types (EPIC/450k) as well as between array profiles and DNA methylation profiles measured by whole-genome bisulfite sequencing. Applying this classifier to a prospective cohort of over 1000 samples resulted in a revised diagnosis in about 12% of cases, highlighting the improvement in diagnostic accuracy.

UMAP non-linear dimension reduction of 14 999 DNA methylation array profiles comprising 5915 CNS tumors collected by the molecularneuropathology.org website and 9084 profiles from the Cancer Genome Atlas Project (TCGA). Highlighted are representative CNS and TCGA tumor entities as well as TCGA control samples to demonstrate the specificity of DNA methylome profiles for normal tissues as well as for distinct tumor entities.
Figure 1

UMAP non-linear dimension reduction of 14 999 DNA methylation array profiles comprising 5915 CNS tumors collected by the molecularneuropathology.org website and 9084 profiles from the Cancer Genome Atlas Project (TCGA). Highlighted are representative CNS and TCGA tumor entities as well as TCGA control samples to demonstrate the specificity of DNA methylome profiles for normal tissues as well as for distinct tumor entities.

In addition to classifier predictions, the Infinium methylation array allows to detect copy number variation profiles (conumee R-package) and MGMT promotor methylation status (45). To share the RF classifier with the community, a freely available website was developed (molecularneuropathology.org). Upon upload of methylation array raw data, this platform automatically reports classifier predictions, copy number variation profiles and MGMT methylation status. Users may contribute their data for further development of the classifier, which up to now, resulted in >30 000 DNA methylation profiles shared. This continuously growing cohort is monitored by applying non-linear dimension reduction methods that identified a very rare, aggressive spinal ependymoma entity that is characterized by MYCN amplification (46–48). This example showcases the potential of website-based, community-driven data-sharing to improve cancer diagnostics.

Recent progress in DNA methylation-based brain tumor classification leads to a further refinement of already known DNA methylation classes for ependymomas (46,49,50), medulloblastoma (51), diffuse leptomeningeal glioneuronal tumors (52), anaplastic astrocytoma (53) as well as meningioma (15) and neuroblastoma (54). These and other DNA methylation subclasses will be included in the upcoming new version of the brain tumor classifier (v12). As the number of these subclasses is constantly increasing, a DNA methylation-based molecular taxonomy of CNS tumors with several hierarchical levels combining subclasses to classes and family classes will be established. Such a molecular taxonomy that is aligned as much as possible to the WHO classification of CNS tumors is needed to translate RF predictions into clinical practice where the most detailed information about molecular subclasses might not yet have practical implications.

Sarcoma classifier

Sarcomas are a rare, clinically and morphologically heterogeneous group of malignant soft tissue and bone tumors that occur across all age groups (55). The diagnosis of sarcomas is often challenging, which is reflected by a high inter-observer variability among pathologists as well as between primary institutions and specialized referral centers (56,57). The detection of gene fusions resulting from genomic translocation events might help here; however, for almost 50% of sarcomas, well-defined molecular markers are lacking (55,58). DNA methylome profiling is emerging as a promising method to overcome these diagnostic limitations. For sarcomas with small blue round cell histology, DNA methylation profiling has been successfully applied to classify these diagnostically challenging tumors (59). Comparable studies showed the diagnostic potential of DNA methylation profiling for the classification of high-grade soft tissue sarcomas (60), rhabdomyosarcomas (61), nerve sheath tumors (62), bone sarcomas (63), spindle cell sarcomas (64) and angiosarcomas (65). Applying the blueprint of a machine learning workflow that was originally developed for the classification of CNS tumors (14,21), a comparable model for the classification of 47 soft tissue and bone sarcoma entities has been established (Koelsche et al., in revision). Like the so-called ‘Heidelberg brain tumor classifier’, this sarcoma classification model is publicly available through a web platform (molecularsarcomapathology.org), and its results can be used to guide differential diagnosis of sarcomas (66). This website also offers users to contribute their data to develop more advanced sarcoma classifiers in the future.

Lymphoid malignancies

Chronic lymphocytic leukemia (CLL) is a B cell neoplasm that is characterized by a highly heterogeneous clinical course ranging from patients who have stable disease over many years to patients with rapidly progressive disease (67). Epigenetic alterations in CLL are known for more than 20 years (68–78). Several recent studies investigated DNA methylation across selected B cell developmental stages obtained from healthy donors and detected massive DNA methylation programming that affects up to 30% of the DNA methylome (7,8,79,80). Subsequently, it was shown that the malignant B cells in CLL patients are derived from a continuum of possible differentiation stages through which healthy B cells transition during normal B cell differentiation (8,9). Furthermore, it was demonstrated that the degree of maturation achieved in the cell-of-origin of CLL closely associates with an increasingly favorable clinical outcome (8,80). Of interest, these studies uncovered that nearly all DNA methylation differences previously reported between tumor/normal or between defined disease subtypes are naturally present in healthy, non-malignant B cells (8,9,79,80). Together, these studies provided proof-of-principle that the epigenomes of CLL patients consist of two different components: (1) the epigenetic imprint of the cell-of-origin of CLL, and (2) the disease-specific epigenetic aberrations acquired during disease pathogenesis and disease progression (9). This information provides useful to identify tumor-specific changes in cell biology that could be used as targets to develop novel therapeutic approaches for CLL (9). Similar principles have been shown to apply to other B cell neoplasms like, e.g. B cell acute lymphoblastic leukemia, diffuse-large B cell lymphoma, mantle cell lymphoma and multiple myeloma (7,79,81). Very recently, a pre-print described the combined use of information of the epigenetic imprint of the cell-of-origin and of disease-specific epigenetic aberrations to train a DNA methylation classifier that identifies 14 different B cell neoplasms and clinically important disease subtypes (82). If this classifier proves to perform confidently also for independent samples, it may provide additional, independent information for the differential diagnosis of B cell neoplasm, especially in unclear diagnostic situations.

Myeloid malignancies

Myeloid malignancies have been studied in great detail at the genomic, transcriptomic and epigenomic level during the last two decades (16,83–88). While results from genomic studies have transformed the classification of myeloid malignancies, epigenomic studies have yet to demonstrate their added value as clinically relevant biomarkers for the classification of myeloid malignancies (89,90). One successful example has been published by Meldi and colleagues, who examined which patients suffering from chronic myelomonocytic leukemia are likely to respond to therapy with the demethylating agent decitabine. Using samples taken at the time of diagnosis, they developed an epigenetic classifier that confidently predicts response to decitabine treatment (91). In addition, recent studies using DNA methylome profiling identified prognostically distinct epigenetic subgroups in patients with myelodysplastic syndromes or chronic myelomonocytic leukemia that associate with clinical, biological and genetic features (92,93). To the best of our knowledge, none of these biomarkers has been implemented into a routine clinical workflow yet.

For juvenile myelomonocytic leukemia (JMML), epigenetic biomarkers have just recently started to be used in clinical practice. Research groups from Europe, the United States and Japan independently reported DNA methylation patterns as a predictive biomarker for outcome in JMML (13,17,19). Subsequently, a machine learning classifier (XGBoost) for the prediction of JMML subclasses was trained based on a consensus definition of DNA methylation subgroups (Schönung et al., under review). This classifier has been validated in an independent patient cohort and demonstrated that DNA methylation subgroups are the single most predictive biomarker for outcome in JMML. This classifier is now used by clinicians in combination with other parameters to guide risk-adapted clinical management of JMML patients.

Together, these examples from different tumor entities show that the era of DNA methylation biomarkers is just about to start, but they also highlight the potential of DNA methylome analysis for cancer diagnosis, identification of disease subclasses and as prognostic/predictive biomarkers.

DNA Methylation Analysis for Early Diagnosis and Disease Monitoring using Liquid Biopsies

In the era of personalized medicine, the identification of tumor-specific genetic and epigenetic aberrations plays an increasingly important role. The detection of potential molecular targets using tumor DNA and RNA is an integral part of the development of targeted and individualized therapeutic concepts. It is not uncommon, however, that extraction of tumor material is impossible due to the localization or the health condition of the patient. In addition, it is often not possible to repeatedly obtain tumor samples by invasive procedures solely for the purpose of disease monitoring while the patient is on therapy or during follow-up after therapy. Analysis of ctDNA from blood plasma and other body fluids (so-called liquid biopsy) has gained considerable attention in recent years and it is the subject of intensive research in almost all tumor entities. Liquid Biopsy offers several advantages over conventional biopsy-based methods: (1) it can be performed as part of a routine blood collection; (2) it can be performed at any time and can therefore be used to assess the course of the disease; (3) the ctDNA maps all tumor manifestations in the body and thus allows the characterization of tumor heterogeneity.

In addition to the sensitive detection of tumor mutations, identification of epigenetic patterns stemming from tumor cells has emerged as one of the key-areas of research in this field. Epigenetic biomarkers like tumor-specific DNA methylation patterns are attractive non-invasive, diagnostic targets in clinical oncology due to their stability in liquid biopsies (94). This development is in part fueled by technological advances that have enabled DNA methylome analysis from problematic samples as ctDNA typically is lowly abundant and highly fragmented (95–101). In addition, the tumor-derived DNA methylation patterns of ctDNA are often contaminated to a variable degree by cell-free DNA originating from non-neoplastic cells like, e.g. blood cells or endothelial cells (102,103). Knowledge on the DNA methylation patterns present in the different tissues from which the cell-free DNA originates can support the deconvolution of the epigenetic patterns detected in the cell-free DNA (102,103). Advances in bioinformatics pipelines, the broad availability of reference epigenomes from different tissues as well as the increasing amount of DNA methylomes available from cancer patients spanning also rare cancer entities have improved the identification and detection of tumor-derived epigenetic patterns, and we predict that this will be a dynamically evolving field in the next few years. Targeted, locus-specific epigenetic assays for early detection of cancer or for disease monitoring are already in clinical use (reviewed by Locke (104)). These assays have the advantage that they are easy to standardize and that they can be quickly adopted by routine laboratories. On the downside, most of these tests suffer from relatively poor specificity for a given tumor entity, which makes them unattractive as screening tools (104). Until now, there have been only few published examples of genome-wide DNA methylome analysis in the context of liquid biopsies for the diagnosis or monitoring of malignant tumors. Successful examples include proof-of-principle studies using whole genome bisulfite sequencing of ctDNA-enriched samples (105), immunoprecipitation-based approaches (103) or 450k array analysis (106). The immunoprecipitation-based approach has recently been applied to the detection of renal cancer in blood and urine samples (107). In addition, a combination of targeted capture BS-seq and low-coverage whole-genome bisulfite sequencing was used to identify prostate cancer-specific DNA methylation patterns in blood plasma and allowed the prediction of response to anti-androgen therapy (108).

Outlook

Genome-wide DNA methylation profiling has proven to be a powerful tool that already impacts on diagnostic work-up and patient stratification. This is foremost true in the brain tumor field, but other tumor classes and entities like, e.g. sarcomas and hematopoietic malignancies are about to catch up. It is tempting to speculate that in the near future, sensitive and accurate DNA methylation classifiers will be developed that support the diagnostic work-up and patient stratification at the pan-cancer scale. Such classifiers might one day facilitate pan-cancer screening from liquid biopsies. The development of sensitive and scalable epigenomic analyses from liquid biopsies will hugely benefit from the dynamically evolving field of single-cell epigenomics in the near future.

Acknowledgements

We thank the German Glioma Network and the Neuroonkologische Arbeitsgemeinschaft for sharing their data. In addition, we thank David Capper, David Jones, Volker Hovestadt and Andreas von Deimling for their fundamental work on the original brain tumor classifier.

Conflict of Interest statement. A patent for a ‘DNA-methylation-based method for classifying tumor species of the brain’ has been applied for by the Deutsches Krebsforschungszentrum Stiftung des öffentlichen Rechts and Ruprecht-Karls-Universität Heidelberg (EP 3067432 A1) with S.M.P. and M.S. as inventors. C.P. and D.B.L. have no potential conflicts of interest to report.

Funding

German Cancer Aid, (DKH 70113869 to C.P., DKH 70112574 to D.B.L.); the José Carreras Leukämie Stiftung (DJCLS R 15/01 to C.P. and D.B.L.); the Helmholtz Foundation (to C.P.); the DKFZ-Heidelberg Center for Personalized Oncology (DKFZ-HIPO_036); the German Childhood Cancer Foundation (DKS 2015.01); an Illumina Medical Research Grant, the DKTK joint funding project ‘Next Generation Molecular Diagnostics of Malignant Gliomas’.

References

1.

Long
,
M.D.
,
Smiraglia
,
D.J.
and
Campbell
,
M.J.
(
2017
)
The genomic impact of DNA CpG methylation on gene expression; relationships in prostate cancer
.
Biomol. Ther.
,
7
,
15
.

2.

Lister
,
R.
and
Mukamel
,
E.A.
(
2015
)
Turning over DNA methylation in the mind
.
Front. Neurosci.
,
9
,
252
.

3.

Jones
,
M.J.
,
Goodman
,
S.J.
and
Kobor
,
M.S.
(
2015
)
DNA methylation and healthy human aging
.
Aging Cell
,
14
,
924
932
.

4.

Niculescu
,
M.D.
and
Lupu
,
D.S.
(
2011
)
Nutritional influence on epigenetics and effects on longevity
.
Curr. Opin. Clin. Nutr. Metab. Care
,
14
,
35
40
.

5.

Plass
,
C.
,
Pfister
,
S.M.
,
Lindroth
,
A.M.
,
Bogatyrova
,
O.
,
Claus
,
R.
and
Lichter
,
P.
(
2013
)
Mutations in regulators of the epigenome and their connections to global chromatin patterns in cancer
.
Nat. Rev. Genet.
,
14
,
765
780
.

6.

Campbell
,
P.J.
,
Getz
,
G.
,
Korbel
,
J.O.
,
Stuart
,
J.M.
,
Jennings
,
J.L.
,
Stein
,
L.D.
,
Perry
,
M.D.
,
Nahal-Bose
,
H.K.
,
Ouellette
,
B.F.F.
,
Li
,
C.H.
 et al. (
2020
)
Pan-cancer analysis of whole genomes
.
Nature
,
578
,
82
93
.

7.

Kulis
,
M.
,
Merkel
,
A.
,
Heath
,
S.
,
Queiros
,
A.C.
,
Schuyler
,
R.P.
,
Castellano
,
G.
,
Beekman
,
R.
,
Raineri
,
E.
,
Esteve
,
A.
,
Clot
,
G.
 et al. (
2015
)
Whole-genome fingerprint of the DNA methylome during human B cell differentiation
.
Nat. Genet.
,
47
,
746
756
.

8.

Oakes
,
C.C.
,
Seifert
,
M.
,
Assenovl
,
Y.
,
Gu
,
L.
,
Przekopowitz
,
M.
,
Ruppert
,
A.S.
,
Wang
,
Q.
,
Imbusch
,
C.D.
,
Serva
,
A.
,
Koser
,
S.D.
 et al. (
2016
)
DNA methylation dynamics during B cell maturation underlie a continuum of disease phenotypes in chronic lymphocytic leukemia
.
Nat. Genet.
,
48
,
253
264
.

9.

Wierzbinska
,
J.A.
,
Toth
,
R.
,
Ishaque
,
N.
,
Rippe
,
K.
,
Mallm
,
J.-P.
,
Klett
,
L.C.
,
Mertens
,
D.
,
Zenz
,
T.
,
Hielscher
,
T.
,
Seifert
,
M.
 et al. (
2020
)
Methylome-based cell-of-origin modeling (methyl-COOM) identifies aberrant expression of immune regulatory molecules in CLL
.
Genome Med.
,
12
,
29
.

10.

Bird
,
A.
(
2002
)
DNA methylation patterns and epigenetic memory
.
Genes Dev.
,
16
,
6
21
.

11.

Jang
,
H.S.
,
Shin
,
W.J.
,
Lee
,
J.E.
and
Do
,
J.T.
(
2017
)
CpG and non-CpG methylation in epigenetic gene regulation and brain function
.
Genes (Basel)
,
8
,
148
.

12.

Ramsahoye
,
B.H.
,
Biniszkiewicz
,
D.
,
Lyko
,
F.
,
Clark
,
V.
,
Bird
,
A.P.
and
Jaenisch
,
R.
(
2000
)
Non-CpG methylation is prevalent in embryonic stem cells and may be mediated by DNA methyltransferase 3a
.
Proc. Natl. Acad. Sci. USA
,
97
,
5237
5242
.

13.

Lipka
,
D.B.
,
Witte
,
T.
,
Toth
,
R.
,
Yang
,
J.
,
Wiesenfarth
,
M.
,
Nollke
,
P.
,
Fischer
,
A.
,
Brocks
,
D.
,
Gu
,
Z.
,
Park
,
J.
 et al. (
2017
)
RAS-pathway mutation patterns define epigenetic subclasses in juvenile myelomonocytic leukemia
.
Nat. Commun.
,
8
,
2126
.

14.

Capper
,
D.
,
Jones
,
D.T.W.
,
Sill
,
M.
,
Hovestadt
,
V.
,
Schrimpf
,
D.
,
Sturm
,
D.
,
Koelsche
,
C.
,
Sahm
,
F.
,
Chavez
,
L.
,
Reuss
,
D.E.
 et al. (
2018
)
DNA methylation-based classification of central nervous system tumours
.
Nature
,
555
,
469
474
.

15.

Sahm
,
F.
,
Schrimpf
,
D.
,
Stichel
,
D.
,
Jones
,
D.T.W.
,
Hielscher
,
T.
,
Schefzyk
,
S.
,
Okonechnikov
,
K.
,
Koelsche
,
C.
,
Reuss
,
D.E.
,
Capper
,
D.
 et al. (
2017
)
DNA methylation-based classification and grading system for meningioma: a multicentre, retrospective analysis
.
Lancet Oncol.
,
18
,
682
694
.

16.

Figueroa
,
M.E.
,
Lugthart
,
S.
,
Li
,
Y.
,
Erpelinck-Verschueren
,
C.
,
Deng
,
X.
,
Christos
,
P.J.
,
Schifano
,
E.
,
Booth
,
J.
,
van Putten
,
W.
,
Skrabanek
,
L.
 et al. (
2010
)
DNA methylation signatures identify biologically distinct subtypes in acute myeloid leukemia
.
Cancer Cell
,
17
,
13
27
.

17.

Stieglitz
,
E.
,
Mazor
,
T.
,
Olshen
,
A.B.
,
Geng
,
H.
,
Gelston
,
L.C.
,
Akutagawa
,
J.
,
Lipka
,
D.B.
,
Plass
,
C.
,
Flotho
,
C.
,
Chehab
,
F.F.
 et al. (
2017
)
Genome-wide DNA methylation is predictive of outcome in juvenile myelomonocytic leukemia
.
Nat. Commun.
,
8
,
2127
.

18.

Goeppert
,
B.
,
Toth
,
R.
,
Singer
,
S.
,
Albrecht
,
T.
,
Lipka
,
D.B.
,
Lutsik
,
P.
,
Brocks
,
D.
,
Baehr
,
M.
,
Muecke
,
O.
,
Assenov
,
Y.
 et al. (
2019
)
Integrative analysis defines distinct prognostic subgroups of intrahepatic cholangiocarcinoma
.
Hepatology
,
69
,
2091
2106
.

19.

Murakami
,
N.
,
Okuno
,
Y.
,
Yoshida
,
K.
,
Shiraishi
,
Y.
,
Nagae
,
G.
,
Suzuki
,
K.
,
Narita
,
A.
,
Sakaguchi
,
H.
,
Kawashima
,
N.
,
Wang
,
X.
 et al. (
2018
)
Integrated molecular profiling of juvenile myelomonocytic leukemia
.
Blood
,
131
,
1576
1586
.

20.

Kim
,
K.I.
and
Simon
,
R.
(
2014
)
Overfitting, generalization, and MSE in class probability estimation with high-dimensional data
.
Biom. J.
,
56
,
256
269
.

21.

Maros
,
M.E.
,
Capper
,
D.
,
Jones
,
D.T.W.
,
Hovestadt
,
V.
,
von Deimling
,
A.
,
Pfister
,
S.M.
,
Benner
,
A.
,
Zucknick
,
M.
and
Sill
,
M.
(
2020
)
Machine learning workflows to estimate class probabilities for precision cancer diagnostics on DNA methylation microarray data
.
Nat. Protoc.
,
15
,
479
512
.

22.

Schwarz
,
J.
and
Heider
,
D.
(
2019
)
GUESS: projecting machine learning scores to well-calibrated probability estimates for clinical decision-making
.
Bioinformatics
,
35
,
2458
2465
.

23.

Simon
,
R.
(
2014
)
Class probability estimation for medical studies
.
Biom. J.
,
56
,
597
600
.

24.

Platt
,
J.C.
(
1999
). ADVANCES IN LARGE MARGIN CLASSIFIERS: Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods, MIT Press,
61
74
.

25.

Boström
,
H.
(
2008
), In
2008 Seventh International Conference on Machine Learning and Applications
: Calibrating Random Forests,
121
126
.

26.

Wu
,
T.-F.
,
Lin
,
C.-J.
and
Weng
,
R.C.
(
2004
)
Probability estimates for multi-class classification by pairwise coupling
.
J. Mach. Learn. Res.
,
5
,
975
1005
.

27.

Appel
,
I.J.
,
Gronwald
,
W.
and
Spang
,
R.
(
2011
)
Estimating classification probabilities in high-dimensional diagnostic studies
.
Bioinformatics
,
27
,
2563
2570
.

28.

Simon
,
R.
(
2005
)
Roadmap for developing and validating therapeutically relevant genomic classifiers
.
J. Clin. Oncol.
,
23
,
7332
7341
.

29.

Varma
,
S.
and
Simon
,
R.
(
2006
)
Bias in error estimation when using cross-validation for model selection
.
BMC Bioinformatics
,
7
,
91
.

30.

Chen
,
C.L.
and
Breiman
,
L.
(
2004
)
Using Random Forest to Learn Imbalanced Data
. https://statistics.berkeley.edu/sites/default/files/tech-reports/666.pdf.

31.

Chawla
,
N.V.B.
,
Bowyer
,
K.W.
,
Hall
,
L.O.
and
Kegelmeyer
,
W.P.
(
2002
)
SMOTE: synthetic minority over-sampling technique
.
J. Artif. Intell. Res.
,
16
.

32.

Lunardon
,
N.
,
Menardi
,
G.
and
Torelli
,
N.
(
2014
)
ROSE: a package for binary imbalanced learning
.
R Journal
,
6
,
79
89
.

33.

Capper
,
D.
,
Stichel
,
D.
,
Sahm
,
F.
,
Jones
,
D.T.W.
,
Schrimpf
,
D.
,
Sill
,
M.
,
Schmid
,
S.
,
Hovestadt
,
V.
,
Reuss
,
D.E.
,
Koelsche
,
C.
 et al. (
2018
)
Practical implementation of DNA methylation and copy-number-based CNS tumor diagnostics: the Heidelberg experience
.
Acta Neuropathol.
,
136
,
181
210
.

34.

Louis
,
D.N.
,
Perry
,
A.
,
Reifenberger
,
G.
,
von Deimling
,
A.
,
Figarella-Branger
,
D.
,
Cavenee
,
W.K.
,
Ohgaki
,
H.
,
Wiestler
,
O.D.
,
Kleihues
,
P.
and
Ellison
,
D.W.
(
2016
)
The 2016 World Health Organization classification of tumors of the central nervous system: a summary
.
Acta Neuropathol.
,
131
,
803
820
.

35.

Hovestadt
,
V.
,
Jones
,
D.T.
,
Picelli
,
S.
,
Wang
,
W.
,
Kool
,
M.
,
Northcott
,
P.A.
,
Sultan
,
M.
,
Stachurski
,
K.
,
Ryzhova
,
M.
,
Warnatz
,
H.J.
 et al. (
2014
)
Decoding the regulatory landscape of medulloblastoma using DNA methylation sequencing
.
Nature
,
510
,
537
541
.

36.

Sturm
,
D.
,
Witt
,
H.
,
Hovestadt
,
V.
,
Khuong-Quang
,
D.A.
,
Jones
,
D.T.
,
Konermann
,
C.
,
Pfaff
,
E.
,
Tönjes
,
M.
,
Sill
,
M.
,
Bender
,
S.
 et al. (
2012
)
Hotspot mutations in H3F3A and IDH1 define distinct epigenetic and biological subgroups of glioblastoma
.
Cancer Cell
,
22
,
425
437
.

37.

Lambert
,
S.R.
,
Witt
,
H.
,
Hovestadt
,
V.
,
Zucknick
,
M.
,
Kool
,
M.
,
Pearson
,
D.M.
,
Korshunov
,
A.
,
Ryzhova
,
M.
,
Ichimura
,
K.
,
Jabado
,
N.
 et al. (
2013
)
Differential expression and methylation of brain developmental genes define location-specific subsets of pilocytic astrocytoma
.
Acta Neuropathol.
,
126
,
291
301
.

38.

Reuss
,
D.E.
,
Kratz
,
A.
,
Sahm
,
F.
,
Capper
,
D.
,
Schrimpf
,
D.
,
Koelsche
,
C.
,
Hovestadt
,
V.
,
Bewerunge-Hudler
,
M.
,
Jones
,
D.T.
,
Schittenhelm
,
J.
 et al. (
2015
)
Adult IDH wild type astrocytomas biologically and clinically resolve into other tumor entities
.
Acta Neuropathol.
,
130
,
407
417
.

39.

Mack
,
S.C.
,
Witt
,
H.
,
Piro
,
R.M.
,
Gu
,
L.
,
Zuyderduyn
,
S.
,
Stütz
,
A.M.
,
Wang
,
X.
,
Gallo
,
M.
,
Garzia
,
L.
,
Zayne
,
K.
 et al. (
2014
)
Epigenomic alterations define lethal CIMP-positive ependymomas of infancy
.
Nature
,
506
,
445
450
.

40.

Pajtler
,
K.W.
,
Witt
,
H.
,
Sill
,
M.
,
Jones
,
D.T.
,
Hovestadt
,
V.
,
Kratochwil
,
F.
,
Wani
,
K.
,
Tatevossian
,
R.
,
Punchihewa
,
C.
,
Johann
,
P.
 et al. (
2015
)
Molecular classification of ependymal tumors across all CNS compartments, Histopathological grades, and age groups
.
Cancer Cell
,
27
,
728
743
.

41.

Thomas
,
C.
,
Sill
,
M.
,
Ruland
,
V.
,
Witten
,
A.
,
Hartung
,
S.
,
Kordes
,
U.
,
Jeibmann
,
A.
,
Beschorner
,
R.
,
Keyvani
,
K.
,
Bergmann
,
M.
 et al. (
2016
)
Methylation profiling of choroid plexus tumors reveals 3 clinically distinct subgroups
.
Neuro-Oncology
,
18
,
790
796
.

42.

Johann
,
P.D.
,
Erkek
,
S.
,
Zapatka
,
M.
,
Kerl
,
K.
,
Buchhalter
,
I.
,
Hovestadt
,
V.
,
Jones
,
D.T.W.
,
Sturm
,
D.
,
Hermann
,
C.
,
Segura Wang
,
M.
 et al. (
2016
)
Atypical Teratoid/Rhabdoid tumors are comprised of three epigenetic subgroups with distinct enhancer landscapes
.
Cancer Cell
,
29
,
379
393
.

43.

Hölsken
,
A.
,
Sill
,
M.
,
Merkle
,
J.
,
Schweizer
,
L.
,
Buchfelder
,
M.
,
Flitsch
,
J.
,
Fahlbusch
,
R.
,
Metzler
,
M.
,
Kool
,
M.
,
Pfister
,
S.M.
 et al. (
2016
)
Adamantinomatous and papillary craniopharyngiomas are characterized by distinct epigenomic as well as mutational and transcriptomic profiles
.
Acta Neuropathol. Commun.
,
4
,
20
.

44.

Wiestler
,
B.
,
Capper
,
D.
,
Sill
,
M.
,
Jones
,
D.T.
,
Hovestadt
,
V.
,
Sturm
,
D.
,
Koelsche
,
C.
,
Bertoni
,
A.
,
Schweizer
,
L.
,
Korshunov
,
A.
 et al. (
2014
)
Integrated DNA methylation and copy-number profiling identify three clinically and biologically relevant groups of anaplastic glioma
.
Acta Neuropathol.
,
128
,
561
571
.

45.

Bady
,
P.
,
Sciuscio
,
D.
,
Diserens
,
A.C.
,
Bloch
,
J.
,
van den Bent
,
M.J.
,
Marosi
,
C.
,
Dietrich
,
P.Y.
,
Weller
,
M.
,
Mariani
,
L.
,
Heppner
,
F.L.
 et al. (
2012
)
MGMT methylation analysis of glioblastoma on the Infinium methylation BeadChip identifies two distinct CpG regions associated with gene silencing and outcome, yielding a prediction model for comparisons across datasets, tumor grades, and CIMP-status
.
Acta Neuropathol.
,
124
,
547
560
.

46.

Ghasemi
,
D.R.
,
Sill
,
M.
,
Okonechnikov
,
K.
,
Korshunov
,
A.
,
Yip
,
S.
,
Schutz
,
P.W.
,
Scheie
,
D.
,
Kruse
,
A.
,
Harter
,
P.N.
,
Kastelan
,
M.
 et al. (
2019
)
MYCN amplification drives an aggressive form of spinal ependymoma
.
Acta Neuropathol.
,
138
,
1075
1089
.

47.

van der
 
Maaten
,
L.
and
Hinton
,
G.
(
2008
)
Visualizing Data Using t-SNE, Journal of Machine Learning Research
,
9
,
2579
2605
.

48.

McInnes
,
L.
,
Healy
,
J.
and
Melville
,
J.
(
2018
)
UMAP: uniform manifold approximation and projection for dimension reduction
, arXiv:1802.03426
.

49.

Pajtler
,
K.W.
,
Wen
,
J.
,
Sill
,
M.
,
Lin
,
T.
,
Orisme
,
W.
,
Tang
,
B.
,
Hübner
,
J.M.
,
Ramaswamy
,
V.
,
Jia
,
S.
,
Dalton
,
J.D.
 et al. (
2018
)
Molecular heterogeneity and CXorf67 alterations in posterior fossa group A (PFA) ependymomas
.
Acta Neuropathol.
,
136
,
211
226
.

50.

Cavalli
,
F.M.G.
,
Hübner
,
J.M.
,
Sharma
,
T.
,
Luu
,
B.
,
Sill
,
M.
,
Zapotocky
,
M.
,
Mack
,
S.C.
,
Witt
,
H.
,
Lin
,
T.
,
Shih
,
D.J.H.
 et al. (
2018
)
Heterogeneity within the PF-EPN-B ependymoma subgroup
.
Acta Neuropathol.
,
136
,
227
237
.

51.

Sharma
,
T.
,
Schwalbe
,
E.C.
,
Williamson
,
D.
,
Sill
,
M.
,
Hovestadt
,
V.
,
Mynarek
,
M.
,
Rutkowski
,
S.
,
Robinson
,
G.W.
,
Gajjar
,
A.
,
Cavalli
,
F.
 et al. (
2019
)
Second-generation molecular subgrouping of medulloblastoma: an international meta-analysis of group 3 and group 4 subtypes
.
Acta Neuropathol.
,
138
,
309
326
.

52.

Deng
,
M.Y.
,
Sill
,
M.
,
Chiang
,
J.
,
Schittenhelm
,
J.
,
Ebinger
,
M.
,
Schuhmann
,
M.U.
,
Monoranu
,
C.M.
,
Milde
,
T.
,
Wittmann
,
A.
,
Hartmann
,
C.
 et al. (
2018
)
Molecularly defined diffuse leptomeningeal glioneuronal tumor (DLGNT) comprises two subgroups with distinct clinical and genetic features
.
Acta Neuropathol.
,
136
,
239
253
.

53.

Reinhardt
,
A.
,
Stichel
,
D.
,
Schrimpf
,
D.
,
Sahm
,
F.
,
Korshunov
,
A.
,
Reuss
,
D.E.
,
Koelsche
,
C.
,
Huang
,
K.
,
Wefers
,
A.K.
,
Hovestadt
,
V.
 et al. (
2018
)
Anaplastic astrocytoma with piloid features, a novel molecular class of IDH wildtype glioma with recurrent MAPK pathway, CDKN2A/B and ATRX alterations
.
Acta Neuropathol.
,
136
,
273
291
.

54.

Henrich
,
K.O.
,
Bender
,
S.
,
Saadati
,
M.
,
Dreidax
,
D.
,
Gartlgruber
,
M.
,
Shao
,
C.
,
Herrmann
,
C.
,
Wiesenfarth
,
M.
,
Parzonka
,
M.
,
Wehrmann
,
L.
 et al. (
2016
)
Integrative Genome-scale analysis identifies epigenetic mechanisms of transcriptional deregulation in unfavorable neuroblastomas
.
Cancer Res.
,
76
,
5523
5537
.

55.

Hemmings
,
C.
(
2014
)
Morphology, molecular genetics and multidisciplinary management: soft tissue pathology in 2014 and beyond
.
Pathology
,
46
,
93
94
.

56.

Ray-Coquard
,
I.
,
Montesco
,
M.C.
,
Coindre
,
J.M.
,
Dei Tos
,
A.P.
,
Lurkin
,
A.
,
Ranchère-Vince
,
D.
,
Vecchiato
,
A.
,
Decouvelaere
,
A.V.
,
Mathoulin-Pélissier
,
S.
,
Albert
,
S.
 et al. (
2012
)
Sarcoma: concordance between initial diagnosis and centralized expert review in a population-based study within three European regions
.
Ann. Oncol.
,
23
,
2442
2449
.

57.

Gatta
,
G.
,
van der Zwan
,
J.M.
,
Casali
,
P.G.
,
Siesling
,
S.
,
Dei Tos
,
A.P.
,
Kunkler
,
I.
,
Otter
,
R.
,
Licitra
,
L.
,
Mallone
,
S.
,
Tavilla
,
A.
 et al. (
2011
)
Rare cancers are not so rare: the rare cancer burden in Europe
.
Eur. J. Cancer
,
47
,
2493
2511
.

58.

Italiano
,
A.
,
Di Mauro
,
I.
,
Rapp
,
J.
,
Pierron
,
G.
,
Auger
,
N.
,
Alberti
,
L.
,
Chibon
,
F.
,
Escande
,
F.
,
Voegeli
,
A.C.
,
Ghnassia
,
J.P.
 et al. (
2016
)
Clinical effect of molecular methods in sarcoma diagnosis (GENSARC): a prospective, multicentre, observational study
.
Lancet Oncol.
,
17
,
532
538
.

59.

Koelsche
,
C.
,
Hartmann
,
W.
,
Schrimpf
,
D.
,
Stichel
,
D.
,
Jabar
,
S.
,
Ranft
,
A.
,
Reuss
,
D.E.
,
Sahm
,
F.
,
Jones
,
D.T.W.
,
Bewerunge-Hudler
,
M.
 et al. (
2018
)
Array-based DNA-methylation profiling in sarcomas with small blue round cell histology provides valuable diagnostic information
.
Mod. Pathol.
,
31
,
1246
1256
.

60.

Renner
,
M.
,
Wolf
,
T.
,
Meyer
,
H.
,
Hartmann
,
W.
,
Penzel
,
R.
,
Ulrich
,
A.
,
Lehner
,
B.
,
Hovestadt
,
V.
,
Czwan
,
E.
,
Egerer
,
G.
 et al. (
2013
)
Integrative DNA methylation and gene expression analysis in high-grade soft tissue sarcomas
.
Genome Biol.
,
14
,
r137
.

61.

Seki
,
M.
,
Nishimura
,
R.
,
Yoshida
,
K.
,
Shimamura
,
T.
,
Shiraishi
,
Y.
,
Sato
,
Y.
,
Kato
,
M.
,
Chiba
,
K.
,
Tanaka
,
H.
,
Hoshino
,
N.
 et al. (
2015
)
Integrated genetic and epigenetic analysis defines novel molecular subgroups in rhabdomyosarcoma
.
Nat. Commun.
,
6
,
7557
.

62.

Röhrich
,
M.
,
Koelsche
,
C.
,
Schrimpf
,
D.
,
Capper
,
D.
,
Sahm
,
F.
,
Kratz
,
A.
,
Reuss
,
J.
,
Hovestadt
,
V.
,
Jones
,
D.T.
,
Bewerunge-Hudler
,
M.
 et al. (
2016
)
Methylation-based classification of benign and malignant peripheral nerve sheath tumors
.
Acta Neuropathol.
,
131
,
877
887
.

63.

Wu
,
S.P.
,
Cooper
,
B.T.
,
Bu
,
F.
,
Bowman
,
C.J.
,
Killian
,
J.K.
,
Serrano
,
J.
,
Wang
,
S.
,
Jackson
,
T.M.
,
Gorovets
,
D.
,
Shukla
,
N.
 et al. (
2017
)
DNA methylation-based classifier for accurate molecular diagnosis of bone sarcomas
.
JCO Precis. Oncol.
,
2017
, doi: .

64.

Koelsche
,
C.
,
Mynarek
,
M.
,
Schrimpf
,
D.
,
Bertero
,
L.
,
Serrano
,
J.
,
Sahm
,
F.
,
Reuss
,
D.E.
,
Hou
,
Y.
,
Baumhoer
,
D.
,
Vokuhl
,
C.
 et al. (
2018
)
Primary intracranial spindle cell sarcoma with rhabdomyosarcoma-like features share a highly distinct methylation profile and DICER1 mutations
.
Acta Neuropathol.
,
136
,
327
337
.

65.

Weidema
,
M.E.
,
van de Geer
,
E.
,
Koelsche
,
C.
,
Desar
,
I.M.E.
,
Kemmeren
,
P.
,
Hillebrandt-Roeffen
,
M.H.S.
,
Ho
,
V.K.Y.
,
van der Graaf
,
W.T.A.
,
Versleijen-Jonkers
,
Y.M.H.
,
von Deimling
,
A.
 et al. (
2020
)
DNA methylation profiling identifies distinct clusters in angiosarcomas
.
Clin. Cancer Res.
,
26
,
93
100
.

66.

Heilig
,
C.E.
,
Horak
,
P.
,
Lipka
,
D.B.
,
Mock
,
A.
,
Uhrig
,
S.
,
Kreutzfeldt
,
S.
,
Richter
,
S.
,
Gieldon
,
L.
,
Fröhlich
,
M.
,
Hutter
,
B.
 et al. (
2020
)
Germline SDHB-inactivating mutation in gastric spindle cell sarcoma
.
Genes Chromosomes Cancer
. doi: .

67.

Rai
,
K.R.
,
Sawitsky
,
A.
,
Cronkite
,
E.P.
,
Chanana
,
A.D.
,
Levy
,
R.N.
and
Pasternack
,
B.S.
(
1975
)
Clinical staging of chronic lymphocytic leukemia
.
Blood
,
46
,
219
234
.

68.

Costello
,
J.F.
,
Fruhwald
,
M.C.
,
Smiraglia
,
D.J.
,
Rush
,
L.J.
,
Robertson
,
G.P.
,
Gao
,
X.
,
Wright
,
F.A.
,
Feramisco
,
J.D.
,
Peltomaki
,
P.
,
Lang
,
J.C.
 et al. (
2000
)
Aberrant CpG-island methylation has non-random and tumour-type-specific patterns
.
Nat. Genet.
,
24
,
132
138
.

69.

Rush
,
L.J.
,
Raval
,
A.
,
Funchain
,
P.
,
Johnson
,
A.J.
,
Smith
,
L.
,
Lucas
,
D.M.
,
Bembea
,
M.
,
Liu
,
T.H.
,
Heerema
,
N.A.
,
Rassenti
,
L.
 et al. (
2004
)
Epigenetic profiling in chronic lymphocytic leukemia reveals novel methylation targets
.
Cancer Res.
,
64
,
2424
2433
.

70.

Raval
,
A.
,
Rush
,
L.J.
,
Funchain
,
P.
,
Flax
,
E.
,
Davis
,
M.
,
Johnson
,
A.J.
,
Byrd
,
J.C.
and
Plass
,
C.
(
2002
)
Aberrant DNA methylation in chronic lymphocytic leukemia: a role in pathogenesis?
 
Blood
,
100
,
379a
.

71.

Liu
,
T.H.
,
Raval
,
A.
,
Chen
,
S.S.
,
Matkovic
,
J.J.
,
Byrd
,
J.C.
and
Plass
,
C.
(
2006
)
CpG island methylation and expression of the secreted frizzled-related protein gene family in chronic lymphocytic leukemia
.
Cancer Res.
,
66
,
653
658
.

72.

Raval
,
A.
,
Tanner
,
S.M.
,
Byrd
,
J.C.
,
Angerman
,
E.B.
,
Perko
,
J.D.
,
Chen
,
S.S.
,
Hackanson
,
B.
,
Grever
,
M.R.
,
Lucas
,
D.M.
,
Matkovic
,
J.J.
 et al. (
2007
)
Downregulation of death-associated protein kinase 1 (DAPK1) in chronic lymphocytic leukemia
.
Cell
,
129
,
879
890
.

73.

Claus
,
R.
,
Hackanson
,
B.
,
Poetsch
,
A.R.
,
Zucknick
,
M.
,
Sonnet
,
M.
,
Blagitko-Dorfs
,
N.
,
Hiller
,
J.
,
Wilop
,
S.
,
Brummendorf
,
T.H.
,
Galm
,
O.
 et al. (
2012
)
Quantitative analyses of DAPK1 methylation in AML and MDS
.
Int J Cancer
,
131
,
E138
E142
.

74.

Wei
,
Q.X.
,
Claus
,
R.
,
Hielscher
,
T.
,
Mertens
,
D.
,
Raval
,
A.
,
Oakes
,
C.C.
,
Tanner
,
S.M.
,
de la Chapelle
,
A.
,
Byrd
,
J.C.
,
Stilgenbauer
,
S.
 et al. (
2013
)
Germline allele-specific expression of DAPK1 in chronic lymphocytic leukemia
.
PLoS One
,
8
,
e55261
.

75.

Chen
,
S.S.
,
Claus
,
R.
,
Lucas
,
D.M.
,
Yu
,
L.
,
Qian
,
J.
,
Ruppert
,
A.S.
,
West
,
D.A.
,
Williams
,
K.E.
,
Johnson
,
A.J.
,
Sablitzky
,
F.
 et al. (
2011
)
Silencing of the inhibitor of DNA binding protein 4 (ID4) contributes to the pathogenesis of mouse and human CLL
.
Blood
,
117
,
862
871
.

76.

Baer
,
C.
,
Claus
,
R.
,
Frenzel
,
L.P.
,
Zucknick
,
M.
,
Park
,
Y.J.
,
Gu
,
L.
,
Weichenhan
,
D.
,
Fischer
,
M.
,
Pallasch
,
C.P.
,
Herpel
,
E.
 et al. (
2012
)
Extensive promoter DNA hypermethylation and hypomethylation is associated with aberrant microRNA expression in chronic lymphocytic leukemia
.
Cancer Res.
,
72
,
3775
3785
.

77.

Baer
,
C.
,
Claus
,
R.
and
Plass
,
C.
(
2013
)
Genome-wide epigenetic regulation of miRNAs in Cancer
.
Cancer Res.
,
73
,
473
477
.

78.

Baer
,
C.
,
Oakes
,
C.C.
,
Ruppert
,
A.S.
,
Claus
,
R.
,
Kim-Wanner
,
S.Z.
,
Mertens
,
D.
,
Zenz
,
T.
,
Stilgenbauer
,
S.
,
Byrd
,
J.C.
and
Plass
,
C.
(
2015
)
Epigenetic silencing of miR-708 enhances NF-kappaB signaling in chronic lymphocytic leukemia
.
Int J Cancer
,
137
,
1352
1361
.

79.

Beekman
,
R.
,
Chapaprieta
,
V.
,
Russiñol
,
N.
,
Vilarrasa-Blasi
,
R.
,
Verdaguer-Dot
,
N.
,
Martens
,
J.H.A.
,
Duran-Ferrer
,
M.
,
Kulis
,
M.
,
Serra
,
F.
,
Javierre
,
B.M.
 et al. (
2018
)
The reference epigenome and regulatory chromatin landscape of chronic lymphocytic leukemia
.
Nat. Med.
,
24
,
868
880
.

80.

Kulis
,
M.
,
Heath
,
S.
,
Bibikova
,
M.
,
Queirós
,
A.C.
,
Navarro
,
A.
,
Clot
,
G.
,
Martínez-Trillos
,
A.
,
Castellano
,
G.
,
Brun-Heath
,
I.
,
Pinyol
,
M.
 et al. (
2012
)
Epigenomic analysis detects widespread gene-body DNA hypomethylation in chronic lymphocytic leukemia
.
Nat. Genet.
,
44
,
1236
1242
.

81.

Queiros
,
A.C.
,
Beekman
,
R.
,
Vilarrasa-Blasi
,
R.
,
Duran-Ferrer
,
M.
,
Clot
,
G.
,
Merkel
,
A.
,
Raineri
,
E.
,
Russinol
,
N.
,
Castellano
,
G.
,
Bea
,
S.
 et al. (
2016
)
Decoding the DNA Methylome of mantle cell lymphoma in the light of the entire B cell lineage
.
Cancer Cell
,
30
,
806
821
.

82.

Duran-Ferrer
,
M.
,
Clot
,
G.
,
Nadeu
,
F.
,
Beekman
,
R.
,
Baumann
,
T.
,
Nordlund
,
J.
,
Marincevic-Zuniga
,
Y.
,
Lönnerholm
,
G.
,
Rivas-Delgado
,
A.
,
Martin
,
S.
 et al. (
2020
)
The proliferative history shapes the DNA methylome of B-cell tumors and predicts clinical outcome
.
bioRxiv
. doi: .

83.

Cancer Genome Atlas Research Network
(
2013
)
Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia
.
N. Engl. J. Med.
,
368
,
2059
2074
.

84.

Bullinger
,
L.
,
Dohner
,
K.
and
Dohner
,
H.
(
2017
)
Genomics of acute myeloid leukemia diagnosis and pathways
.
J. Clin. Oncol.
,
35
,
934
946
.

85.

Bullinger
,
L.
,
Döhner
,
K.
,
Bair
,
E.
,
Fröhling
,
S.
,
Schlenk
,
R.F.
,
Tibshirani
,
R.
,
Döhner
,
H.
and
Pollack
,
J.R.
(
2004
)
Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia
.
N. Engl. J. Med.
,
350
,
1605
1616
.

86.

Barrio
,
S.
,
Gallardo
,
M.
,
Albizua
,
E.
,
Jimenez
,
A.
,
Rapado
,
I.
,
Ayala
,
R.
,
Gilsanz
,
F.
,
Martin-Subero
,
J.I.
and
Martinez-Lopez
,
J.
(
2011
)
Epigenomic profiling in polycythaemia vera and essential thrombocythaemia shows low levels of aberrant DNA methylation
.
J. Clin. Pathol.
,
64
,
1010
1013
.

87.

Nischal
,
S.
,
Bhattacharyya
,
S.
,
Christopeit
,
M.
,
Yu
,
Y.
,
Zhou
,
L.
,
Bhagat
,
T.D.
,
Sohal
,
D.
,
Will
,
B.
,
Mo
,
Y.
,
Suzuki
,
M.
 et al. (
2013
)
Methylome profiling reveals distinct alterations in phenotypic and mutational subgroups of myeloproliferative neoplasms
.
Cancer Res.
,
73
,
1076
1085
.

88.

Pérez
,
C.
,
Pascual
,
M.
,
Martín-Subero
,
J.I.
,
Bellosillo
,
B.
,
Segura
,
V.
,
Delabesse
,
E.
,
Álvarez
,
S.
,
Larrayoz
,
M.J.
,
Rifón
,
J.
,
Cigudosa
,
J.C.
 et al. (
2013
)
Aberrant DNA methylation profile of chronic and transformed classic Philadelphia-negative myeloproliferative neoplasms
.
Haematologica
,
98
,
1414
1420
.

89.

Arber
,
D.A.
,
Orazi
,
A.
,
Hasserjian
,
R.
,
Thiele
,
J.
,
Borowitz
,
M.J.
,
Le Beau
,
M.M.
,
Bloomfield
,
C.D.
,
Cazzola
,
M.
and
Vardiman
,
J.W.
(
2016
)
The 2016 revision to the World Health Organization classification of myeloid neoplasms and acute leukemia
.
Blood
,
127
,
2391
2405
.

90.

WHO
(
2017
)
WHO Classification of Tumours of Haematopoietic and Lymphoid Tissues
.
International Agency for Research on Cancer
,
Lyon
.

91.

Meldi
,
K.
,
Qin
,
T.
,
Buchi
,
F.
,
Droin
,
N.
,
Sotzen
,
J.
,
Micol
,
J.B.
,
Selimoglu-Buet
,
D.
,
Masala
,
E.
,
Allione
,
B.
,
Gioia
,
D.
 et al. (
2015
)
Specific molecular signatures predict decitabine response in chronic myelomonocytic leukemia
.
J. Clin. Invest.
,
125
,
1857
1872
.

92.

Reilly
,
B.
,
Tanaka
,
T.N.
,
Diep
,
D.
,
Yeerna
,
H.
,
Tamayo
,
P.
,
Zhang
,
K.
and
Bejar
,
R.
(
2019
)
DNA methylation identifies genetically and prognostically distinct subtypes of myelodysplastic syndromes
.
Blood Adv.
,
3
,
2845
2858
.

93.

Palomo
,
L.
,
Malinverni
,
R.
,
Cabezón
,
M.
,
Xicoy
,
B.
,
Arnan
,
M.
,
Coll
,
R.
,
Pomares
,
H.
,
García
,
O.
,
Fuster-Tormo
,
F.
,
Grau
,
J.
 et al. (
2018
)
DNA methylation profile in chronic myelomonocytic leukemia associates with distinct clinical, biological and genetic features
.
Epigenetics
,
13
,
8
18
.

94.

van der Pol
,
Y.
and
Mouliere
,
F.
(
2019
)
Toward the early detection of cancer by decoding the epigenetic and environmental fingerprints of cell-free DNA
.
Cancer Cell
,
36
,
350
368
.

95.

Smallwood
,
S.A.
,
Lee
,
H.J.
,
Angermueller
,
C.
,
Krueger
,
F.
,
Saadeh
,
H.
,
Peat
,
J.
,
Andrews
,
S.R.
,
Stegle
,
O.
,
Reik
,
W.
and
Kelsey
,
G.
(
2014
)
Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity
.
Nat. Methods
,
11
,
817
820
.

96.

Farlik
,
M.
,
Sheffield
,
N.C.
,
Nuzzo
,
A.
,
Datlinger
,
P.
,
Schönegger
,
A.
,
Klughammer
,
J.
and
Bock
,
C.
(
2015
)
Single-cell DNA methylome sequencing and bioinformatic inference of epigenomic cell-state dynamics
.
Cell Rep.
,
10
,
1386
1397
.

97.

Miura
,
F.
,
Enomoto
,
Y.
,
Dairiki
,
R.
and
Ito
,
T.
(
2012
)
Amplification-free whole-genome bisulfite sequencing by post-bisulfite adaptor tagging
.
Nucleic Acids Res.
,
40
,
e136
.

98.

Liu
,
Y.
,
Cheng
,
J.
,
Siejka-Zielińska
,
P.
,
Weldon
,
C.
,
Roberts
,
H.
,
Lopopolo
,
M.
,
Magri
,
A.
,
D’Arienzo
,
V.
,
Harris
,
J.M.
,
McKeating
,
J.A.
 et al. (
2020
)
Accurate targeted long-read DNA methylation and hydroxymethylation sequencing with TAPS
.
Genome Biol.
,
21
,
54
.

99.

Liu
,
Y.
,
Siejka-Zielinska
,
P.
,
Velikova
,
G.
,
Bi
,
Y.
,
Yuan
,
F.
,
Tomkova
,
M.
,
Bai
,
C.
,
Chen
,
L.
,
Schuster-Bockler
,
B.
and
Song
,
C.X.
(
2019
)
Bisulfite-free direct detection of 5-methylcytosine and 5-hydroxymethylcytosine at base resolution
.
Nat. Biotechnol.
,
37
,
424
429
.

100.

Ponnaluri
,
V.K.C.W.
,
Louise
,
Sexton
,
B.
,
Saleh
,
L.
,
Marks
,
K.
,
Samaranayake
,
M.
,
Langhorst
,
B.
,
Sun
,
Z.
,
Evans
 Jr,
T.C.
,
Vaisvila
,
R.
 et al. (
2019
)
EM-seq Enables Accurate and Precise Methylome Analysis of Challenging DNA Samples
. https://www.nebiolabs.com.au/-/media/posters/document/enzymatic_methyl-seq_enables_accurate_precise_methylome_analysis_of_cell-free_dna_2019.pdf.

101.

Williams
,
L.P.
,
V.K.C.
,
Sexton
,
B.S.
,
Saleh
,
L.
,
Marks
,
K.
,
Samaranayake
,
M.
,
Ettwiller
,
L.
,
Guan
,
S.
,
Church
,
H.E.
,
Dai
,
N.
 et al. (
2019
) ,
Enzymatic Methyl-seq: Next Generation Methylomes
, https://international.neb.com/-/media/posters/document/nebnext_enzymatic_methyl-seq_2019.pdf?rev=81b22e6ea6824044b67581522c5264f6&hash=D45FDACE1C86C99DC866CE274CAF281E.

102.

Gallardo-Gómez
,
M.
,
Moran
,
S.
,
Páez de la Cadena
,
M.
,
Martínez-Zorzano
,
V.S.
,
Rodríguez-Berrocal
,
F.J.
,
Rodríguez-Girondo
,
M.
,
Esteller
,
M.
,
Cubiella
,
J.
,
Bujanda
,
L.
,
Castells
,
A.
 et al. (
2018
)
A new approach to epigenome-wide discovery of non-invasive methylation biomarkers for colorectal cancer screening in circulating cell-free DNA using pooled samples
.
Clin. Epigenetics
,
10
,
53
.

103.

Shen
,
S.Y.
,
Singhania
,
R.
,
Fehringer
,
G.
,
Chakravarthy
,
A.
,
Roehrl
,
M.H.A.
,
Chadwick
,
D.
,
Zuzarte
,
P.C.
,
Borgida
,
A.
,
Wang
,
T.T.
,
Li
,
T.
 et al. (
2018
)
Sensitive tumour detection and classification using plasma cell-free DNA methylomes
.
Nature
,
563
,
579
583
.

104.

Locke
,
W.J.
,
Guanzon
,
D.
,
Ma
,
C.
,
Liew
,
Y.J.
,
Duesing
,
K.R.
,
Fung
,
K.Y.C.
and
Ross
,
J.P.
(
2019
)
DNA methylation cancer biomarkers: translation to the clinic
.
Front. Genet.
,
10
,
1150
.

105.

Maggi
,
E.C.
,
Gravina
,
S.
,
Cheng
,
H.
,
Piperdi
,
B.
,
Yuan
,
Z.
,
Dong
,
X.
,
Libutti
,
S.K.
,
Vijg
,
J.
and
Montagna
,
C.
(
2018
)
Development of a method to implement whole-genome bisulfite sequencing of cfDNA from cancer patients and a mouse tumor model
.
Front. Genet.
,
9
,
6
.

106.

Hlady
,
R.A.
,
Zhao
,
X.
,
Pan
,
X.
,
Yang
,
J.D.
,
Ahmed
,
F.
,
Antwi
,
S.O.
,
Giama
,
N.H.
,
Patel
,
T.
,
Roberts
,
L.R.
,
Liu
,
C.
 et al. (
2019
)
Genome-wide discovery and validation of diagnostic DNA methylation-based biomarkers for hepatocellular cancer detection in circulating cell free DNA
.
Theranostics
,
9
,
7239
7250
.

107.

Nuzzo
,
P.V.
,
Berchuck
,
J.E.
,
Korthauer
,
K.
,
Spisak
,
S.
,
Nassar
,
A.H.
,
Abou Alaiwi
,
S.
,
Chakravarthy
,
A.
,
Shen
,
S.Y.
,
Bakouny
,
Z.
,
Boccardo
,
F.
 et al. (
2020
)
Detection of renal cell carcinoma using plasma and urine cell-free DNA methylomes
.
Nat. Med.
,
26
,
1041
1043
.

108.

Wu
,
A.
,
Cremaschi
,
P.
,
Wetterskog
,
D.
,
Conteduca
,
V.
,
Franceschini
,
G.M.
,
Kleftogiannis
,
D.
,
Jayaram
,
A.
,
Sandhu
,
S.
,
Wong
,
S.Q.
,
Benelli
,
M.
 et al. (
2020
)
Genome-wide plasma DNA methylation features of metastatic prostate cancer
.
J. Clin. Invest.
,
130
,
1991
2000
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)