Abstract

Summary

The flood of recent cancer genomic data requires a coherent model that can sort out the findings to systematically explain clonal evolution and the resultant intra-tumor heterogeneity (ITH). Here, we present a new mathematical model designed to computationally simulate the evolution of cancer cells. The model connects the well-known hallmarks of cancer with the specific mutational states of tumor-related genes. The cell behavior phenotypes are stochastically determined, and the hallmarks probabilistically interfere with the phenotypic probabilities. In turn, the hallmark variables depend on the mutational states of tumor-related genes. Thus, our software can deepen our understanding of cancer-cell evolution and generation of ITH.

Availability and implementation

The open-source code is available in the repository https://github.com/nagornovys/Cancer_cell_evolution.

Supplementary information

Supplementary data are available at Bioinformatics online.

1 Introduction

Mathematical models, often coupled with computer simulation, can provide insights into clonal evolution and intra-tumor heterogeneity in cancer, which cannot be obtained through direct observation of the genomic states of tumor patients. Previously, a branching process model that computationally simulates clonal evolution was used to estimate parameters such as the mutation rate and relative fitness of tumor subclones (Williams et al., 2018). In the study, an approximate Bayesian computation (ABC) framework was applied to the observation data of variant allele frequencies (VAFs) detected by next-generation sequencing at the genome level in multiple cancers such as breast, blood, and lung cancers. Multiple studies have also reported using computer simulations for addressing cancer development and progression (Altrock et al., 2015; Beerenwinkel et al., 2015; Conterno Minussi et al., 2019; Sottoriva et al., 2013; Waclaw et al., 2015; Wang et al., 2014).

Despite these advances, it remains difficult to elucidate the roles of tumor-related genes in cancer development in these representative models. This is because these models postulate ‘abstract’ genes, in which generated mutations are assumed to change parameter values for the birth (cell division) and death (apoptosis) rates of tumor cells, the fitness change, and the dispersal (migration) rate. Gene functions such as angiogenesis by VEGF and immortalization by TERT are abstracted into these parameters, and so cannot directly be modeled. The roles of cancer-related genes were originally summarized under six essential ‘hallmarks’, and more recently two extra hallmarks and two characteristics were added (we will collectively refer to all the ten summary terms as hallmarks) (Hanahan and Weinberg, 2000, 2011). Such hallmarks are not only conceptually proposed, but have also been individually assigned to specific tumor-related genes in cancer knowledge databases such as COSMIC (Forbes et al., 2017) and MSigDB (Liberzon et al., 2015).

Introducing these essential cancer hallmarks into computer simulation models will improve their ability to predict the effects of tumor-related genes during cancer development. Indeed, some studies have used cancer hallmarks in their simulation models (Abbott et al., 2006; Basanta et al., 2011; Monteagudo and Santos, 2014; Spencer et al., 2006). However, these hallmark simulations focused on the phenotypic (behavioral) traits of cancer cells, in which an abstract gene corresponds to a hallmark one-to-one. When a ‘phenotypic’ mutation occurs, the corresponding hallmark is affected, though in reality, one gene can contribute to multiple hallmarks and one hallmark can be affected by multiple genes. This is exemplified in the case of TP53, which contributes at the very least to both tumor suppression and apoptosis.

Here, we present a new computer simulation program, named tugHall (tumor gene-Hallmark) simulator, of a cancer-cell evolution model, wherein gene mutations are linked to the tumor cell behaviors that are influenced by the hallmarks of cancer. All first six hallmarks were implemented. Our software can utilize the accumulated genomic data to provide insights into the underlying mechanisms of cancer-cell evolution.

2 Model

We briefly explain our model here, with full details provided in Supplementary Material. In the model, cells at an initial time-point are put on trials, where the next phenotypic state of each cell is probabilistically determined based on each trial. For example, a cell is put on the ‘apoptosis’ trial (Fig. 1), where the cell may die according to the ‘apoptosis’ probability variable, a. Cancer hallmarks are introduced to interfere with such trial probabilities. For example, the ‘evading apoptosis’, simply abbreviated to the ‘apoptosis’ hallmark (Hanahan and Weinberg, 2000), designated variable Ha, decreases the apoptosis probability by aHa. The value given to each cancer hallmark variable is calculated by the linear combination of gene indicator variables to represent mutational states and their constant weights (Supplementary Material). The mutational states and weights can be estimated by observed data provided by ICGC (International Cancer Genome Consortium et al., 2010) and The Cancer Genome Atlas (TCGA) through ABC. Our simulation model can be interpreted as an agent-based model of branching processes where a cell’s future state is stochastically determined by trials interfered with hallmarks that are linked to gene mutations via linear combinations. The algorithm and state transitions of this model are depicted in Supplementary Material.

The simulation framework. A cell is put to a ‘trial’, through which the next state of the cell is probabilistically determined. For example, a cell may die in the ‘apoptosis’ trial based on the probability value for cell death by apoptosis. Hallmarks interfere with such trial probabilities when genes related to hallmarks are impaired by mutations. The variables of hallmarks represent probabilities or rates to modify trial probability values. For example, when a gene related to apoptosis is impaired, the hallmark variable of apoptosis decreases a probabilistic value in the apoptosis trial.
Fig. 1.

The simulation framework. A cell is put to a ‘trial’, through which the next state of the cell is probabilistically determined. For example, a cell may die in the ‘apoptosis’ trial based on the probability value for cell death by apoptosis. Hallmarks interfere with such trial probabilities when genes related to hallmarks are impaired by mutations. The variables of hallmarks represent probabilities or rates to modify trial probability values. For example, when a gene related to apoptosis is impaired, the hallmark variable of apoptosis decreases a probabilistic value in the apoptosis trial.

3 Results

As an example, the simulation results for colorectal cancer with the hallmarks defined in COSMIC (Forbes et al., 2017) and with simple weighted coefficients has been provided. The time evolution of clones (the number of clones, number of cells in each clone, total number of cells, and final state of clones) in one simulation is shown in Supplementary Material (Supplementary Fig. S4). The simulation enabled observation of the competition between clones and determination of the clones that died. Finally, only a few clones survived. The order of gene dysfunction (from first to last) is shown with the number of cells. Additional results are described in Supplementary Material (Supplementary Results and Supplementary Fig. S5).

We have provided an illustration to show how to estimate simulation parameters by ABC and how to utilize the estimates as previously described (Kato et al., 2017; Sottoriva et al., 2015; Williams et al., 2018). From TCGA (Ellrott et al., 2018), we downloaded data on the experimentally observed VAFs of APC, KRAS, TP53 and PIK3CA for a colorectal cancer patient and used the VAFs as summary statistics for ABC to estimate posterior distributions for the weight parameters (see Supplementary Material for the methods). The estimated posterior distributions suggest that PIK3CA makes a greater contribution to invasion/metastasis than KRAS and TP53 in this patient (Supplementary Fig. S6E in Supplementary Material).

Based on the maximum a posteriori probability (MAP) estimates from the posteriors, we searched for key genes important for the numbers of primary tumor cells, metastatic cells, and clones, by artificially nullifying the aberrant function of each of the four genes (i.e. by setting the weights to zero for a gene of interest). Figure 2 shows that nullifying the aberrant function of TP53, but not the other genes, decreased the number of metastatic cells in this patient.

Genes influencing the numbers of primary tumor cells, metastatic cells and clones. ‘MAP’ represents simulations using MAP estimates for the weight parameters via ABC. ‘–APC’ represents simulations where the weight parameters for APC were set to zero and those for the other genes were kept as the MAP estimates. The same applies to the other genes. The numbers at the last time-point from 12 000 simulations are plotted as distributions. (A) The number of primary tumor cells. (B) The number of metastatic cells. (C) The number of clones.
Fig. 2.

Genes influencing the numbers of primary tumor cells, metastatic cells and clones. ‘MAP’ represents simulations using MAP estimates for the weight parameters via ABC. ‘–APC’ represents simulations where the weight parameters for APC were set to zero and those for the other genes were kept as the MAP estimates. The same applies to the other genes. The numbers at the last time-point from 12 000 simulations are plotted as distributions. (A) The number of primary tumor cells. (B) The number of metastatic cells. (C) The number of clones.

Acknowledgements

We are grateful to Momoko Nagai, Ritsuko Onuki, and Daichi Narushima for technical assistance. We also @thank Asmaa Elzawahry, Yusuke Suenaga, Sana Yokoi, Yoshitaka Hippo, Atsushi Niida and Daniel A. Vasco for useful suggestions.

Funding

This work was supported by JST CREST [14531766]; MEXT [15K06916]; and AMED [16ck0106013h0003 and 19cm0106536h0002].

Conflict of Interest: none declared.

References

Abbott
 
R.G.
 et al. (
2006
)
Simulating the hallmarks of cancer
.
Artif. Life
,
12
,
617
634
.

Altrock
 
P.M.
 et al. (
2015
)
The mathematics of cancer: integrating quantitative models
.
Nat. Rev. Cancer
,
15
,
730
745
.

Basanta
 
D.
 et al. (
2011
)
Computational analysis of the influence of the microenvironment on carcinogenesis
.
Math. Biosci
.,
229
,
22
29
.

Beerenwinkel
 
N.
 et al. (
2015
)
Cancer evolution: mathematical models and computational inference
.
Syst. Biol
.,
64
,
e1
25
.

Conterno Minussi
 
D.
 et al. (
2019
)
esiCancer: evolutionary in silico cancer simulator
.
Cancer Res
.,
79
,
1010
1013
.

Ellrott
 
K.
 et al. (
2018
)
Scalable open science approach for mutation calling of tumor exomes using multiple genomic pipelines
.
Cell Syst
.,
6
,
271
281.e277

Forbes
 
S.A.
 et al. (
2017
)
COSMIC: somatic cancer genetics at high-resolution
.
Nucleic Acids Res
.,
45
,
D777
D783
.

Hanahan
 
D.
,
Weinberg
R.A.
(
2000
)
The hallmarks of cancer
.
Cell
,
100
,
57
70
.

Hanahan
 
D.
,
Weinberg
R.A.
(
2011
)
Hallmarks of cancer: the next generation
.
Cell
,
144
,
646
674
.

International Cancer Genome
 
Consortium
 et al. (
2010
)
International network of cancer genome projects
.
Nature
,
464
,
993
998
.

Kato
 
M.
 et al. (
2017
)
Sweepstake evolution revealed by population-genetic analysis of copy-number alterations in single genomes of breast cancer
.
R Soc. Open Sci
.,
4
,
171060
.

Liberzon
 
A.
 et al. (
2015
)
The Molecular Signatures Database (MSigDB) hallmark gene set collection
.
Cell Syst
.,
1
,
417
425
.

Monteagudo
 
A.
,
Santos
J.
(
2014
)
Studying the capability of different cancer hallmarks to initiate tumor growth using a cellular automaton simulation. Application in a cancer stem cell context
.
Biosystems
,
115
,
46
58
.

Sottoriva
 
A.
 et al. (
2015
)
A Big Bang model of human colorectal tumor growth
.
Nat. Genet
.,
47
,
209
216
.

Sottoriva
 
A.
 et al. (
2013
)
Single-molecule genomic data delineate patient-specific tumor profiles and cancer stem cell organization
.
Cancer Res
.,
73
,
41
49
.

Spencer
 
S.L.
 et al. (
2006
)
Modeling somatic evolution in tumorigenesis
.
PLoS Comput. Biol
.,
2
,
e108
.

Waclaw
 
B.
 et al. (
2015
)
A spatial model predicts that dispersal and cell turnover limit intratumour heterogeneity
.
Nature
,
525
,
261
264
.

Wang
 
Y.
 et al. (
2014
)
Clonal evolution in breast cancer revealed by single nucleus genome sequencing
.
Nature
,
512
,
155
160
.

Williams
 
M.J.
 et al. (
2018
)
Quantification of subclonal selection in cancer from bulk sequencing data
.
Nat. Genet
.,
50
,
895
903
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Associate Editor: Russell Schwartz
Russell Schwartz
Associate Editor
Search for other works by this author on:

Supplementary data