-
PDF
- Split View
-
Views
-
Annotate
-
Cite
Cite
Jifan Shi, Kazuyuki Aihara, Tiejun Li, Luonan Chen, Energy landscape decomposition for cell differentiation with proliferation effect, National Science Review, Volume 9, Issue 8, August 2022, nwac116, https://doi.org/10.1093/nsr/nwac116
Close -
Share
Abstract
Complex interactions between genes determine the development and differentiation of cells. We establish a landscape theory for cell differentiation with proliferation effect, in which the developmental process is modeled as a stochastic dynamical system with a birth-death term. We find that two different energy landscapes, denoted U and V, collectively contribute to the establishment of non-equilibrium steady differentiation. The potential U is known as the energy landscape leading to the steady distribution, whose metastable states stand for cell types, while V indicates the differentiation direction from pluripotent to differentiated cells. This interpretation of cell differentiation is different from the previous landscape theory without the proliferation effect. We propose feasible numerical methods and a mean-field approximation for constructing landscapes U and V. Successful applications to typical biological models demonstrate the energy landscape decomposition's validity and reveal biological insights into the considered processes.
INTRODUCTION
Cell differentiation makes the biological world rich and colorful. Modeling and understanding the dynamics of gene regulatory networks (GRNs) are essential to explore the underlying mechanism of cell differentiation [1]. Since Waddington proposed his seminal metaphor of the epigenetic landscape [2,3], differentiation has been intuitively described as a ball rolling down a surface, i.e. the energy landscape.
There are mainly two approaches to construct the energy landscape for biosystems. One is the data-based approach, which tries to identify cluster/cell types, differentiation trajectories, pseudotime and cell pluripotency from the experimental data, especially single-cell RNA sequencing (scRNA-seq) data [4–8]. Various algorithms have been proposed from the theory of graph [9,10], entropy [6,11,12] and dynamical systems [13–16]. Among these data-based methods, the population balance analysis (PBA) [13–15] and landscape of differentiation dynamics (LDD) [16] especially involve cell proliferation and death rates, which generalize the hypothesis of cell differentiation from equilibrium to non-equilibrium steady processes. The other approach is the model-based approach, which tries to build dynamic equations from the GRN, analyze the dynamical behavior of the system and then construct energy landscapes using numerical simulations. Wang et al. proposed a practical framework for constructing energy landscapes for biosystems without cell birth and death effects [17–19]. They defined the landscape as |$U(\boldsymbol{x}) = -\epsilon \log P_{ss}(\boldsymbol{x})$|, where |$\boldsymbol{x}$| is the gene expression vector, |$P_{ss}(\boldsymbol{x})$| is the steady probability density function (PDF) of the system and ε is related to the amplitude of small intrinsic noise. By changing parameter values to control the biological process, the landscape pattern varies, which gives an intuitive description of cell-type locations and transition probability between clusters. This framework has been widely used in modeling the budding yeast cell cycle [17], human stem cell fate [19] and Caenorhabditis elegans ageing [20]. Without the cell proliferation effect, however, the direction of differentiation is usually not intrinsic in Wang’s landscape but controlled by manually setting parameters. Beyond the topics above, there are also many other interesting works related to the energy landscape theory [21–27].
In this study, we follow the model-based approach but consider the birth and death rates (BDRs) of cells in differentiation, which is inspired by the PBA and LDD. We show that there are two important energy landscapes to describe differentiation dynamics, which are denoted |$U(\boldsymbol{x})$| and |$V(\boldsymbol{x})$| and are also computable using the model. The metastable states in |$U(\boldsymbol{x})$| represent cell types, and the value of |$V(\boldsymbol{x})$| implicates pluripotency. The negative gradient of |$V(\boldsymbol{x})$| shows the direction of differentiation. Taking BDRs into account is essential for constructing the pluripotency landscape |$V(\boldsymbol{x})$|. Here, we explain our theory of the energy landscape decomposition (ELD) in cell differentiation with proliferation effect. Numerical algorithms for constructing energy landscapes and the mean-field approximation (MFA) in high-dimensional cases are also proposed. We use three examples to show the application of the ELD, where |$U(\boldsymbol{x})$| and |$V(\boldsymbol{x})$| intuitively explain the processes of cell differentiation.
RESULTS
Modeling cell differentiation at the population level
Modeling cell differentiation at the single-cell level
Energy landscape decomposition
Based on the above dynamical modeling of cell development, we focus on the construction of the landscapes using the model-based approach, i.e. we assume that |$\boldsymbol{b}(\boldsymbol{x}), \epsilon$|, and |$R(\boldsymbol{x})$| are known a priori.
An illustration of the ELD framework. (a) The GRN, which reins cell differentiation dynamics. Dynamical flow |$\boldsymbol{b}(\boldsymbol{x})$| can be modeled using the GRN, and its vector field is drawn in an illustrative space (X, Y). (b) Cells can proliferate or die at a natural birth and death rate |$R(\boldsymbol{x})$|. Proliferation with |$R(\boldsymbol{x})>0$| is shown in red, while cell death with |$R(\boldsymbol{x})<0$| is in blue (jet colormap). (c) The ELD theory in this study. The dynamics |$(\boldsymbol{b}(\boldsymbol{x}), R(\boldsymbol{x}))$| can be characterized using two potential terms (energy landscapes) |$U(\boldsymbol{x})$| and |$V(\boldsymbol{x})$|, and a curl part term |$\boldsymbol{f}(\boldsymbol{x})$|. We define |$U(\boldsymbol{x})$| and |$V(\boldsymbol{x})$| using two steady PDFs, denoted |$P_U(\boldsymbol{x})$| and |$P_0(\boldsymbol{x})$|. (d) The combination of dynamical flow |$\boldsymbol{b}(\boldsymbol{x})$| (arrows) and birth and death rate |$R(\boldsymbol{x})$| (jet colored background). (e–g) Illustrations for the cell-type landscape |$U(\boldsymbol{x})$|, the pluripotency landscape |$V(\boldsymbol{x})$| and the curl part term |$\boldsymbol{f}(\boldsymbol{x})$|, respectively. Metastable states of |$U(\boldsymbol{x})$| shown in (e) stand for different cell types. The values of |$V(\boldsymbol{x})$| shown in (f) indicate the pluripotency, with its negative gradient depicting the differentiation direction. The color bar in (f) is calculated using the mean value of V along the X axis, which displays an intuitive global differentiation direction. In (g), landscape |$U(\boldsymbol{x})+V(\boldsymbol{x})$| is plotted as the background, while the curl part term |$\boldsymbol{f}(\boldsymbol{x})$| is indicated by the vectors.
The connection between the ELD framework and the existing landscape theory is as follows. In the case of no birth and death of cells, i.e. |$R(\boldsymbol{x})\equiv 0$|, (9) reduces to Wang’s landscape decomposition [17–19] with |$P_U(\boldsymbol{x})\equiv P_0(\boldsymbol{x})$| and |$V(\boldsymbol{x})\equiv 0$|; if the system is of gradient type with |$\boldsymbol{b}(\boldsymbol{x}) = -\nabla F(\boldsymbol{x})$|, (9) reduces to the landscape decomposition discussed in PBA and LDD [13,16] with |$\boldsymbol{f}(\boldsymbol{x})\equiv 0$|.
In a case that only |$p_{ss}(\boldsymbol{x})$| is known, PBA and LDD define the energy landscape V for gradient systems by solving the equation
with proper boundary conditions. However, for a non-gradient system, we show that (10) is not always valid, unless the curl part |$\boldsymbol{f}$| satisfies |$\boldsymbol{f}(\boldsymbol{x})\perp \nabla V(\boldsymbol{x})$| almost everywhere (see the online supplementary material for details). In this sense, we provide a more general definition of V using (6). More detailed discussion about the ELD can be found in the online supplementary material.
Numerical construction of energy landscapes
According to the definitions of potentials |$U(\boldsymbol{x})$| in (5) and |$V(\boldsymbol{x})$| in (6), we can compute the energy landscapes once the steady distributions |$P_U(\boldsymbol{x})$| and |$P_0(\boldsymbol{x})$| are obtained. Next we discuss the numerical methods for the low-dimensional case and the high-dimensional case separately.
Solving FPE for the low-dimensional case
For the low-dimensional case (usually in less than three dimensions), we used the finite difference or finite element method to solve (1) numerically until the steady state. For a small parameter ε, the spatial scale h of the grids in traditional methods is needed to be set as h ≪ ε to resolve the dynamical behavior of such a convection-dominated equation, which would be impractical. We utilized the streamline diffusion method [28] that can avoid the numerical oscillations even when h > ε. The detailed procedure of the streamline diffusion method is described in the online supplementary material in a two-dimensional case. It is also applied in our numerical examples in the Results section.
Mean-field approximation in the high-dimensional case
With the mean-field approximations |$P_0(\boldsymbol{x})$| and |$P_U(\boldsymbol{x})$|, we obtain the estimations for the landscapes |$U(\boldsymbol{x})$| and |$V(\boldsymbol{x})$| using (5) and (6), respectively.
Several detailed remarks need to be made. (i) There is also an MFA to (1), but, for multi-potential systems, the MFA to (1) can have different numbers of components with different parameters when approximating |$P_0(\boldsymbol{x})$| and |$P_U(\boldsymbol{x})$|, respectively. That can lead to an odd |$V(\boldsymbol{x})$|. Thus, one of the advantages of the MFA to (3) is that the numbers of components K and |$p_{(k)}(\boldsymbol{x}, \boldsymbol{\mu }_{(k)}, \Sigma _{(k)})$| computed using (12) are independent of the BDR |$R(\boldsymbol{x})$|, and only mixture weights estimated in (14) and (15) are different. (ii) The MFA to (3) is not suitable for monostable systems, as the mixture weight for trajectories in the only attractive basin is supposed to be the same. For the system with a uni-component, we use the MFA to (3), which is also described in detail in the online supplementary material. For real problems, most systems are multi-stable with several different components. (iii) In (12), the MFA is expanded to O(ε) in functions L1 and L2. In Wang’s framework [17,18] where |$R(\boldsymbol{x}) \equiv 0$|, it is only expanded to O(1) to estimate energy landscape |$U(\boldsymbol{x})$|. In the online supplementary material, we state that it is necessary to have the MFA to O(ε) to obtain a proper approximation of |$V(\boldsymbol{x})$|. Two analytic examples are also used to validate the necessity.
Numerical examples
In this section, we utilize three examples to show how we applied the ELD framework to describe cell differentiation.
Example 1: two-dimensional drift-diffusion process
The landscapes for the two-dimensional drift-diffusion process. (a) The landscape U. Three metastable states represent three cell types. (b) The landscape V. The cells differentiate from the state at a high value of V to a lower value.
Example 2: the two-gene fate decision system
(a) The interaction between two genes X1 and X2 in the fate decision system. Self-activation and inter-inhibition are observed. (b) The landscape U when the amplitude of the birth and death rate is set as r = 0. (c) The landscape U changes when r decreases from 30 to 3. (d) The landscape V changes when r decreases from 30 to 3.
Example 3: T-cell differentiation
The third example is used to describe the T-cell differentiation in high-dimensional space [34]. Four genes x1 (TCF-1), x2 (PU.1), x3 (GATA3) and x4 (BLC11B) interact with each other through activation or inhibition (see Fig. 4a). The dynamical term |$\boldsymbol{b}(\boldsymbol{x})$| is modeled using Hill functions and the parameters are listed in the online supplementary material. We set the BDR as |$R(\boldsymbol{x}) = 30[4.2-x_1^2 - (x_2-4)^2]$|. Using the MFA equation, (11), the landscapes are constructed as shown in Fig. 4b and c. The potential U corresponds to the steady-state landscape with birth-death term R. The four metastable states stand for the four stages of development of T cells (ETP/DN1, DN2a, DN2b and DN3). The potential V in Fig. 4c shows the pluripotency of cells with its value and differentiation direction with its negative gradient field. As applied to MFA, the BDR is also averaged for each Gaussian component, so the variation of |$V(\boldsymbol{x})$| within each metastable state/cell type is much smaller than that shown by U. Accordingly, there exists a sharp variation of V between two adjacent cell types, as shown in Fig. 4c.
(a) The gene regulatory network in the T-cell differentiation process. The four genes X1, X2, X3 and X4 are TCF-1, PU.1, GATA3 and BLC11B, respectively. (b) The landscape U and (c) the landscape V. Metastable states represent the T-cell stages, i.e. ETP/DN1, DN2a, DN2b and DN3. The potential V indicates the pluripotency and differentiation direction.
Besides the four-dimensional example for T-cell differentiation, we also conducted ELD on a high-dimensional example with 10 variables shown in Sec. IV(D) and Figure S2 of the online supplementary material. These results support the fact that the ELD and MFA are practical in studying the metastable states by energy landscape U and the pluripotency by energy landscape V.
CONCLUSIONS AND DISCUSSIONS
In this paper, we have proposed the ELD framework to describe cell differentiation with proliferation effect. Two energy landscapes, |$U(\boldsymbol{x})$| and |$V(\boldsymbol{x})$|, can explain the dynamical behavior of the system during differentiation. The potential U depicts the attractors standing for cell types, while V characterizes the pluripotency and differentiation direction. The consideration of BDR is important to construct V. With an additional V, ELD theory is a generalization of traditional energy landscape theory, and it is a natural realization for Waddington’s epigenetic landscape with birth-death terms. The numerical construction of ELD, especially its mean-field approximation in high-dimensional cases, is introduced. Simulated examples demonstrate the applicability of ELD. However, there are still several issues that need to be further discussed and studied.
First, the ELD framework is different from other models for constructing landscapes for cell differentiation. PBA and LDD are data driven, which are based on the scRNA-seq data and difficult to handle the non-gradient system. Wang’s landscape does not consider the birth and death of cells during the differentiation. ELD is model based and analyzes a landscape V caused by cell proliferation and death to display the differentiation direction.
Second, the BDR |$R(\boldsymbol{x})$| in our model is known a priori. For practical cases, R can be estimated from experiments (as in PBA, [13]) or approximated for each cell type (as in LDD, [16]). If only R for cell types matters, which means that R is a constant for each metastable state, the corresponding V will change little within one state while varying sharply between two adjacent states.
Third, the estimated mixture weights |$\rho _0^{(k)}$| in (14) are only rough approximations according to the size of attractive basins (percentages of trajectories falling into one meta-stable state). A more rational approach is to get the transition rate matrix |$\boldsymbol{Q}$| between different states [31,35–37]. However, it is difficult to catch rare transitions within the limited time of the simulation. A quick and accurate way to estimate mixture weights is still an open question.
Fourth, constructing landscapes for systems with limit cycles is also possible once a proper BDR is given. According to the approximation in [17], |$p_{(k)}(\boldsymbol{x}) = \lim _{s\rightarrow +\infty }(1/T)\int _s^{s+T} p(\boldsymbol{x}, \boldsymbol{\mu }_t, \Sigma _t)\mathop {}\!\mathrm{d}t$| can be used as the kth component in (11), where T is the period for the limit cycle and |$p(\boldsymbol{x}; \boldsymbol{\mu }_t, \Sigma _t)$| is the Gaussian mean-field approximation of the PDF at t by simulating (12). Constructing landscapes for real systems with limit-cycle behavior will be our future work.
Finally, in this paper we used FPE with a BDR term to study the dynamical behavior of the differentiation process in normal cells. For cancer cells that developed from normal cells, the pluripotency and proliferation rates may be restored by external factors. We leave the study of the energy landscape for cancer or tumor cells to future work.
Overall, the energy landscape is a universal concept to characterize the dynamical behavior of a system, and the proposed ELD in this study can help understand systems with proliferation and death, beyond pure reactions. This work can also be applied to model-based and data-based dynamical analyses of various biological systems [38–42].
MATERIALS AND METHODS
Detailed methods are available in the online supplementary material.
FUNDING
This work was supported by the Japan Science and Technology Agency Moonshot R&D (JPMJMS2021), the Japan Agency for Medical Research and Development (JP21dm0307009), the Japan Society for the Promotion of Science KAKENHI (JP20H05921), and the Institute of AI and Beyond, The University of Tokyo. It was also supported by the National Key R&D Program of China (2017YFA0505500), the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB38040400), the National Natural Science Foundation of China (11825102, 12131020, 31930022 and 12026608), the Beijing Academy of Artificial Intelligence (BAAI), the Special Fund for Science and Technology Innovation Strategy of Guangdong Province (2021B0909050004 and 2021B0909060002) and the Major Key Project of Peng Cheng Laboratory (PCL2021A12).
AUTHOR CONTRIBUTIONS
J.S., T.L., L.C. and K.A. designed the research; J.S. and T.L. performed the research; J.S. analyzed the data; and J.S., T.L., L.C. and K.A. wrote the paper.
Conflict of interest statement. None declared.



