-
PDF
- Split View
-
Views
-
Cite
Cite
Domagoj Dorešić, Stephan Grein, Jan Hasenauer, Efficient parameter estimation for ODE models of cellular processes using semi-quantitative data, Bioinformatics, Volume 40, Issue Supplement_1, July 2024, Pages i558–i566, https://doi.org/10.1093/bioinformatics/btae210
- Share Icon Share
Abstract
Quantitative dynamical models facilitate the understanding of biological processes and the prediction of their dynamics. The parameters of these models are commonly estimated from experimental data. Yet, experimental data generated from different techniques do not provide direct information about the state of the system but a nonlinear (monotonic) transformation of it. For such semi-quantitative data, when this transformation is unknown, it is not apparent how the model simulations and the experimental data can be compared.
We propose a versatile spline-based approach for the integration of a broad spectrum of semi-quantitative data into parameter estimation. We derive analytical formulas for the gradients of the hierarchical objective function and show that this substantially increases the estimation efficiency. Subsequently, we demonstrate that the method allows for the reliable discovery of unknown measurement transformations. Furthermore, we show that this approach can significantly improve the parameter inference based on semi-quantitative data in comparison to available methods.
Modelers can easily apply our method by using our implementation in the open-source Python Parameter EStimation TOolbox (pyPESTO) available at https://github.com/ICB-DCM/pyPESTO.
1 Introduction
The use of mechanistic mathematical models has greatly contributed to the understanding of biological processes at the cellular (Kitano 2002, Schöberl et al. 2009), patient (Fey et al. 2015, Hass et al. 2017) and population level (Giordano et al. 2020, Zhao and Chen 2020). In particular, mechanistic ordinary differential equation (ODE) models are used for a broad spectrum of applications, ranging from cellular signaling, metabolism, and gene regulation over pharmacokinetics and -dynamics to the spread of diseases. However, ODE models often contain parameters that cannot be measured directly. Instead, the parameters have to be estimated from experimental data (Mitra and Hlavacek 2019). This is commonly achieved by numerical optimization of an objective function, which quantifies how well the model simulations fit the given experimental data, such as the likelihood function.
The experimental data used for parameter estimation are collected using a broad spectrum of experimental techniques. For example, early studies in the field of systems biology employed well-calibrated Western blot experiments and performed an in-depth assessment of the mapping of concentration to measured intensities (Kreutz et al. 2007). In this case, the data were ensured to fall within the linear regime of the experimental technique, and often even absolute quantification was performed. However, many, even state–of–the–art, measurement techniques do not ensure a linear relationship between the abundance of the biochemical quantities of interest and the measured output (Fig. 1). Well-known examples include fluorescence microscopy data such as Förster resonance energy transfer (FRET) data (Birtwistle et al. 2011), optical density (OD) measurement (Stevenson et al. 2016) and imaging data for certain stainings (Pargett et al. 2014). In addition, many experimental techniques suffer from lower limits of detection and/or saturation effects.

A nonlinear measurement mapping. (Biochemical system) True values of a biochemical quantity of interest . (Measurement process) A measurement process can introduce unknown nonlinear data mappings (dashed blue line). In that case, a mapping function transforms the biochemical quantities and yields nonlinearly mapped measured quantities . (Measurement output) The measurement quantities are corrupted by noise, resulting in a noise-corrupted dataset
Quantitative data are easy to use for the parameterization of ODE models and the same holds for data that are collected in the linear regime of measurement devices. This is showcased in a large number of published articles [see, e.g. (Hass et al. 2019) for a collection of models and datasets]. In fact, there are custom methods for experimental data for which a linear mapping with unknown scaling and offset parameters can be assumed (Loos et al. 2018, Schmiester et al. 2019). If the linearity assumption is not fulfilled, it is usually assumed that the mapping from biochemical quantities of interest to measured output is monotone. This monotonicity ensures that the ordering is preserved and allows the use of approaches for ordinal data, such as the optimal scaling method (Shepard 1962). For ODE models, this approach has recently been accelerated by using a reformulation of the optimization problem (Schmiester et al. 2020) and gradient information (Schmiester et al. 2021b). However, in this approach, all quantitative information is discarded and the defined objective function is not based on probabilistic grounds, disallowing any uncertainty analysis.
In this manuscript, we introduce a spline-based approach to use semi-quantitative data—which are obtained using an experimental technique with a nonlinear but monotone mapping—for parameter estimation. We assume that the measurement mappings are increasing monotonically. This is frequently observed in experiments: values with a larger measurement value are assumed to correspond to larger biochemical quantities of interest. The method reconstructs the unknown mapping function using a statistically coherent formulation that facilitates uncertainty analysis. We demonstrate the credibility of the proposed approach as a tool for uncovering measurement mapping shapes. To illustrate the parameter inference capabilities of the method, we benchmark its performance with a collection of published models. Furthermore, we derive formulas for the analytical calculation of the gradients of the objective function in hierarchical optimization. To evaluate this optimization framework, we compare its efficiency with alternative approaches.
2 Materials and methods
2.1 Mechanistic modeling of biological systems
2.1.1 Linear semi-quantitative (relative) observables
2.1.2 Nonlinear semi-quantitative observables

Illustration of the spline estimation approach. (A) Model mechanistic parameters are iteratively updated during parameter estimation. For each vector of trial parameters, the model is simulated to obtain the simulation y. Then, the spline parameters are optimized and used to calculate the objective function J and its gradient . These are then passed on to obtain the next trial parameter vector, or the optimization is halted. (B) The spline (green) enables mapping of the simulation of the model y (biochemical quantities) to the measurement axis. This allows for the definition of a likelihood objective function. In the inner problem, this objective function is minimized with respect to the spline parameters to obtain optimal spline parameters . The spline is additionally regularized by the distance to the linear mapping (orange)
The model dataset consists of observations of all model observables at time-points . We denote the dataset of the ith observable as .
2.2 Parameter estimation
2.2.1 Hierarchical optimization problem and analytical gradients
In the outer optimization problem (8), we estimate the mechanistic parameters , and in the inner optimization problems (9) we estimate the observable parameters of each observable. For nonlinear semi-quantitative observables, the inner problem is additionally constrained by the positivity of spline parameters. Since the objective function can be additively separated into components depending on the observable parameters of a single observable , the inner optimization problem is a set of small inner optimization problems (9). For relative observables, the inner problems (9) can be solved analytically (Schmiester et al. 2019). For nonlinear semi-quantitative observables, they still need to be numerically minimized, but the inner problems are convex and thus easy to minimize. Second, the gradient of the objective function can be calculated analytically. We formulate and prove the two statements above in Theorems 1 and 2 of the supplementary materials.
2.3 Confidence region of a parameter vector
2.4 Scalability and complexity of the proposed method
The inner optimization objective functions for semi-quantitative observables are convex and self-concordant. Thus, their numerical optimization using barrier methods scales mostly with the number of inequality constraints in the inner problem (Boyd and Vandenberghe 2004), i.e. with the number of spline parameters of the spline mapping (5). In most applications, this number should be set to a small value (5–10), as this already provides sufficient modeling capacity for most nonlinear measurement mappings and also reduces overfitting. Therefore, in larger ODE systems, the optimization of inner problems for semi-quantitative observables constitutes a small part of computational complexity and the proposed method scales linearly in the number of semi-quantitative observables.
2.5 Benchmark models
For the evaluation of the proposed method, we consider one exemplary model and four published models that were previously developed and calibrated for different biological systems (Table 1). As the published models originally did not contain nonlinear semi-quantitative observables, we generated synthetic data at the same time points, chose nonlinear monotone measurement mappings, applied them to the observables, and corrupted them with the same type of noise as in the original model. For details on the synthetic data generation, chosen nonlinear measurement mappings, and model structure, we refer to the second section of the supplementary material.
Model . | . | . | . | . | Reference . |
---|---|---|---|---|---|
T1 | 2 | 4 | 1 | 12 | Birtwistle et al. (2011) |
M1 | 8 | 6 | 3 | 48 | Boehm et al. (2014) |
M2 | 7 | 9 | 1 | 23 | Rahman et al. (2016) |
M3 | 8 | 18 | 1 | 58 | Elowitz and Leibler (2000) |
M4 | 14 | 18 | 8 | 205 | Raia et al. (2011) |
Model . | . | . | . | . | Reference . |
---|---|---|---|---|---|
T1 | 2 | 4 | 1 | 12 | Birtwistle et al. (2011) |
M1 | 8 | 6 | 3 | 48 | Boehm et al. (2014) |
M2 | 7 | 9 | 1 | 23 | Rahman et al. (2016) |
M3 | 8 | 18 | 1 | 58 | Elowitz and Leibler (2000) |
M4 | 14 | 18 | 8 | 205 | Raia et al. (2011) |
By we denote the cardinality of the dataset.
Model . | . | . | . | . | Reference . |
---|---|---|---|---|---|
T1 | 2 | 4 | 1 | 12 | Birtwistle et al. (2011) |
M1 | 8 | 6 | 3 | 48 | Boehm et al. (2014) |
M2 | 7 | 9 | 1 | 23 | Rahman et al. (2016) |
M3 | 8 | 18 | 1 | 58 | Elowitz and Leibler (2000) |
M4 | 14 | 18 | 8 | 205 | Raia et al. (2011) |
Model . | . | . | . | . | Reference . |
---|---|---|---|---|---|
T1 | 2 | 4 | 1 | 12 | Birtwistle et al. (2011) |
M1 | 8 | 6 | 3 | 48 | Boehm et al. (2014) |
M2 | 7 | 9 | 1 | 23 | Rahman et al. (2016) |
M3 | 8 | 18 | 1 | 58 | Elowitz and Leibler (2000) |
M4 | 14 | 18 | 8 | 205 | Raia et al. (2011) |
By we denote the cardinality of the dataset.
3 Results
3.1 The proposed method uncovers measurement mapping for FRET probe activation
To illustrate an application of the proposed method, we have applied it to a FRET probe activation model introduced by Birtwistle et al. (2011). In general, the transition of inactive FRET probes P to an active state can be represented by the scheme in Fig. 3A. The quantity of interest in this model is the ratio of activated probes to total probes . The most common way to measure this value is through a measurement technique called ratiometric imaging. Cells are exposed to excitation light from the donor channel, and then fluorescence emission is divided into donor and acceptor channels. The output of ratiometric imaging, R, is the intensity in the acceptor channel, , divided by the intensity in the donor channel, . Previous studies have shown that this measured R value can have a highly nonlinear relationship to the fraction of active FRET probes (Birtwistle et al. 2011) (Fig. 3A).

A model of FRET probe activation. (A) A forward enzyme catalyzes the activation of inactive FRET probes P, and a reverse enzyme catalyzes the conversion of an active probe into the inactive state. Active probe concentration can be observed via ratiometric imaging. The measurement mapping of this process is highly nonlinear. and are fractions of the acceptor and donor emissions that the acceptor channel captures, respectively. is the fraction of donor emissions that the donor channel captures. (B) Comparison of the estimation of the measurement mapping using a linear function, proposed spline mapping, and parameterization of the true mapping. For all three models, we performed 1000 local optimizations. Depicted are the estimated mappings closest to the true mapping of starts with mechanistic parameters in the 95% confidence region. We show the synthetic noise-corrupted data used in all model optimizations in blue squares
To evaluate how well the three modeling approaches can recover the true measurement mapping, we performed 1000 local optimizations for each and chose the best measurement mapping estimates in the 95% confidence region (Fig. 3B). We found that the reconstruction using spline estimation agrees well with the true measurement mapping. Indeed, it yields similar results to using parametric representations with unknown parameters. In contrast, a linear model for the measurement mapping proved to be insufficiently flexible and resulted in biased reconstruction of the measurement mapping.
Overall, our assessment revealed that, unlike a simple linear approximation for the measurement mapping, a spline-based approximation enables the reconstruction of nonlinear mappings as observed for FRET probe activation.
3.2 The spline estimation approach as a tool for uncovering measurement mapping shapes
In the previous subsection, we have shown that the estimation of an unknown measurement mapping using a spline can yield results similar to the estimation of a parametric representation. Yet, we only considered a point estimate and did not assess the reliability of the reconstruction. To determine whether the proposed approach provided statistically coherent estimates, we considered model M1 with measurement mappings of various shapes across observables (Fig. 4A–C). Using the resulting dataset, we performed a multi-start optimization ( runs) to obtain optimal parameters and Markov chain Monte Carlo sampling using an adaptive Metropolis-Hastings algorithm ( iterations). The resulting chain was thinned by a factor of 500. We computed the optimal spline for each of the remaining samples and, with them, constructed the credibility intervals of the optimal spline mappings.

Credibility intervals of the estimated spline mappings. MCMC sampling of the model M1. The model contains different synthetic nonlinear measurement mappings for each of its observables (dashed blue). Splines were estimated for each 500th sample of the MCMC run with which we constructed the credibility intervals of the estimated spline mappings (light green). For visibility, we show only each 20th estimated spline (dark green)
The inspection of the results confirmed that the optimal splines are qualitatively similar to the measurement mappings used for data generation. Furthermore, more importantly, the measurement mappings used for data generation lie within the credibility intervals. This showcases the reliability of the method as a tool for discovering curve shapes of unknown measurement mappings.
3.3 Hierarchical optimization and analytical gradients increase the estimation efficiency
As the method provides reliable estimates for the mappings, we turn to the assessment of the computational cost, which is of high practical relevance. Here, we also wanted to evaluate the impact of (i) reformulation as a hierarchical problem and (ii) the availability of analytical gradients. For this assessment, we considered the published models M1 to M4 with synthetic data with a range of different measurement mappings, as detailed in the second section of the supplementary material. For all models, we performed 1000 local optimizations with equal start points across approaches. We then determined the overall computational cost, the number of function evaluations, and the number of converged starts per computation time.
We found that in general, the proposed hierarchical approach with analytical gradients achieves the best performance (Fig. 5). This appears to be mostly related to a reduction in the computation time, respectively, the number of function evaluations (Fig. 5A and B), while the number of converged starts remains rather similar (Fig. 5C). The number of converged starts per computation time is, for the hierarchical approach with analytical gradients, at least twice as high for the other approaches (Fig. 5D). Interestingly, a hierarchical approach without gradient information does not perform well, and is worse than the nonhierarchical approach with gradient information for all models.

Evaluation of the gradient-based nonhierarchical, gradient-free hierarchical, and gradient-based hierarchical estimation approaches. Models M1–M4 are shown from left to right. (A) Comparisons of computation time. (B) Comparisons of the number of function evaluations. (C) Comparisons of the number of converged starts. Converged starts are defined as the starts with estimated mechanistic parameters within the 95% confidence region. (D) Comparison of the number of converged starts per CPU hour
For the proposed hierarchical approach with analytical gradients, the number of converged starts per CPU hour was on average . As the spread between models was large, this finding clearly suggests that the approach is computationally tractable.
3.4 Spline approach improves the parameter inference of models with unknown measurement mappings
Our proposed method provides reliable estimates of measurement mappings. Here, we examine whether this leads to good estimates of the mechanistic model parameters. Apart from spline estimation, a generally applicable approach to the integration of semi-quantitative data into parameter estimation is linear estimation of measurement mappings. Thus, we compare the parameter inference of these two approaches in a realistic setting. In addition, for reference, we include the approach of discarding data with unknown measurement mappings. We performed 1000 local optimizations for the application examples M1–M4. We evaluated the impact of an increasing number of unknown measurement mappings by turning quantitative observables into semi-quantitative observables. As a parameter inference metric, we use the mean L2 distance of the estimated to the true mechanistic parameters normalized by the number of mechanistic parameters. For details of the study setting, we refer to the fourth section of the supplementary materials.
The spline estimation outperforms other approaches (Fig. 6). In general, linear estimation has a stronger bias than variance. We observe this primarily for model M4, as the linear estimation has the smallest standard deviation between approaches (Fig. 6D). In some cases, this even causes the linear estimation to perform worse than the approach of discarding data with unknown mappings. In contrast, the higher flexibility of the spline estimation allows for the general attainment of better parameter estimates. This is the case even for model M4 with eight unknown measurement mappings, for which the spline estimation adds seven times more parameters than the linear estimation. For a small number of unknown measurement mappings, the spline estimation can perform almost equally well as the model with completely known measurement mappings. This showcases that the proposed method yields good estimates of the mechanistic model parameters, especially when the number of unknown measurement mappings is low.

Evaluation of parameter inference across the number of unknown measurement mappings for linear and spline estimation. The parameter inference is measured by the L2 distance of the estimated to the true mechanistic model parameters. On the x-axis of each plot, we mark the model variant with a certain number of unknown measurement mappings, ranging from 0 to the number of model observables. The distance for each model variant is normalized by the number of mechanistic parameters and averaged across combinations of known-unknown observables. The best-case scenario for each model M1–M4 is the case of completely known measurement mappings (blue). We compare the distance to the true parameters for the linear (orange) and spline (green) approach for each number of unknown measurement mappings. The approach of discarding the data of observables with unknown measurement mappings (purple) is depicted for reference. This approach is not feasible for the model variants with a maximum number of unknown measurement mappings, as that would involve the removal of all data, so we denote this with N/A
4 Discussion
Semi-quantitative measurements represent a large portion of the available data that can be used to estimate unknown mechanistic parameters of ODE models. Among others, examples include spatial protein expression assays important in developmental biology, such as chemical staining, fluorescent expression, and immunohistochemistry (Brooks et al. 2012). When these are well-controlled, they are expected to linearly transform the true concentration into an image intensity. However, this is not always true: in the case of nonadequate procedural care, hard-to-control outer factors, or insufficient knowledge of the entire experimental system, the transformation function may not be available and can take on a nonlinear form. Here, we address this challenge by introducing a spline-based method for the estimation of unknown nonlinear mappings. The approach can be applied to models with quantitative data for which it is unclear whether the data are truly linear (Fig. 7). Depending on the estimated optimal splines, the data can be deemed to be quantitative, relative, censored, or semi-quantitative, so that an appropriate method can be used. If the estimated optimal spline is nonlinear, one can choose to estimate a parameterized function of a similar shape, or continue using the optimal splines as the measurement mapping. In this way, the method allows for the integration of data previously usable only as qualitative. Furthermore, it can give a clearer understanding of the underlying experimental procedures.

Diagram of application of the spline method to data with possible nonlinear measurement mappings. Estimated optimal splines are depicted in green. Each arrow from the central square is a possible outcome of the nonlinear mapping estimation
However, one has to be conservative with the number of observables chosen for spline estimation, as it can lead to a large expansion of the parameter space dimension. We evaluated the reliability of this process using an example, showing consistent qualitative measurement mapping shapes. An obvious extension of this approach is the inclusion of symbolic function identification from the estimated optimal splines. This would constitute an automatic parameterization of the unknown measurement mappings.
To increase the method’s efficiency, we employed a hierarchical gradient-based optimization approach. We evaluated its performance and compared it with alternative approaches for four published models with differences in their complexity. This revealed a higher computational efficiency across all models, allowing for faster estimation of parameters for a given model. Further optimization acceleration could be achieved by including adjoint sensitivity analysis (ASA) (Kokotovic and Heller 1967, Fröhlich et al. 2017). Although our inner problem is not solved exactly, in Theorem 2 of the supplementary, we show that its gradient contribution is still zero. Thus, existing ASA software implementations from Schmiester et al. (2019) can be used, since the gradient computation is the same as in hierarchical optimization with an exactly solved inner problem. Complementary to this, the derivation of second-order derivatives could further improve the method’s convergence and, with it, its computational efficiency.
The proposed method employs piecewise linear splines to estimate general nonlinear mappings. This was the simplest first-pass option, but, as they are not smooth, they had unavoidable approximation errors. Therefore, it is valuable to explore alternative smooth and flexible parameterized functions. Furthermore, they should retain the convexity of the inner optimization problems and the possibility of analytical gradient calculation. Interesting candidates are the scaled cumulative distribution functions (CDFs) of the beta distribution. They are monotone by definition, parameterized by only three parameters, and with promising flexibility to be able to model most types of measurement nonlinear mappings.
The models for which the method was developed are based on ODE systems primarily because of their widespread prevalence. However, the method can be used more generally. It requires only the model simulations, sensitivities, and the definition of the objective function as a negative log-likelihood. Thus, any model that satisfies these constraints can be incorporated to integrate semi-quantitative data into its estimation of parameters.
In conclusion, we developed and implemented an easy-to-use, computationally efficient framework to uncover unknown nonlinear measurement mappings and to integrate semi-quantitative data into the parameter estimation of ODE models. The approach has a user-friendly implementation in the open-source Python Parameter Estimation TOolbox (pyPESTO). As it is agnostic to the structure of the underlying dynamical model, the method can be applied to models from different research fields, such as physics and engineering.
Author contributions
J.H. conceived the project. D.D. implemented the proposed approach and conducted all studies. S.G. and J.H. provided critical feedback on the implementation development. D.D. and J.H. wrote the manuscript. All authors discussed the results and commented on the manuscript.
Supplementary data
Supplementary data are available at Bioinformatics online.
Conflict of interest
None declared.
Funding
This work was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy [EXC 2047—390685813, EXC 2151—390873048] and under the project IDs 432325352—SFB 1454 and 443187771—AMICI, by the German Federal Ministry of Education and Research (BMBF) within the e:Med funding scheme [junior research alliance PeriNAA, 01ZX1916A] and under the CompLS program [EMUNE, 031L0293C], and by the University of Bonn (via the Schlegel Professorship of J.H.).
Data availability
The proposed method is implemented in the open-source Python Parameter Estimation TOolbox (pyPESTO) (schalte2023pypesto). Models M1-M4 were taken from the PEtab benchmark collection (Schmiester et al. 2021a) based on Hass et al. (2019). For ODE integration, we used the AMICI Python toolbox (Fröhlich et al. 2021). Gradient-based optimization was performed using the fides optimizer (Fröhlich and Sorger 2022) and gradient-free optimization was performed with the SciPy Powell algorithm (Jones et al. 2001). Both optimizers were used through the pyPESTO interface with the default optimizer settings. All the code and models used in this study are available from the Zenodo database at https://doi.org/10.5281/zenodo.10568951.