MARSI: metabolite analogues for rational strain improvement

Cardoso, João G R; Zeidan, Ahmad A; Jensen, Kristian; Sonnenschein, Nikolaus; Neves, Ana Rute; Herrgård, Markus J

doi:10.1093/bioinformatics/bty108

Abstract

Summary

Metabolite analogues (MAs) mimic the structure of native metabolites, can competitively inhibit their utilization in enzymatic reactions, and are commonly used as selection tools for isolating desirable mutants of industrial microorganisms. Genome-scale metabolic models representing all biochemical reactions in an organism can be used to predict effects of MAs on cellular phenotypes. Here, we present the metabolite analogues for rational strain improvement (MARSI) framework. MARSI provides a rational approach to strain improvement by searching for metabolites as targets instead of genes or reactions. The designs found by MARSI can be implemented by supplying MAs in the culture media, enabling metabolic rewiring without the use of recombinant DNA technologies that cannot always be used due to regulations. To facilitate experimental implementation, MARSI provides tools to identify candidate MAs to a target metabolite from a database of known drugs and analogues.

Availability and implementation

The code is freely available at https://github.com/biosustain/marsi under the Apache License V2. MARSI is implemented in Python.

Supplementary information

Supplementary data are available at Bioinformatics online.

1 Introduction

Genome-scale metabolic models (GEMs) describe the biochemical reactions in an organism and their relation to the proteome and genome (McCloskey et al., 2013). These models comprehensively represent natural metabolism and they are useful for predicting the effect of metabolite analogues (MAs) as therapeutics (Agren et al., 2014; Kim et al., 2014).

Non-rational strategies such as mutagenesis and selection or laboratory evolution can be used to develop industrial strains when the use of recombinant DNA technology is not allowed due to regulations (Derkx et al., 2014; Hansen et al., 2017). MAs, inhibiting the enzymatic conversion of the target metabolite, act as metabolite knockouts and can be used as the selective pressure in non-rational strategies to shape the metabolism of microorganisms (Sørensen et al., 2016).

Here, we present software that implements workflows to identify metabolite knockouts instead of gene or reaction knockouts (Figure 1A). We also provide a pipeline to identify structural analogues for those targets.

2 Materials and methods

The first workflow consists of systematically replacing reaction knockouts (identified by other strain design methods) by metabolite knockouts, until we can find metabolite targets that result in a similar flux distribution. The second workflow consists of searching for metabolite targets using heuristic optimization, without the need to specify reaction knockouts a priori. A metabolite knockout consists of blocking all reactions consuming a given metabolite, excluding transporters.

After identifying the metabolite targets, we search for MAs similar to them. We compiled a database of potential MAs from publicly available sources (see Supplementary Material). We use OpenBabel (O’Boyle et al., 2011) and RDKit (2017) (http://www.rdkit.org) to calculate the features used to compare candidate MAs to the target metabolite: number of atoms/bonds/rings, MACCs fingerprints, Tanimoto coefficient (TC) and structural similarity score (SS).

3 Results

We implemented a software package containing algorithms to generate strain design strategies using MAs. Our software could generate metabolite targets for a published knockout-based design (Harder et al., 2016). We also provide the tools to identify candidate MAs that could be used for implementation of the designs.

3.1 Identification of replacement targets

We used an experimentally validated strain design for itaconic acid production in Escherichia coli (Harder et al., 2016) and the E.coli GEM iJO1366 (Orth et al., 2014) to demonstrate the use of MARSI. MARSI identified acetyl-phosphate as a metabolite knockout target that can replace the Phosphotransacetylase (PTAr) reaction knockout and sustain the same flux for itaconic acid production (Table 1). Using a SS cut-off of 0.5 (see Supplementary Material), we found 182 MAs for acetyl-phosphate (Supplementary Table S1 shows the top 10 hits). More examples of replacement targets in other E.coli strain designs can be found in Supplementary Material.

Table 1.

Knockout replacements for the strain design

Non-replaced knockouts	Replaced reaction	Metabolite	Original fitness	New fitness
PTA2, ICL, ALDD2x, PYK, SUCOAS, GGGABADr	PTAr	Acetyl-P	0.001	0.001

Non-replaced knockouts	Replaced reaction	Metabolite	Original fitness	New fitness
PTA2, ICL, ALDD2x, PYK, SUCOAS, GGGABADr	PTAr	Acetyl-P	0.001	0.001

We use Biomass Product Coupled Yield (Patil et al., 2005) as fitness measure. Reaction Ids: Phosphate acetyltransferase (PTA), Isocitrate lyase (ICL), Aldehyde dehydrogenase (ALDD2x), Pyruvate kinase (PK), Succinyl-CoA synthetase (SUCOAS) and Gamma-glutamyl-gamma aminobutyric acid dehydrogenase (GGGABADr).

Table 1.

Knockout replacements for the strain design

Non-replaced knockouts	Replaced reaction	Metabolite	Original fitness	New fitness
PTA2, ICL, ALDD2x, PYK, SUCOAS, GGGABADr	PTAr	Acetyl-P	0.001	0.001

Non-replaced knockouts	Replaced reaction	Metabolite	Original fitness	New fitness
PTA2, ICL, ALDD2x, PYK, SUCOAS, GGGABADr	PTAr	Acetyl-P	0.001	0.001

We use Biomass Product Coupled Yield (Patil et al., 2005) as fitness measure. Reaction Ids: Phosphate acetyltransferase (PTA), Isocitrate lyase (ICL), Aldehyde dehydrogenase (ALDD2x), Pyruvate kinase (PK), Succinyl-CoA synthetase (SUCOAS) and Gamma-glutamyl-gamma aminobutyric acid dehydrogenase (GGGABADr).

3.2 Query calibration with known MAs

In order to validate the ability of MARSI to find known analogues for a target metabolite, we selected 42 known metabolite-MA pairs from the literature (Supplementary Table S3). We compared the structural features between the MAs and their target metabolites (Supplementary Fig. S1). We used a distance of 4 for the number of atoms, 3 for the number of bonds and 2 for the number of rings as our query cut-off. The TC cut-off changes dynamically with the size of the metabolites (see Supplementary Material). In Figure 1B, we show the SS and TC for different targets and their known analogues as well as the best hit analogue in the database. For most targets MARSI found candidate MAs that showed higher structural similarity to the target metabolite than the known analogue.

Fig. 1.

Open in new tab Download slide

Metabolite target identification workflow and examples of MA targets. (A) The workflow for identifying for metabolite knockouts and candidate MAs. (B) Comparison between the known MAs (columns 1 and 2) and the best MARSI hits (columns 3 and 4) used to calibrate the search parameters. We show the TC and the SS. We highlighted rows where the best MARSI hit and the known MA are the same

Acknowledgements

We would like to thank Miguel Campodonico for input on chemoinformatics tools.

Funding

This work has been supported by the Novo Nordisk Foundation. This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 686070.

Conflict of Interest: none declared.

References

Agren

R.

et al. (

2014

)

Identification of anticancer drugs for hepatocellular carcinoma through personalized genome-scale metabolic modeling

.

Mol. Syst. Biol

.,

10

,

721

–

721

.

Derkx

P.M.

et al. (

2014

)

The art of strain improvement of industrial lactic acid bacteria without the use of recombinant DNA technology

.

Microb. Cell. Fact.

,

13

(Suppl. 1), S5.

Google Scholar

OpenURL Placeholder Text

WorldCat

Hansen

A.S.L.

et al. (

2017

)

Systems biology solutions for biochemical production challenges

.

Curr. Opin. Biotechnol

.,

45

,

85

–

91

.

Harder

B.-J.

et al. (

2016

)

Model-based metabolic engineering enables high yield itaconic acid production by Escherichia coli

.

Metab. Eng

,

38

,

29

–

37

.

Kim

H.U.

et al. (

2014

)

Integrative genome-scale metabolic analysis of Vibrio vulnificus for drug targeting and discovery

.

Mol. Syst. Biol

.,

7

,

460

–

460

.

Google Scholar

Crossref

WorldCat

McCloskey

D.

et al. (

2013

)

Basic and applied uses of genome-scale metabolic network reconstructions of Escherichia coli

.

Mol. Syst. Biol

.,

9

,

661

.

O’Boyle

N.M.

et al. (

2011

)

Open Babel: an Open chemical toolbox

.

J. Cheminform

.,

3

,

1

–

14

.

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

Orth

J.D.

et al. (

2011

)

A comprehensive genome-scale reconstruction of Escherichia coli metabolism–2011

.

Mol. Syst. Biol

.,

7

,

535

.

Patil

K.R.

et al. (

2005

)

Evolutionary programming as a platform for in silico metabolic engineering

.

BMC Bioinformatics

,

6

,

308

.

RDKit.

(

2017

)

Cheminformatics and machine Learning Software

. http://www.rdkit.org (31 August 2017, date last accessed)

OpenURL Placeholder Text

WorldCat

Sørensen

K.I.

et al. (

2016

)

Enhancing the sweetness of yoghurt through metabolic remodeling of carbohydrate metabolism in Streptococcus thermophilus and Lactobacillus delbrueckii subsp. bulgaricus

.

Appl. Environ. Microbiol

.,

82

,

3683

–

3616

.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Associate Editor:

Download all slides

Month:	Total Views:
February 2018	543
March 2018	284
April 2018	90
May 2018	50
June 2018	113
July 2018	265
August 2018	260
September 2018	41
October 2018	41
November 2018	60
December 2018	63
January 2019	218
February 2019	166
March 2019	88
April 2019	64
May 2019	42
June 2019	54
July 2019	52
August 2019	40
September 2019	52
October 2019	60
November 2019	24
December 2019	32
January 2020	37
February 2020	29
March 2020	35
April 2020	24
May 2020	16
June 2020	64
July 2020	73
August 2020	15
September 2020	46
October 2020	47
November 2020	14
December 2020	29
January 2021	14
February 2021	9
March 2021	21
April 2021	31
May 2021	29
June 2021	30
July 2021	16
August 2021	5
September 2021	12
October 2021	21
November 2021	25
December 2021	14
January 2022	28
February 2022	19
March 2022	18
April 2022	22
May 2022	26
June 2022	13
July 2022	19
August 2022	22
September 2022	22
October 2022	35
November 2022	26
December 2022	22
January 2023	15
February 2023	14
March 2023	11
April 2023	10
May 2023	22
June 2023	22
July 2023	8
August 2023	19
September 2023	12
October 2023	13
November 2023	18
December 2023	24
January 2024	30
February 2024	47
March 2024	18
April 2024	16

Article Contents

MARSI: metabolite analogues for rational strain improvement

Abstract

1 Introduction

2 Materials and methods

3 Results

3.1 Identification of replacement targets

3.2 Query calibration with known MAs

Acknowledgements

Funding

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

Article Contents

MARSI: metabolite analogues for rational strain improvement

Abstract

1 Introduction

2 Materials and methods

3 Results

3.1 Identification of replacement targets

3.2 Query calibration with known MAs

Acknowledgements

Funding

References

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

This Feature Is Available To Subscribers Only