- Split View
-
Views
-
Cite
Cite
Annika Röhl, Tanguy Riou, Alexander Bockmayr, Computing irreversible minimal cut sets in genome-scale metabolic networks via flux cone projection, Bioinformatics, Volume 35, Issue 15, August 2019, Pages 2618–2625, https://doi.org/10.1093/bioinformatics/bty1027
- Share Icon Share
Abstract
Minimal cut sets (MCSs) for metabolic networks are sets of reactions which, if they are removed from the network, prevent a target reaction from carrying flux. To compute MCSs different methods exist, which may fail to find sufficiently many MCSs for larger genome-scale networks.
Here we introduce irreversible minimal cut sets (iMCSs). These are MCSs that consist of irreversible reactions only. The advantage of iMCSs is that they can be computed by projecting the flux cone of the metabolic network on the set of irreversible reactions, which usually leads to a smaller cone. Using oriented matroid theory, we show how the projected cone can be computed efficiently and how this can be applied to find iMCSs even in large genome-scale networks.
Software is freely available at https://sourceforge.net/projects/irreversibleminimalcutsets/.
Supplementary data are available at Bioinformatics online.
1 Introduction
Constraint-based modelling of metabolic networks has become a widely used approach in computational systems biology (Bordbar et al., 2014; Lewis et al., 2012; O’Brien et al., 2015; Terzer et al., 2009). Typically a metabolic network is given by a set of (internal) metabolites, a set of reactions, a stoichiometric matrix and a set of irreversible reactions. An example is given in Figure 1. The set of reversible reactions is . We always consider metabolic networks at steady-state and call a vector a (feasible) flux vector if Sv = 0 and . The set of all flux vectors defines the flux cone.
This work is about subsets of reactions that are called cut sets. If the reactions in ζ are removed from , other reactions called target reactions are not able to carry non-zero flux anymore. A cut set ζ is called (inclusion-)minimal if no proper subset of ζ is a cut set for the pre-defined target reactions. Minimal cut sets (MCSs) were first introduced in Klamt and Gilles (2004) and have many applications (Apaolaza et al., 2017; Clark and Verwoerd, 2012; Gerstl et al., 2016; Gruchattka et al., 2013; Harder et al., 2016; Imielinski and Belta, 2008; Klamt, 2006; Machado and Herrgård, 2015; von Kamp and Klamt, 2017; Wilhelm et al., 2004). Various methods exist to compute MCSs (Ballerstein et al., 2012; Goldstein and Bockmayr, 2015; Haus et al., 2008; Jungreuthmayer et al., 2013; Nair et al., 2017; Tobalina et al., 2016; von Kamp and Klamt, 2014).
One of the very first methods to compute MCSs (Klamt, 2006; Klamt and Gilles, 2004) is to use the set of all elementary flux modes (EFMs). EFMs (Schuster and Hilgetag, 1994) are flux vectors with (inclusion-)minimal support, where the support is the set of active reactions in v. One can show that, up to scaling, an EFM is uniquely determined by its support. Therefore, two EFMs are considered to be the same if . The set of all EFMs defines a generating set for the flux cone . This means that every flux vector can be represented as a conical combination with . Thus, every (steady-state) behaviour of the network can be represented with the help of EFMs. Over the last 20 years, various methods have been developed to compute the set of all EFMs in a given metabolic network (Fukuda and Prodon, 1996; Gagneur and Klamt, 2004; Pfeiffer et al., 1999; Terzer, 2009; Urbanczik and Wagner, 2005b).
The procedure to compute MCSs using EFMs (Klamt, 2006) works as follows: Suppose all EFMs are known and a target reaction is given. Let be the set of those EFMs that involve the target reaction. Given , we search for hitting sets, i.e. sets of reactions such that each EFM in contains at least one reaction of the hitting set. Inclusion-minimal hitting sets are the MCSs we are looking for.
(Using EFMs to compute MCSs) For the network in Figure 1 there exist 18 EFMs, a complete list can be found in Supplementary Material, Section 1. We choose reaction 7 as the target reaction. There are four EFMs using reaction 7 with the supports , {4, 6, 7}. In all these EFMs, reaction 6 is active. Therefore, if reaction 6 is removed, there will be no feasible flux vector using reaction 7. Consequently, the set {6} is an MCS for the target reaction 7. MCSs of cardinality 2 are {1, 4} and {2, 4}. The set {1, 6} is not an MCS since it contains the MCS {6}. The remaining MCSs are .
A second way to compute MCSs is based on the Farkas’ lemma and linear programming duality (Ballerstein et al., 2012; Larhlimi and Bockmayr, 2007; von Kamp and Klamt, 2014). As shown in Ballerstein et al. (2012), the MCSs in a network for a given target reaction correspond to EFMs in a dual network. Intuitively speaking, the metabolites in correspond to the reactions in and vice versa. The stoichiometric matrix of is the transposed stoichiometric matrix of the primal network , where some columns have to be added. Exploiting this duality, one can use mixed integer linear programming (MILP) to enumerate MCSs of increasing size (Larhlimi and Bockmayr, 2007; von Kamp and Klamt, 2014). Further extending these methods, Tobalina et al. (2016) compute MCSs that contain a pre-defined knock-out reaction by enumerating EFMs in the dual network in which is active.
Computing MCSs via EFMs, either in the primal or in the dual network, gets the more and more difficult, the larger these networks become. To address this problem, we propose here to use projections in order to reduce the size of the underlying polyhedral cones. The projection of the flux cone onto the set of irreversible reactions leads to a smaller cone , which is again a flux cone, but where only irreversible reactions are active. To compute MCSs, we can apply the methods explained before to instead of in order to compute MCSs. The resulting cuts are MCSs of that consist of irreversible reactions only. Therefore, we call them irreversible minimal cut sets (iMCSs).
(iMCSs) Given a metabolic network a set of reactions is called a cut set for a target reaction , if vt = 0 for all flux vectors with vi = 0, for all . For , an iMCS is an (inclusion-)MCSs consisting of irreversible reactions only.
We will show that the EFMs of the projection are the minimal metabolic behaviours (MMBs) of the network . A metabolic behaviour (Larhlimi and Bockmayr, 2009) is a non-empty set of irreversible reactions that are active together in some flux vector , i.e. . A metabolic behaviour is minimal if there is no metabolic behaviour strictly contained in D. This definition is similar to the one of EFMs with the difference that in the case of MMBs only irreversible reactions are considered. The number of MMBs may be by several orders of magnitude smaller than the number of EFMs, see Larhlimi and Bockmayr (2009) and Table 1. We can use MMBs in the same way as EFMs in order to compute iMCSs instead of MCSs. Suppose the whole set of MMBs is known and an irreversible target is chosen. The set consists of all MMBs which contain the target reaction. Within minimal hitting sets are searched, which are MCSs. Since they consist of irreversible reactions only, they are iMCSs, see Example 2 for illustration.
Network . | rxns . | irr . | MMBs . | EFMs . | Time MMBs (s) . | Target MMBs . | iMCSs . | Time iMCSs . |
---|---|---|---|---|---|---|---|---|
Escherichia coli MG1655 (Orth et al., 2010) | 87 | 41 | 2572 | 100 274 | 5.7 | 1151 | 257 | 113 s |
Rhizobium etli iOR363 (Resendis-Antonio et al., 2007) | 194 | 104 | 6147 | N/A | 26 | 651 | 60 | 60 s |
Buchnera iSM197 (MacDonald et al., 2011) | 244 | 170 | 245 | N/A | 7 | 146 | 200 | 130 s |
Blattabacterium cuenoti iCG238 (González-Domenech et al., 2012) | 308 | 197 | 374 | N/A | 7.7 | 128 | 184 | 138 s |
Blattabacterium iCG230 (González-Domenech et al., 2012) | 400 | 192 | 82 | N/A | 6.7 | 32 | 159 | 115 s |
Mycobacterium tuberculosis iNJ661 (Jamshidi and Palsson, 2007) | 427 | 296 | 501 | N/A | 13 | 350 | 381 | 449 s |
Salmonella Typhimurium STM_v1_0 (Thiele et al., 2011) | 458 | 316 | 97 | N/A | 11.8 | 96 | 321 | 386 s |
Helicobacter pylori iCS291 (Schilling et al., 2002) | 444 | 271 | 150, 132 | N/A | 325 | 148 608 | 187 | 5 h 44 min 1 s |
Network . | rxns . | irr . | MMBs . | EFMs . | Time MMBs (s) . | Target MMBs . | iMCSs . | Time iMCSs . |
---|---|---|---|---|---|---|---|---|
Escherichia coli MG1655 (Orth et al., 2010) | 87 | 41 | 2572 | 100 274 | 5.7 | 1151 | 257 | 113 s |
Rhizobium etli iOR363 (Resendis-Antonio et al., 2007) | 194 | 104 | 6147 | N/A | 26 | 651 | 60 | 60 s |
Buchnera iSM197 (MacDonald et al., 2011) | 244 | 170 | 245 | N/A | 7 | 146 | 200 | 130 s |
Blattabacterium cuenoti iCG238 (González-Domenech et al., 2012) | 308 | 197 | 374 | N/A | 7.7 | 128 | 184 | 138 s |
Blattabacterium iCG230 (González-Domenech et al., 2012) | 400 | 192 | 82 | N/A | 6.7 | 32 | 159 | 115 s |
Mycobacterium tuberculosis iNJ661 (Jamshidi and Palsson, 2007) | 427 | 296 | 501 | N/A | 13 | 350 | 381 | 449 s |
Salmonella Typhimurium STM_v1_0 (Thiele et al., 2011) | 458 | 316 | 97 | N/A | 11.8 | 96 | 321 | 386 s |
Helicobacter pylori iCS291 (Schilling et al., 2002) | 444 | 271 | 150, 132 | N/A | 325 | 148 608 | 187 | 5 h 44 min 1 s |
Note: Network: name of the metabolic network. rxns: number of unblocked reactions of the network. irr: number of unblocked irreversible reactions of the network. MMBs: number of MMBs. EFMs: number of EFMs. N/A: the programmes used here (polco and efmtool, http://www.csb.ethz.ch/tools/software) could not handle the size of the models resp. the number of EFMs. Time MMBs: Time to compute MMBs using projection. Target MMBs: number of MMBs involving the target reaction (biomass reaction). iMCS: number of iMCSs. Time iMCSs: time to compute iMCSs. The description of E.coli MG1655 (Orth et al., 2010) was taken from the BiGG Models Database (King et al., 2016), while the remaining ones are all taken from BioModels Database (Li et al., 2010).
Network . | rxns . | irr . | MMBs . | EFMs . | Time MMBs (s) . | Target MMBs . | iMCSs . | Time iMCSs . |
---|---|---|---|---|---|---|---|---|
Escherichia coli MG1655 (Orth et al., 2010) | 87 | 41 | 2572 | 100 274 | 5.7 | 1151 | 257 | 113 s |
Rhizobium etli iOR363 (Resendis-Antonio et al., 2007) | 194 | 104 | 6147 | N/A | 26 | 651 | 60 | 60 s |
Buchnera iSM197 (MacDonald et al., 2011) | 244 | 170 | 245 | N/A | 7 | 146 | 200 | 130 s |
Blattabacterium cuenoti iCG238 (González-Domenech et al., 2012) | 308 | 197 | 374 | N/A | 7.7 | 128 | 184 | 138 s |
Blattabacterium iCG230 (González-Domenech et al., 2012) | 400 | 192 | 82 | N/A | 6.7 | 32 | 159 | 115 s |
Mycobacterium tuberculosis iNJ661 (Jamshidi and Palsson, 2007) | 427 | 296 | 501 | N/A | 13 | 350 | 381 | 449 s |
Salmonella Typhimurium STM_v1_0 (Thiele et al., 2011) | 458 | 316 | 97 | N/A | 11.8 | 96 | 321 | 386 s |
Helicobacter pylori iCS291 (Schilling et al., 2002) | 444 | 271 | 150, 132 | N/A | 325 | 148 608 | 187 | 5 h 44 min 1 s |
Network . | rxns . | irr . | MMBs . | EFMs . | Time MMBs (s) . | Target MMBs . | iMCSs . | Time iMCSs . |
---|---|---|---|---|---|---|---|---|
Escherichia coli MG1655 (Orth et al., 2010) | 87 | 41 | 2572 | 100 274 | 5.7 | 1151 | 257 | 113 s |
Rhizobium etli iOR363 (Resendis-Antonio et al., 2007) | 194 | 104 | 6147 | N/A | 26 | 651 | 60 | 60 s |
Buchnera iSM197 (MacDonald et al., 2011) | 244 | 170 | 245 | N/A | 7 | 146 | 200 | 130 s |
Blattabacterium cuenoti iCG238 (González-Domenech et al., 2012) | 308 | 197 | 374 | N/A | 7.7 | 128 | 184 | 138 s |
Blattabacterium iCG230 (González-Domenech et al., 2012) | 400 | 192 | 82 | N/A | 6.7 | 32 | 159 | 115 s |
Mycobacterium tuberculosis iNJ661 (Jamshidi and Palsson, 2007) | 427 | 296 | 501 | N/A | 13 | 350 | 381 | 449 s |
Salmonella Typhimurium STM_v1_0 (Thiele et al., 2011) | 458 | 316 | 97 | N/A | 11.8 | 96 | 321 | 386 s |
Helicobacter pylori iCS291 (Schilling et al., 2002) | 444 | 271 | 150, 132 | N/A | 325 | 148 608 | 187 | 5 h 44 min 1 s |
Note: Network: name of the metabolic network. rxns: number of unblocked reactions of the network. irr: number of unblocked irreversible reactions of the network. MMBs: number of MMBs. EFMs: number of EFMs. N/A: the programmes used here (polco and efmtool, http://www.csb.ethz.ch/tools/software) could not handle the size of the models resp. the number of EFMs. Time MMBs: Time to compute MMBs using projection. Target MMBs: number of MMBs involving the target reaction (biomass reaction). iMCS: number of iMCSs. Time iMCSs: time to compute iMCSs. The description of E.coli MG1655 (Orth et al., 2010) was taken from the BiGG Models Database (King et al., 2016), while the remaining ones are all taken from BioModels Database (Li et al., 2010).
(Using MMBs to compute iMCSs) The network in Figure 1 has three different MMBs: and {6, 8}. We apply the method from Example 1 to compute MCSs, but instead of EFMs we now use MMBs. The target reaction is again reaction 7. The only MMB containing reaction 7 is {6, 7}. Therefore, the iMCSs for reaction 7 are {6} and {7}.
Using MMBs instead of EFMs will allow us to compute (irreversible) MCSs for larger networks, see Section 4. However, there are still limits. In practice, it is often not possible to compute the MMBs for genome-scale metabolic networks. Nevertheless, we can still perform the projection and then apply MILP techniques on the dual projected cone , using the method of von Kamp and Klamt (2014) or Tobalina et al. (2016). The classical method to perform projections is Fourier–Motzkin elimination, see e.g. (Ziegler, 1995). In general, Fourier–Motzkin elimination leads to a combinatorial explosion in the number of linear inequalities needed to describe the projected cone. However, in the special case of flux cones, we can use the theory of oriented matroids to avoid this if we project along reversible reactions on the set of irreversible reactions. This enables us to perform the projection efficiently also on genome-scale metabolic networks.
To summarize, the contributions of this paper are the following:
We introduce the new concept of iMCSs.
We present an efficient method to project a metabolic network onto its irreversible reactions, based on contraction of oriented matroids.
We present a new method for computing MMBs and two approaches for computing iMCSs.
We illustrate our theoretical results by various computational experiments.
2 Mathematical background
We start by providing some mathematical background needed to develop our method. For a vector and a set of indices we denote by xI the subvector consisting of the components xi with . We use superscripts to refer to several vectors such as . For a matrix we denote by resp. the submatrix consisting of the rows resp. columns whose indices belong to I. The rank or maximum number of linearly independent columns in A is denoted by . The identity matrix is denoted by .
Next we give some basic definitions and theorems related to polyhedral cones, see Schrijver (1998) for more details. A subset is called a cone if any conic combination of two elements belongs to Γ again, i.e. , for any non-negative . A cone Γ is called polyhedral if there exists a matrix such that . A cone Γ is finitely generated if there exists a finite set of generators such that every element can be written in the form , for some non-negative . By a classical theorem of Farkas–Minkowski–Weyl, a cone is polyhedral if and only if it is finitely generated. For a polyhedral cone Γ, the linear subspace is called the lineality space of Γ. The cone Γ is called pointed if . This means that whenever , we have . In other words, Γ does not contain any line , for . By basic linear algebra, a cone Γ is pointed if and only if the matrix A has full column rank, i.e. . A ray is a non-zero element of Γ and two rays are considered identical if for some . A ray x is called an extreme ray of Γ, if there exist no two linearly independent rays such that . If Γ is pointed, the extreme rays form a unique minimum set of generators , see Figure 2 for illustration.
In the following, we consider a special class of polyhedral cones, which we call flux cones:
Either B or I may be empty, but not both.
Clearly, the flux cone of a metabolic network is a flux cone in the sense of Definition 2, with B = S and .
Let Γ be a flux cone. If for all , then Γ is pointed and the extreme rays of Γ are exactly the rays in Γ of minimal support.
The proof of Proposition 1 and all the following proofs can be found in Supplementary Material, Section 5.
In general, metabolic networks may contain reversible reactions together with reversible flux vectors for which . For example in Figure 1, there is a reversible flux vector with support . In such networks, the flux cone is not pointed, and Proposition 1 does not apply. If there exist reversible reactions but the flux cone is still pointed, then all extreme rays have minimal support, but the converse is no longer true.
3 Materials and methods
As explained in the Section 1, our goal is to compute (irreversible) MCSs for genome-scale networks by computing MCSs of the projected cone . Since is usually smaller than , this will allow us to handle larger metabolic networks and to compute (irreversible) MCSs more efficiently.
3.1 Projection of flux cones
Projections of the flux cone of a metabolic network have been considered before in the literature (Covert and Palsson, 2003; Marashi et al., 2012; Urbanczik, 2006; Urbanczik and Wagner, 2005a; Wagner and Urbanczik, 2005; Wiback and Palsson, 2002). For and the projection of x on H is defined as . Similarly, for a polyhedral cone , let . It is well known that is again a polyhedral cone (Ziegler, 1995).
Here we show that the projection of the flux cone of a metabolic network will be again a flux cone in the sense of Definition 2, if we project along one or more reversible reactions on a superset of the set of irreversible reactions.
Let be a flux cone with . For any the projection is again a flux cone.
In the following, we consider the special case of flux cones and . This means that we project along all the reversible reactions on the set of all irreversible reactions. By Theorem 1, the projection is again a flux cone in the sense of Definition 2. Every vector can be lifted to a vector of the original network by adding suitable components for the reversible reactions. The flux cone is pointed since only irreversible reactions can be active. However, does not necessarily describe a metabolic network. An example is given in Figure 2, which shows that projecting the flux cone along a reversible reaction is different from deleting this reaction from the network.
3.2 Computing MMBs via projection
We next show that the MMBs of a metabolic network can be obtained from the extreme rays of the pointed cone .
Let be the flux cone of a metabolic network . The supports of the extreme rays of the pointed cone are exactly the MMBs of the network .
Up to now there are three ways for enumerating all MMBs in a metabolic network. The first one is to compute all EFMs and to search for the MMBs contained in them. Since this method involves enumerating the whole set of EFMs, it is usually not applicable to genome-scale networks. A closely related approach is to compute a minimum set of generators for the flux cone, e.g. by using the double description method (Fukuda and Prodon, 1996). The supports of the generators that do not belong to the lineality space then correspond to the MMBs (Larhlimi and Bockmayr, 2009). A third method is using MILP (Rezola et al., 2011) to compute so-called generating flux modes, from which again the MMBs can be obtained.
Theorem 2 leads to a new approach for computing the set of MMBs of a metabolic network . First compute an inequality description of the projection , then compute the extreme rays of and their supports.
3.3 Projection and contraction
In this section, we apply the theory of oriented matroids to give an efficient procedure for computing an inequality description of the cone . The classical method for projection of polyhedral cones is Fourier–Motzkin elimination (Ziegler, 1995). In general, this can lead to a combinatorial explosion in the number of linear inequalities needed to describe the projected cone. In our case we are concerned with the special case of flux cones of the form with , and we always project along components . As we will see, this implies that all changes in the row structure of A during projection will occur in the equational part Bx = 0, while the matrix representing the linear inequalities is not changed (cf. the proof of Theorem 1 in Supplementary Material, Section 5). Since the homogeneous linear equation system Bx = 0 describes an oriented matroid (Björner et al., 1999), the projection of flux cones can be performed efficiently via contraction of oriented matroids.
Let U be a set. A signed subset of U is a pair with a set of positive elements and a set of negative elements such that . The support of X is the set . An oriented matroid is a pair . Here U is a set and is a family of signed subsets of U, called the oriented circuits of , which satisfy the circuit axioms that can be found in Supplementary Material, Section 2, or Björner et al. (1999). The equivalent of projection for oriented matroids is called contraction. Let be an oriented matroid. For and we denote by the signed set . The family of non-empty intersections of circuits of with H that have inclusion-minimal support is the set of circuits of an oriented matroid on H (Björner et al., 1999). This matroid is called the contraction ofto H and will be denoted by .
In our case, we consider a special class of matroids which are called representable. Let be a matrix and let U be the set of columns of B. The oriented matroidrepresented by B is given by , where . Here denotes the signed set obtained by applying the sign function component-wise to the vector x, and is the set of non-empty signed sets in with inclusion-minimal support. The matrix representing the contraction of the oriented matroid can be computed by basic linear algebra, see Supplementary Material, Section 3. There is no combinatorial explosion like in Fourier–Motzkin elimination.
for the details, see Supplementary Material, Section 3, Example 1.
By Theorem 3, the matrix T also describes the flux cone , in which all active reactions are irreversible. By Proposition 1, the extreme rays of are exactly the EFMs of the network represented by , which is shown in Figure 3. The supports of these extreme rays are , which are the MMBs of the original network from Figure 1. The oriented circuits of include these MMBs (as signed sets) and one additional oriented circuit .
4 Results and discussion
To illustrate our method, we describe a number of computational experiments for computing (irreversible) MCSs based on the projection of the flux cone. For lack of space, we highlight here only some of the results using overview figures. Full tables with all the data can be found in Supplementary Material, Section 6. All computations were done on a desktop computer with eight processors Intel(R) Core(TM) i7-2600, CPU 3.40 GHZ, each with two threads. When searching for MCSs in the original flux cone, we only allowed irreversible reactions to be included in the computed MCSs. Thus, we computed iMCSs as well, but without performing the projection.
4.1 Computing iMCSs using MMBs
We first consider the hitting set approach. Instead of EFMs we use MMBs, which correspond to the extreme rays (or EFMs) of (cf. Theorem 2). It is known that the number of MMBs is typically much smaller than the number of EFMs (Larhlimi and Bockmayr, 2009). For computing the MMBs, we implemented the approach from Section 3.2 in MATLAB. We used the software SAGE (http://www.sagemath.org) (The Sage Developers, 2016) for computing the projection via contraction and the software polco (Terzer, 2009) for enumerating the extreme rays. Given the set of MMBs, computing iMCSs becomes a hitting set problem, see Supplementary Material Section 4 or Klamt (2006).
To evaluate our method, we considered a selection of medium-sized metabolic networks from the BioModels Database (Li et al., 2010). The number of unblocked reactions, i.e. reactions whose steady-state flux is not always zero, ranges from 87 up to 444. While EFMs could be computed for only one network, the set of MMBs could be obtained for all these networks in a relatively short amount of time, see Table 1.
All these networks contain a biomass reaction, which we used as target reaction for computing the iMCSs. The hitting set approach allowed us to enumerate all iMCSs for these networks, see Supplementary Table S3. For several networks, the maximal cardinality of the iMCSs did not exceed 5 and most iMCSs have even cardinality 1. In Blattabacterium iCG230, we only have iMCSs of cardinality 1 (152 iMCSs) and cardinality 2 (7 iMCSs). Salmonella Typhimurium STM_v1_0 is another example with 296 iMCSs of cardinality 1, 19 of cardinality 2 and 6 of cardinality 3.
4.2 Computing iMCSs using the dual approach
The network sizes for which we are able to compute the MMBs are still limited. The bottleneck is computing the extreme rays of by polco (Terzer, 2009), while doing the projection is feasible even for very large networks. We performed the projection on the set of irreversible reactions for all 84 networks of the BiGG Models Database (King et al., 2016), using SAGE (The Sage Developers, 2016) for the contraction. This took between 32 s (for a network of 87 unblocked reactions) and 35 min (for a network of 4047 unblocked reactions), see Figure 1 and Supplementary Tables S1 and S2.
Next we used the resulting matrices describing for computing k-shortest iMCSs by the method of von Kamp and Klamt (2014), which is part of CellNetAnalyzer version 2018.1 (von Kamp et al., 2017). As MILP solver, we used CPLEX 12.8 (http://www.cplex.com). Compared to using the full flux cone, we were able to compute more iMCSs and in shorter time, see Figure 4 and Supplementary Tables S4–S7. Note that for various larger genome-scale metabolic networks, it was not possible to compute any iMCSs using the original cone, while in all these cases we were able to compute 40 or more iMCSs using the projected cone (see Supplementary Tables S6 and S7). In general, we can observe that the benefits from using the projection increase with the size of the network.
Finally we considered the computation of iMCSs containing a specific reaction knock-out. We applied the software of Tobalina et al. (2016) together with CPLEX 12.4 to the original stoichiometric matrices as well as to the reduced matrices after projection, and compared the results. Like in Tobalina et al. (2016), we used the biomass as target reaction. For each irreversible reaction, we set a time limit of 1 min to compute a single iMCS that contains this reaction. If no iMCS could be found within 1 min, the computation was stopped and we moved on to the next irreversible reaction. We report on the 39 networks in the BiGG database for which the total computation time for this experiment did not exceed 3 days. Except for one network, we were able to compute more iMCSs using the projection, see Figure 5 and Supplementary Table S8.
5 Conclusion
We introduced the new concept of iMCSs and presented different ways for computing them. On the algorithmic side, the key idea is the projection of flux cones, which is done by the contraction of an oriented matroid. Our first approach to compute iMCSs uses MMBs. To find the MMBs of a metabolic network , we project the flux cone on the set of irreversible reactions . The supports of the extreme rays of the pointed cone are the MMBs of .
A second way for computing iMCSs consists in applying to the projected cone the dual approach for computing (irreversible) MCSs (Tobalina et al., 2016; von Kamp and Klamt, 2014). Using the projection we are able to compute iMCSs in most cases faster and in networks where the standard method (using the original cone) fails, due to lack of memory. Since we can only compute irreversible MCSs with our approach, it may happen that certain MCSs of interest are missed, e.g. MCSs which are easy to implement in the wet lab. If no irreversible MCSs exist or if certain reversible reactions are of particular importance, the projection can be performed on those reversible reactions, too. The only restriction is that all irreversible reactions have to preserve in the projected cone. A possible direction for future research is computing constrained iMCSs (Hädicke and Klamt, 2011) or applying other constraint-based methods to the projected cone .
While large MCSs are hard to use for wet lab experiments, they allow studying the fragility of a metabolic network (Clark and Verwoerd, 2012; Gerstl et al., 2016; Klamt, 2006; Klamt and Gilles, 2004). Since the fragility of a network (resp. of a pre-defined reaction ) is given by the size distribution over the total set of MCSs (resp. the set of MCSs including i), all MCSs have to be known. For example, Klamt and Gilles (2004) found MCSs indicating that in the central metabolism of Escherichia coli growth on glucose is less fragile (more robust) than growth on acetate. While iMCSs do not cover the whole set of MCSs, they give the possibility to compute MCSs of larger cardinality, which may give additional biological insights.
Acknowledgement
We thank Alexandra M. Reimers for fruitful discussions and comments on this work.
Conflict of Interest: none declared.
References