-
PDF
- Split View
-
Views
-
Cite
Cite
Cristiana Gomes de Oliveira Dal'Molin, Lake-Ee Quek, Robin William Palfreyman, Stevens Michael Brumbley, Lars Keld Nielsen, AraGEM, a Genome-Scale Reconstruction of the Primary Metabolic Network in Arabidopsis , Plant Physiology, Volume 152, Issue 2, February 2010, Pages 579–589, https://doi.org/10.1104/pp.109.148817
Close -
Share
Abstract
Genome-scale metabolic network models have been successfully used to describe metabolism in a variety of microbial organisms as well as specific mammalian cell types and organelles. This systems-based framework enables the exploration of global phenotypic effects of gene knockouts, gene insertion, and up-regulation of gene expression. We have developed a genome-scale metabolic network model (AraGEM) covering primary metabolism for a compartmentalized plant cell based on the Arabidopsis (Arabidopsis thaliana) genome. AraGEM is a comprehensive literature-based, genome-scale metabolic reconstruction that accounts for the functions of 1,419 unique open reading frames, 1,748 metabolites, 5,253 gene-enzyme reaction-association entries, and 1,567 unique reactions compartmentalized into the cytoplasm, mitochondrion, plastid, peroxisome, and vacuole. The curation process identified 75 essential reactions with respective enzyme associations not assigned to any particular gene in the Kyoto Encyclopedia of Genes and Genomes or AraCyc. With the addition of these reactions, AraGEM describes a functional primary metabolism of Arabidopsis. The reconstructed network was transformed into an in silico metabolic flux model of plant metabolism and validated through the simulation of plant metabolic functions inferred from the literature. Using efficient resource utilization as the optimality criterion, AraGEM predicted the classical photorespiratory cycle as well as known key differences between redox metabolism in photosynthetic and nonphotosynthetic plant cells. AraGEM is a viable framework for in silico functional analysis and can be used to derive new, nontrivial hypotheses for exploring plant metabolism.
The past two decades have seen impressive progress in the mapping of plant genes to metabolic function, culminating in the complete sequencing and partial annotation of Arabidopsis (Arabidopsis thaliana; Dennis and Surridge, 2000), rice (Oryza sativa; Goff, 2005), and sorghum (Sorghum bicolor; Paterson et al., 2009), with the maize (Zea mays) genome under way (Maize Genome Sequencing Project, 2009). Simultaneously, tools for accurately manipulating gene expression in plants have developed rapidly. However, attempts to use these tools to engineer plant metabolism have met with limited success due to the complexity of plant metabolism. Genetic manipulations rarely cause the predicted effects, and new rate-limiting steps prevent the accumulation of some desired compounds (Sweetlove et al., 2003; Gutierrez et al., 2005). In a bid to improve our understanding of plant metabolism and thereby the success rate of plant metabolic engineering, a systems-based framework to study plant metabolism is needed (DellaPenna, 2001; Gutierrez et al., 2005). Systems biology involves an iterative process of experimentation, data integration, modeling, and generation of hypotheses (Kitano, 2000, 2002a, 2002b). “Omics-based analysis,” in the form of transcriptomics (Yonekura-Sakakibara et al., 2008; Minic et al., 2009; van Dijk et al., 2009), proteomics (Thiellement, 1999; Dai et al., 2007; Cui et al., 2008), and metabolomics (Hall, 2006; Morgenthal et al., 2006), is well advanced in plant biology. In contrast, data mining beyond blind statistical analysis, modeling, and hypothesis generation have suffered from a lack of a mathematical modeling framework to handle the complexity. The synergistic use of model and omics data is necessary to understand the complex relationships and interactions among the various parts of the system. Although the ultimate goal is the development of dynamic models for the complete simulation of cellular systems (Tomita et al., 1999; Sugimoto et al., 2005), the success of such approaches has been severely hampered by the current lack of kinetic information on the dynamics and regulation of metabolic reactions.
The genome-scale modeling approach addresses the problem from the opposite direction, building on well-established network features. The foundation is a systemic (two-dimensional) annotation that lists all components (in all their states) of a system in one dimension and all the links connecting the components in the second dimension. Thus, the biochemical network contains all the metabolites in one dimension and all the reactions connecting the metabolites in the second dimension. This approach is readily extended to genes and proteins to describe transcription and translation as well as interactions and regulation. At present, however, genome-scale understanding is largely constrained to biochemical networks. Systemic annotations have been termed BIGG (for biochemically, genetically, and genomically structured) databases, and BIGG databases have been reconstructed for several organisms, including Escherichia coli (Feist et al., 2007) Saccharomyces cerevisiae (Duarte et al., 2004), mouse (Mus musculus; Sheikh et al., 2005; Quek and Nielsen, 2008), and human (Duarte et al., 2007).
A BIGG database defines the topology of the biological network. Network topology can be used in mining omics data (Cakir et al., 2006; Oliveira et al., 2008). Topological properties on their own, however, provide limited potential for predicting actual functional states of the network until they are combined with constraints reflecting the physicochemical nature of the system. Generally, systems are studied in metabolic steady state (i.e. Sv = 0), where S is the stoichiometric matrix with rows representing metabolites and columns representing individual reactions, and v is the flux for each reaction. The fluxes are subject to maximum capacity constraints (v min ≤ v ≤ v max), and some reactions are irreversible (i.e. the corresponding v min = 0). Furthermore, the model may be constrained by regulation (e.g. two reactions may be mutually exclusive) and by experimental data.
The BIGG database together with the constraints constitutes the genome scale model (GEM). This model can be used with a broad range of constraint-based analysis tools to predict, for example, optimal growth rate, yields on different substrates, and the effects of gene deletions (Varma and Palsson, 1994; Bonarius et al., 1997; Fong et al., 2003; Forster et al., 2003; Duarte et al., 2004; Price et al., 2004). In silico models have enabled hypothesis-driven biology, including the study of adaptive evolution in microbial systems (Fong et al., 2003, 2005) and the discovery of new metabolic roles of individual genes (Reed et al., 2006).
GEMs may greatly facilitate the study of the complex metabolism in plants, with its duplication of many pathways in multiple organelles and greatly varying metabolic objectives between different tissues. A partial annotation was recently published for the green alga Chlamydomonas reinhardtii (Boyle and Morgan, 2009). This annotation covers central and intermediary metabolism and consists of 484 reactions in cytosol, chloroplast, and mitochondria linked to protein identifier (ID). For plants, an Arabidopsis GEM was recently published (Poolman et al., 2009). Although the reconstructed model is capable of producing biomass components (amino acids, nucleotides, lipid, starch, and cellulose) in the proportion observed experimentally in a heterotrophic suspension culture, it does not consider the localization of reactions in organelles and cannot describe autotrophic metabolism. Moreover, no two-dimensional annotation was provided to link functions to genes.
In this study, a genome-scale model was developed for Arabidopsis. It represents, to our knowledge, the first attempt to characterize the metabolism of a compartmentalized photosynthetic plant cell at the genome scale and represents a structured compilation of cell components and interactions, which enables systematic investigation of metabolic properties and interrogation of available omics data sets for plants. The genome-scale model has been validated against a number of classical physiological scenarios: photosynthesis, photorespiration, and respiration.
RESULTS AND DISCUSSION
Large-Scale Metabolic Reconstruction
A gene-centric organization of metabolic information was adopted, in which each known metabolic gene is mapped to one or several reactions. The core of the Arabidopsis genome-scale model (AraGEM version 1.0) was reconstructed from the Arabidopsis gene and reaction database publicly available from the Kyoto Encyclopedia of Genes and Genomes (KEGG; release 49.0, January 1, 2009; Kanehisa et al., 2008). The reconstruction process was automated and used the same procedure previously applied to the GEM of mouse (Quek and Nielsen, 2008). The reconstruction retained all reaction attributes from KEGG, including unique reaction and compound IDs and reaction reversibilities.
KEGG does not capture the compartmentalization of metabolism in eukaryotes. The plant cell model was compartmentalized into cytosol, mitochondrion, plastid, peroxisome, and vacuole based on literature information and the available databases: The Arabidopsis Information Resource and AraPerox (a database of putative Arabidopsis proteins from plant peroxisomes; Reumann et al., 2004; Cui et al., 2008; Table I).
Online resources for the reconstruction of the metabolic network of Arabidopsis
Database . | Link . |
|---|---|
| Genome database | |
| The Arabidopsis Information Resource (TAIR version 9) | http://www.arabidopsis.org/index.jsp |
| Pathway databases | |
| Kyoto Encyclopedia of Genes and Genomes (KEGG) | http://www.genome.jp/kegg/pathway.html |
| AraCyc version 5.0 | http://www.arabidopsis.org:1555/ARA/ |
| Biocyc | http://biocyc.org/ |
| ExPASy Biochemical Pathways | http://www.expasy.ch/cgi-bin/search-biochem-index |
| Reactome (Arabidopsis) | http://www.reactome.org |
| Enzymes databases | |
| ExPASy Enzyme Database | www.expasy.org |
| Brenda | http://www.brenda-enzymes.org |
| Enzyme/protein localization and others databases | |
| AraPerox (Arabidopsis Protein from Plant Peroxisomes) | http://www.araperox.uni-goettingen.de/ |
| SUBA (Arabidopsis Subcellular Database) | http://www.plantenergy.uwa.edu.au/applications/suba2/index.php |
| PPDB (Plant Proteome Database) | http://ppdb.tc.cornell.edu/default.aspx |
| UniproKB/SwissProt | http://ca.expasy.org/sprot/relnotes/relstat.html |
Database . | Link . |
|---|---|
| Genome database | |
| The Arabidopsis Information Resource (TAIR version 9) | http://www.arabidopsis.org/index.jsp |
| Pathway databases | |
| Kyoto Encyclopedia of Genes and Genomes (KEGG) | http://www.genome.jp/kegg/pathway.html |
| AraCyc version 5.0 | http://www.arabidopsis.org:1555/ARA/ |
| Biocyc | http://biocyc.org/ |
| ExPASy Biochemical Pathways | http://www.expasy.ch/cgi-bin/search-biochem-index |
| Reactome (Arabidopsis) | http://www.reactome.org |
| Enzymes databases | |
| ExPASy Enzyme Database | www.expasy.org |
| Brenda | http://www.brenda-enzymes.org |
| Enzyme/protein localization and others databases | |
| AraPerox (Arabidopsis Protein from Plant Peroxisomes) | http://www.araperox.uni-goettingen.de/ |
| SUBA (Arabidopsis Subcellular Database) | http://www.plantenergy.uwa.edu.au/applications/suba2/index.php |
| PPDB (Plant Proteome Database) | http://ppdb.tc.cornell.edu/default.aspx |
| UniproKB/SwissProt | http://ca.expasy.org/sprot/relnotes/relstat.html |
Online resources for the reconstruction of the metabolic network of Arabidopsis
Database . | Link . |
|---|---|
| Genome database | |
| The Arabidopsis Information Resource (TAIR version 9) | http://www.arabidopsis.org/index.jsp |
| Pathway databases | |
| Kyoto Encyclopedia of Genes and Genomes (KEGG) | http://www.genome.jp/kegg/pathway.html |
| AraCyc version 5.0 | http://www.arabidopsis.org:1555/ARA/ |
| Biocyc | http://biocyc.org/ |
| ExPASy Biochemical Pathways | http://www.expasy.ch/cgi-bin/search-biochem-index |
| Reactome (Arabidopsis) | http://www.reactome.org |
| Enzymes databases | |
| ExPASy Enzyme Database | www.expasy.org |
| Brenda | http://www.brenda-enzymes.org |
| Enzyme/protein localization and others databases | |
| AraPerox (Arabidopsis Protein from Plant Peroxisomes) | http://www.araperox.uni-goettingen.de/ |
| SUBA (Arabidopsis Subcellular Database) | http://www.plantenergy.uwa.edu.au/applications/suba2/index.php |
| PPDB (Plant Proteome Database) | http://ppdb.tc.cornell.edu/default.aspx |
| UniproKB/SwissProt | http://ca.expasy.org/sprot/relnotes/relstat.html |
Database . | Link . |
|---|---|
| Genome database | |
| The Arabidopsis Information Resource (TAIR version 9) | http://www.arabidopsis.org/index.jsp |
| Pathway databases | |
| Kyoto Encyclopedia of Genes and Genomes (KEGG) | http://www.genome.jp/kegg/pathway.html |
| AraCyc version 5.0 | http://www.arabidopsis.org:1555/ARA/ |
| Biocyc | http://biocyc.org/ |
| ExPASy Biochemical Pathways | http://www.expasy.ch/cgi-bin/search-biochem-index |
| Reactome (Arabidopsis) | http://www.reactome.org |
| Enzymes databases | |
| ExPASy Enzyme Database | www.expasy.org |
| Brenda | http://www.brenda-enzymes.org |
| Enzyme/protein localization and others databases | |
| AraPerox (Arabidopsis Protein from Plant Peroxisomes) | http://www.araperox.uni-goettingen.de/ |
| SUBA (Arabidopsis Subcellular Database) | http://www.plantenergy.uwa.edu.au/applications/suba2/index.php |
| PPDB (Plant Proteome Database) | http://ppdb.tc.cornell.edu/default.aspx |
| UniproKB/SwissProt | http://ca.expasy.org/sprot/relnotes/relstat.html |
Characteristics of the Reconstructed Network
The reconstructed plant metabolic network contains 5,253 gene-reaction association entries (Table II).
Network characteristics of the reconstructed metabolic network of Arabidopsis
Metabolic Characteristics . | Total . |
|---|---|
| Gene reaction-association entries | 5,253 |
| ORFs (unique) | 1,419 |
| Metabolites | 1,748 |
| Unique reactions | 1,567 |
| Cytosolic reactions | 1,265 |
| Mitochondrial reactions | 60 |
| Plastidic reactions | 159 |
| Peroxisomal reactions | 98 |
| Modified reactions | 36 |
| Biomass drains and transporters | 148 |
| Biomass drains | 47 |
| Transporters (intercellular) | 18 |
| Transporters (interorganellar) | 83 |
| Gaps (unique reaction IDs) filled by manual curation | 75 |
| Singleton metabolites | 446 |
Metabolic Characteristics . | Total . |
|---|---|
| Gene reaction-association entries | 5,253 |
| ORFs (unique) | 1,419 |
| Metabolites | 1,748 |
| Unique reactions | 1,567 |
| Cytosolic reactions | 1,265 |
| Mitochondrial reactions | 60 |
| Plastidic reactions | 159 |
| Peroxisomal reactions | 98 |
| Modified reactions | 36 |
| Biomass drains and transporters | 148 |
| Biomass drains | 47 |
| Transporters (intercellular) | 18 |
| Transporters (interorganellar) | 83 |
| Gaps (unique reaction IDs) filled by manual curation | 75 |
| Singleton metabolites | 446 |
Network characteristics of the reconstructed metabolic network of Arabidopsis
Metabolic Characteristics . | Total . |
|---|---|
| Gene reaction-association entries | 5,253 |
| ORFs (unique) | 1,419 |
| Metabolites | 1,748 |
| Unique reactions | 1,567 |
| Cytosolic reactions | 1,265 |
| Mitochondrial reactions | 60 |
| Plastidic reactions | 159 |
| Peroxisomal reactions | 98 |
| Modified reactions | 36 |
| Biomass drains and transporters | 148 |
| Biomass drains | 47 |
| Transporters (intercellular) | 18 |
| Transporters (interorganellar) | 83 |
| Gaps (unique reaction IDs) filled by manual curation | 75 |
| Singleton metabolites | 446 |
Metabolic Characteristics . | Total . |
|---|---|
| Gene reaction-association entries | 5,253 |
| ORFs (unique) | 1,419 |
| Metabolites | 1,748 |
| Unique reactions | 1,567 |
| Cytosolic reactions | 1,265 |
| Mitochondrial reactions | 60 |
| Plastidic reactions | 159 |
| Peroxisomal reactions | 98 |
| Modified reactions | 36 |
| Biomass drains and transporters | 148 |
| Biomass drains | 47 |
| Transporters (intercellular) | 18 |
| Transporters (interorganellar) | 83 |
| Gaps (unique reaction IDs) filled by manual curation | 75 |
| Singleton metabolites | 446 |
Thirty-six unique KEGG reactions were modified to give a proper stoichiometric coefficient and/or consistent nomenclature and/or reversibility constraints. Modified reactions from KEGG have been marked by adding an “N” (new identity) to the KEGG reaction ID (e.g. R01175 became R01175N; Supplemental Table S1).
A total of 148 biomass drains and transporters were introduced in the model. Forty-seven biomass drain equations describe the accumulation of carbohydrates, amino acids, fatty acid, cellulose, and hemicellulose, representing the major biomass drain for a plant cell (Table III).
List of biomass components
Components . | Major Drains . |
|---|---|
| Carbohydrates and sugars | Starch, Suc, Fru, Glc, maltose |
| Cell wall | Lignin (4-coumaryl alcohol, coniferyl alcohol, sinapyl alcohol), cellulose, hemicellulose (Xyl) |
| Amino acids | Ala, Arg, Asp, Asn, Cys, Lys, Leu, Ile, Glu, Gln, His, Met, Phe, Pro, Ser, Tyr, Trp, Val |
| Nucleotides | ATP, GTP, CTP, UTP, dATP, dGTP, dCTP, dTTP |
| Fatty acids | C16:0 (palmitic acid) |
| Vitamins and cofactors | Biotin, CoA, riboflavin, folate, chlorophyll, nicotinamide, thiamine, ubiquinone |
Components . | Major Drains . |
|---|---|
| Carbohydrates and sugars | Starch, Suc, Fru, Glc, maltose |
| Cell wall | Lignin (4-coumaryl alcohol, coniferyl alcohol, sinapyl alcohol), cellulose, hemicellulose (Xyl) |
| Amino acids | Ala, Arg, Asp, Asn, Cys, Lys, Leu, Ile, Glu, Gln, His, Met, Phe, Pro, Ser, Tyr, Trp, Val |
| Nucleotides | ATP, GTP, CTP, UTP, dATP, dGTP, dCTP, dTTP |
| Fatty acids | C16:0 (palmitic acid) |
| Vitamins and cofactors | Biotin, CoA, riboflavin, folate, chlorophyll, nicotinamide, thiamine, ubiquinone |
List of biomass components
Components . | Major Drains . |
|---|---|
| Carbohydrates and sugars | Starch, Suc, Fru, Glc, maltose |
| Cell wall | Lignin (4-coumaryl alcohol, coniferyl alcohol, sinapyl alcohol), cellulose, hemicellulose (Xyl) |
| Amino acids | Ala, Arg, Asp, Asn, Cys, Lys, Leu, Ile, Glu, Gln, His, Met, Phe, Pro, Ser, Tyr, Trp, Val |
| Nucleotides | ATP, GTP, CTP, UTP, dATP, dGTP, dCTP, dTTP |
| Fatty acids | C16:0 (palmitic acid) |
| Vitamins and cofactors | Biotin, CoA, riboflavin, folate, chlorophyll, nicotinamide, thiamine, ubiquinone |
Components . | Major Drains . |
|---|---|
| Carbohydrates and sugars | Starch, Suc, Fru, Glc, maltose |
| Cell wall | Lignin (4-coumaryl alcohol, coniferyl alcohol, sinapyl alcohol), cellulose, hemicellulose (Xyl) |
| Amino acids | Ala, Arg, Asp, Asn, Cys, Lys, Leu, Ile, Glu, Gln, His, Met, Phe, Pro, Ser, Tyr, Trp, Val |
| Nucleotides | ATP, GTP, CTP, UTP, dATP, dGTP, dCTP, dTTP |
| Fatty acids | C16:0 (palmitic acid) |
| Vitamins and cofactors | Biotin, CoA, riboflavin, folate, chlorophyll, nicotinamide, thiamine, ubiquinone |
Flux balance analysis revealed 75 step reactions (unique reaction IDs) with essential metabolic functions that were not assigned to any particular gene, three of which are autocatalytic reactions (Table IV; for a complete list, see Supplemental Table S3).
Metabolic reactions not assigned to any particular gene and with essential metabolic function
No. of Gaps (75) . | Metabolic Function (Core Metabolism) . |
|---|---|
| 24 | Starch and Suc metabolism, lignin biosynthesis, glycerophospholipid metabolism, fatty acid biosynthesis, carbon fixation and pyruvate metabolism, citrate cycle, lipopolysaccharide biosynthesis; N-glycan biosynthesis; vitamins |
| 23 | Metabolism of amino acids (Ala and Asp metabolism; Gly, Ser, and Thr metabolism; Met metabolism; Phe, Tyr, and Trp biosynthesis; Lys biosynthesis and His metabolism; Val, Leu, and Ile biosynthesis; metabolism of amino groups) |
| 25 | Purine metabolism |
| 3 | Autocatalytic reactions |
No. of Gaps (75) . | Metabolic Function (Core Metabolism) . |
|---|---|
| 24 | Starch and Suc metabolism, lignin biosynthesis, glycerophospholipid metabolism, fatty acid biosynthesis, carbon fixation and pyruvate metabolism, citrate cycle, lipopolysaccharide biosynthesis; N-glycan biosynthesis; vitamins |
| 23 | Metabolism of amino acids (Ala and Asp metabolism; Gly, Ser, and Thr metabolism; Met metabolism; Phe, Tyr, and Trp biosynthesis; Lys biosynthesis and His metabolism; Val, Leu, and Ile biosynthesis; metabolism of amino groups) |
| 25 | Purine metabolism |
| 3 | Autocatalytic reactions |
Metabolic reactions not assigned to any particular gene and with essential metabolic function
No. of Gaps (75) . | Metabolic Function (Core Metabolism) . |
|---|---|
| 24 | Starch and Suc metabolism, lignin biosynthesis, glycerophospholipid metabolism, fatty acid biosynthesis, carbon fixation and pyruvate metabolism, citrate cycle, lipopolysaccharide biosynthesis; N-glycan biosynthesis; vitamins |
| 23 | Metabolism of amino acids (Ala and Asp metabolism; Gly, Ser, and Thr metabolism; Met metabolism; Phe, Tyr, and Trp biosynthesis; Lys biosynthesis and His metabolism; Val, Leu, and Ile biosynthesis; metabolism of amino groups) |
| 25 | Purine metabolism |
| 3 | Autocatalytic reactions |
No. of Gaps (75) . | Metabolic Function (Core Metabolism) . |
|---|---|
| 24 | Starch and Suc metabolism, lignin biosynthesis, glycerophospholipid metabolism, fatty acid biosynthesis, carbon fixation and pyruvate metabolism, citrate cycle, lipopolysaccharide biosynthesis; N-glycan biosynthesis; vitamins |
| 23 | Metabolism of amino acids (Ala and Asp metabolism; Gly, Ser, and Thr metabolism; Met metabolism; Phe, Tyr, and Trp biosynthesis; Lys biosynthesis and His metabolism; Val, Leu, and Ile biosynthesis; metabolism of amino groups) |
| 25 | Purine metabolism |
| 3 | Autocatalytic reactions |
The final model has 446 singleton or dead-end metabolites (i.e. internal metabolites only used in a single reaction; Supplemental Table S4). A total of 512 reactions are linked to the use or production of dead-end metabolites. These reactions will by definition have zero flux in flux balance analysis. There are several causes of singletons. Most are linked to vitamins, cofactors, and secondary metabolites not currently included in the model (i.e. not described by biomass drains or intercellular transporters). Some result from true gaps in the network, where one or more essential reactions have not yet been assigned. Finally, some result from KEGG's automatic annotation, in which every reaction known to be catalyzed by a particular EC enzyme in some organism will be included by default, whether or not other reactions required for functionality of the particular reaction are present. Continued curation efforts will focus on resolving these singletons.
In its current form, the model has 183 degrees of freedom. Of these, 47 are associated with the production of biomass components and reduce to a single degree of freedom (growth rate) when a fixed biomass composition is assumed. Another 18 degrees of freedom are associated with intercellular transport. The remaining 118 degrees of freedom represent the maximum cellular scope for using alternative pathways to achieve identical outcomes in terms of growth rate and net transport (e.g. the use of cytosolic or chloroplastic glycolysis). The real scope is further limited by irreversibility constraints as well as regulatory constraints.
AraGEM version 1.0 is available in System Biology Markup Language (SBML) format (Supplemental Data File S1).
In Silico Fluxomics: Photosynthesis Versus Photorespiration
AraGEM is a generic plant cell model capable of representing both photosynthetic and nonphotosynthetic cell types. In order to evaluate the model, its ability to reproduce classical physiological scenarios of plant cell metabolism was explored. The first scenario explored was photorespiration.
A simplified scheme of the current textbook photorespiratory cycle. Numbers are as follows: 1, plastidic glycolate transporter; 2, peroxisomal glycolate transporter; 3, plastidic Glu-malate translocator; 4, plastidic 2-oxoglutarate-malate translocator; 5, peroxisomal Glu transporter; 6, peroxisomal 2-oxoglutarate transporter; 7, peroxisomal Gly transporter; 8, mitochondrial Gly transporter. GDC, Gly decarboxylase; Glt, glycolate; Gox, glyoxylate; GS, Gln synthetase; methylene-THF, methylene-tetrahydrofolate; 2-OG, 2-oxoglutarate; 3-PGA, 3-phosphoglycerate; 2-Pglt, 2-phosphoglycolate; Pi, inorganic phosphate; Ru 1,5-BP, ribulose 1,5-bisP.
The effect of photorespiration was explored by comparing the optimum flux distribution predicted with pure photosynthesis (Table V, case 1)
Constraints imposed to represent each scenario
Case 1 is photosynthesis, case 2 is photorespiration, and case 3 is respiration/nitrogen assimilation in nonphotosynthetic cells. ™, Flux limited, constraint to zero; +, free flux, estimated through optimization.
Inputs, Outputs, and Constraints . | Case 1 . | Case 2 . | Case 3 . |
|---|---|---|---|
| Carbon source: CO2 uptake (free flux) | + | + | ™ |
| Carbon source: Suc uptake (free flux) | ™ | ™ | + |
| Photon uptake (free flux) | + | + | ™ |
| Rubisco; EC 4.1.1.39 (carboxylation-oxygenation ratio) | +1:0 | +3:1 | ™ |
| Fd-GOGAT; EC 1.4.7.1 (plastid) | + | + | ™ |
| NADH-GOGAT; EC 1.4.1.14 (plastid) | ™ | ™ | + |
| Optimization: minimize uptake of | Photons | Photons | Suc |
| Biomass rate (estimated and fixed) | Leaf | Leaf | Root |
Inputs, Outputs, and Constraints . | Case 1 . | Case 2 . | Case 3 . |
|---|---|---|---|
| Carbon source: CO2 uptake (free flux) | + | + | ™ |
| Carbon source: Suc uptake (free flux) | ™ | ™ | + |
| Photon uptake (free flux) | + | + | ™ |
| Rubisco; EC 4.1.1.39 (carboxylation-oxygenation ratio) | +1:0 | +3:1 | ™ |
| Fd-GOGAT; EC 1.4.7.1 (plastid) | + | + | ™ |
| NADH-GOGAT; EC 1.4.1.14 (plastid) | ™ | ™ | + |
| Optimization: minimize uptake of | Photons | Photons | Suc |
| Biomass rate (estimated and fixed) | Leaf | Leaf | Root |
Constraints imposed to represent each scenario
Case 1 is photosynthesis, case 2 is photorespiration, and case 3 is respiration/nitrogen assimilation in nonphotosynthetic cells. ™, Flux limited, constraint to zero; +, free flux, estimated through optimization.
Inputs, Outputs, and Constraints . | Case 1 . | Case 2 . | Case 3 . |
|---|---|---|---|
| Carbon source: CO2 uptake (free flux) | + | + | ™ |
| Carbon source: Suc uptake (free flux) | ™ | ™ | + |
| Photon uptake (free flux) | + | + | ™ |
| Rubisco; EC 4.1.1.39 (carboxylation-oxygenation ratio) | +1:0 | +3:1 | ™ |
| Fd-GOGAT; EC 1.4.7.1 (plastid) | + | + | ™ |
| NADH-GOGAT; EC 1.4.1.14 (plastid) | ™ | ™ | + |
| Optimization: minimize uptake of | Photons | Photons | Suc |
| Biomass rate (estimated and fixed) | Leaf | Leaf | Root |
Inputs, Outputs, and Constraints . | Case 1 . | Case 2 . | Case 3 . |
|---|---|---|---|
| Carbon source: CO2 uptake (free flux) | + | + | ™ |
| Carbon source: Suc uptake (free flux) | ™ | ™ | + |
| Photon uptake (free flux) | + | + | ™ |
| Rubisco; EC 4.1.1.39 (carboxylation-oxygenation ratio) | +1:0 | +3:1 | ™ |
| Fd-GOGAT; EC 1.4.7.1 (plastid) | + | + | ™ |
| NADH-GOGAT; EC 1.4.1.14 (plastid) | ™ | ™ | + |
| Optimization: minimize uptake of | Photons | Photons | Suc |
| Biomass rate (estimated and fixed) | Leaf | Leaf | Root |
Flux map for the photorespiratory cycle. Comparison of fluxes between photorespiration and photosynthesis (no oxygenation of ribulose 1,5-bisP). In the diagram, solid lines represent fluxes and dashed lines represent the transporters. Green and red lines highlight fluxes that have increased and decreased, respectively, during photorespiration when compared with photosynthesis. Gray lines represent flux values that have not changed significantly. Steps 1 to 17 represent the order of events that complete the photorespiratory cycle. Transporters are described in the Figure 1 legend. Boxes 9, 15, and 16 represent ammonium and CO2 being transported by diffusion. Box 17 represents CO2 being fixed in the Calvin cycle.
Figure 2 represents one optimal solution. Flexibility in the network means that there are generally many solutions achieving identical optimal performance. For example, if it is assumed that both plastids and mitochondria have transporters for both Gln and Glu, the Glu-Gln shuttle (Linka and Weber, 2005) is an equally photon-optimal alternative to the diffusive ammonia transport assumed in the textbook model of photorespiration. In contrast, the less efficient Orn-citruline shuttle would only be predicted if ammonia diffusion were deliberately constrained and no mitochondrial Gln transporter were assumed to exist. More generally, the existence of multiple solutions with characteristic usage of particular enzymes can be used to generate testable hypotheses.
Apart from the use of the classical photorespiratory cycle, AraGEM also predicts system-level changes in the optimum flux distribution (Fig. 2; Table VI).
Up-regulation of key target enzymes during photorespiration and respiration relative to photosynthesis highlighted by AraGEM
Metabolic Pathway . | . | Enzymes (EC) Up-Regulated during Photorespirationa . | . | Enzymes (EC) Up-Regulated during Respirationa . |
|---|---|---|---|---|
| Glycolysis (cytosolic and plastidic) | 4.2.1.11 | Phosphopyruvate hydratase | 2.7.1.40 | Pyruvate kinase |
| 5.3.1.1 | Triose-P isomerase | 4.2.1.11 | Phosphopyruvate hydratase | |
| 1.2.1.13 | Glyceraldehyde-3-phosphate dehydrogenase (NADP+) | 1.2.1.12 | G3PDH (NAD+) | |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.1 | Phosphoglycerate mutase | |
| 2.7.2.3 | Phosphoglycerate kinase | 5.1.3.3 | Aldose 1-epimerase | |
| 5.3.1.9 | Glc-6-P isomerase | 2.7.1.1 | Hexokinase | |
| 3.1.3.11 | Fru-bisP | 2.7.1.11 | 6-Phosphofructokinase | |
| Suc metabolism | 3.2.1.21 | β-Glucosidase | ||
| 2.4.1.14 | Suc-P synthase | |||
| 3.2.1.20 | α-Glucosidase; | |||
| 3.2.1.26 | β-fructofuranosidase | |||
| 2.4.1.13 | Suc synthase | |||
| 2.7.1.1 | Hexokinase | |||
| 3.2.1.4 | Cellulase | |||
| 2.7.1.4 | Fructokinase | |||
| 3.2.1.26 | β-Fructofuranosidase | |||
| 2.4.1.12 | Cellulose synthase (UDP forming) | |||
| Carbon fixation | 4.1.1.39 | Rubisco | ||
| 2.7.1.19 | Phosphoribulokinase | |||
| 4.1.1.31 | PEP carboxylase | |||
| 1.1.1.39 | Malate dehydrogenase (decarboxylating) | |||
| PPP (plastidic) | 5.3.1.6 | Rib-5-P isomerase | 2.7.1.15 | Ribokinase |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.2 | Phosphoglucomutase | |
| 5.1.3.1 | Ribulose-P 3-epimerase | 5.1.3.1 | Ribulose-P 3-epimerase | |
| 2.2.1.1 | Transketolase | 5.3.1.9 | Glc-6-P isomerase | |
| 2.2.1.2 | Transaldolase | 1.1.1.49 | Glc-6-P dehydrogenase | |
| 5.3.1.9 | Glc-6-P isomerase | |||
| 3.1.3.11 | Fru-bisP | |||
| TCA cycle | 1.1.1.37 | Malate dehydrogenase | ||
| 2.3.3.1 | Citrate (Si)-synthase | |||
| 1.2.4.2 | Oxoglutarate dehydrogenase | |||
| 1.1.1.41 | Isocitrate dehydrogenase (NAD+) | |||
| 4.2.1.2 | Fumarate hydratase | |||
| 6.2.1.5 | Succinyl-CoA synthetase | |||
| 1.3.99.1 | Succinate dehydrogenase | |||
| Glyoxylate cycle | 1.1.1.37 | Malate dehydrogenase | 1.1.1.37 | Malate dehydrogenase |
| 1.1.3.15 | Glycolate oxidase | |||
| 4.2.1.3 | Aconitate hydratase | |||
| 3.1.3.18 | Phosphoglycolate phosphatase | |||
| 1.1.1.29 | Glycerate dehydrogenase | |||
| 2.7.1.31 | Glycerate kinase | |||
| GS/GOGAT | 1.18.1.2 | Ferredoxin-NADP+ reductase | 1.4.1.14 | Glutamate synthase (NADH) |
Metabolic Pathway . | . | Enzymes (EC) Up-Regulated during Photorespirationa . | . | Enzymes (EC) Up-Regulated during Respirationa . |
|---|---|---|---|---|
| Glycolysis (cytosolic and plastidic) | 4.2.1.11 | Phosphopyruvate hydratase | 2.7.1.40 | Pyruvate kinase |
| 5.3.1.1 | Triose-P isomerase | 4.2.1.11 | Phosphopyruvate hydratase | |
| 1.2.1.13 | Glyceraldehyde-3-phosphate dehydrogenase (NADP+) | 1.2.1.12 | G3PDH (NAD+) | |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.1 | Phosphoglycerate mutase | |
| 2.7.2.3 | Phosphoglycerate kinase | 5.1.3.3 | Aldose 1-epimerase | |
| 5.3.1.9 | Glc-6-P isomerase | 2.7.1.1 | Hexokinase | |
| 3.1.3.11 | Fru-bisP | 2.7.1.11 | 6-Phosphofructokinase | |
| Suc metabolism | 3.2.1.21 | β-Glucosidase | ||
| 2.4.1.14 | Suc-P synthase | |||
| 3.2.1.20 | α-Glucosidase; | |||
| 3.2.1.26 | β-fructofuranosidase | |||
| 2.4.1.13 | Suc synthase | |||
| 2.7.1.1 | Hexokinase | |||
| 3.2.1.4 | Cellulase | |||
| 2.7.1.4 | Fructokinase | |||
| 3.2.1.26 | β-Fructofuranosidase | |||
| 2.4.1.12 | Cellulose synthase (UDP forming) | |||
| Carbon fixation | 4.1.1.39 | Rubisco | ||
| 2.7.1.19 | Phosphoribulokinase | |||
| 4.1.1.31 | PEP carboxylase | |||
| 1.1.1.39 | Malate dehydrogenase (decarboxylating) | |||
| PPP (plastidic) | 5.3.1.6 | Rib-5-P isomerase | 2.7.1.15 | Ribokinase |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.2 | Phosphoglucomutase | |
| 5.1.3.1 | Ribulose-P 3-epimerase | 5.1.3.1 | Ribulose-P 3-epimerase | |
| 2.2.1.1 | Transketolase | 5.3.1.9 | Glc-6-P isomerase | |
| 2.2.1.2 | Transaldolase | 1.1.1.49 | Glc-6-P dehydrogenase | |
| 5.3.1.9 | Glc-6-P isomerase | |||
| 3.1.3.11 | Fru-bisP | |||
| TCA cycle | 1.1.1.37 | Malate dehydrogenase | ||
| 2.3.3.1 | Citrate (Si)-synthase | |||
| 1.2.4.2 | Oxoglutarate dehydrogenase | |||
| 1.1.1.41 | Isocitrate dehydrogenase (NAD+) | |||
| 4.2.1.2 | Fumarate hydratase | |||
| 6.2.1.5 | Succinyl-CoA synthetase | |||
| 1.3.99.1 | Succinate dehydrogenase | |||
| Glyoxylate cycle | 1.1.1.37 | Malate dehydrogenase | 1.1.1.37 | Malate dehydrogenase |
| 1.1.3.15 | Glycolate oxidase | |||
| 4.2.1.3 | Aconitate hydratase | |||
| 3.1.3.18 | Phosphoglycolate phosphatase | |||
| 1.1.1.29 | Glycerate dehydrogenase | |||
| 2.7.1.31 | Glycerate kinase | |||
| GS/GOGAT | 1.18.1.2 | Ferredoxin-NADP+ reductase | 1.4.1.14 | Glutamate synthase (NADH) |
Relative flux to photosynthesis.
Up-regulation of key target enzymes during photorespiration and respiration relative to photosynthesis highlighted by AraGEM
Metabolic Pathway . | . | Enzymes (EC) Up-Regulated during Photorespirationa . | . | Enzymes (EC) Up-Regulated during Respirationa . |
|---|---|---|---|---|
| Glycolysis (cytosolic and plastidic) | 4.2.1.11 | Phosphopyruvate hydratase | 2.7.1.40 | Pyruvate kinase |
| 5.3.1.1 | Triose-P isomerase | 4.2.1.11 | Phosphopyruvate hydratase | |
| 1.2.1.13 | Glyceraldehyde-3-phosphate dehydrogenase (NADP+) | 1.2.1.12 | G3PDH (NAD+) | |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.1 | Phosphoglycerate mutase | |
| 2.7.2.3 | Phosphoglycerate kinase | 5.1.3.3 | Aldose 1-epimerase | |
| 5.3.1.9 | Glc-6-P isomerase | 2.7.1.1 | Hexokinase | |
| 3.1.3.11 | Fru-bisP | 2.7.1.11 | 6-Phosphofructokinase | |
| Suc metabolism | 3.2.1.21 | β-Glucosidase | ||
| 2.4.1.14 | Suc-P synthase | |||
| 3.2.1.20 | α-Glucosidase; | |||
| 3.2.1.26 | β-fructofuranosidase | |||
| 2.4.1.13 | Suc synthase | |||
| 2.7.1.1 | Hexokinase | |||
| 3.2.1.4 | Cellulase | |||
| 2.7.1.4 | Fructokinase | |||
| 3.2.1.26 | β-Fructofuranosidase | |||
| 2.4.1.12 | Cellulose synthase (UDP forming) | |||
| Carbon fixation | 4.1.1.39 | Rubisco | ||
| 2.7.1.19 | Phosphoribulokinase | |||
| 4.1.1.31 | PEP carboxylase | |||
| 1.1.1.39 | Malate dehydrogenase (decarboxylating) | |||
| PPP (plastidic) | 5.3.1.6 | Rib-5-P isomerase | 2.7.1.15 | Ribokinase |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.2 | Phosphoglucomutase | |
| 5.1.3.1 | Ribulose-P 3-epimerase | 5.1.3.1 | Ribulose-P 3-epimerase | |
| 2.2.1.1 | Transketolase | 5.3.1.9 | Glc-6-P isomerase | |
| 2.2.1.2 | Transaldolase | 1.1.1.49 | Glc-6-P dehydrogenase | |
| 5.3.1.9 | Glc-6-P isomerase | |||
| 3.1.3.11 | Fru-bisP | |||
| TCA cycle | 1.1.1.37 | Malate dehydrogenase | ||
| 2.3.3.1 | Citrate (Si)-synthase | |||
| 1.2.4.2 | Oxoglutarate dehydrogenase | |||
| 1.1.1.41 | Isocitrate dehydrogenase (NAD+) | |||
| 4.2.1.2 | Fumarate hydratase | |||
| 6.2.1.5 | Succinyl-CoA synthetase | |||
| 1.3.99.1 | Succinate dehydrogenase | |||
| Glyoxylate cycle | 1.1.1.37 | Malate dehydrogenase | 1.1.1.37 | Malate dehydrogenase |
| 1.1.3.15 | Glycolate oxidase | |||
| 4.2.1.3 | Aconitate hydratase | |||
| 3.1.3.18 | Phosphoglycolate phosphatase | |||
| 1.1.1.29 | Glycerate dehydrogenase | |||
| 2.7.1.31 | Glycerate kinase | |||
| GS/GOGAT | 1.18.1.2 | Ferredoxin-NADP+ reductase | 1.4.1.14 | Glutamate synthase (NADH) |
Metabolic Pathway . | . | Enzymes (EC) Up-Regulated during Photorespirationa . | . | Enzymes (EC) Up-Regulated during Respirationa . |
|---|---|---|---|---|
| Glycolysis (cytosolic and plastidic) | 4.2.1.11 | Phosphopyruvate hydratase | 2.7.1.40 | Pyruvate kinase |
| 5.3.1.1 | Triose-P isomerase | 4.2.1.11 | Phosphopyruvate hydratase | |
| 1.2.1.13 | Glyceraldehyde-3-phosphate dehydrogenase (NADP+) | 1.2.1.12 | G3PDH (NAD+) | |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.1 | Phosphoglycerate mutase | |
| 2.7.2.3 | Phosphoglycerate kinase | 5.1.3.3 | Aldose 1-epimerase | |
| 5.3.1.9 | Glc-6-P isomerase | 2.7.1.1 | Hexokinase | |
| 3.1.3.11 | Fru-bisP | 2.7.1.11 | 6-Phosphofructokinase | |
| Suc metabolism | 3.2.1.21 | β-Glucosidase | ||
| 2.4.1.14 | Suc-P synthase | |||
| 3.2.1.20 | α-Glucosidase; | |||
| 3.2.1.26 | β-fructofuranosidase | |||
| 2.4.1.13 | Suc synthase | |||
| 2.7.1.1 | Hexokinase | |||
| 3.2.1.4 | Cellulase | |||
| 2.7.1.4 | Fructokinase | |||
| 3.2.1.26 | β-Fructofuranosidase | |||
| 2.4.1.12 | Cellulose synthase (UDP forming) | |||
| Carbon fixation | 4.1.1.39 | Rubisco | ||
| 2.7.1.19 | Phosphoribulokinase | |||
| 4.1.1.31 | PEP carboxylase | |||
| 1.1.1.39 | Malate dehydrogenase (decarboxylating) | |||
| PPP (plastidic) | 5.3.1.6 | Rib-5-P isomerase | 2.7.1.15 | Ribokinase |
| 4.1.2.13 | Fru-bisP aldolase | 5.4.2.2 | Phosphoglucomutase | |
| 5.1.3.1 | Ribulose-P 3-epimerase | 5.1.3.1 | Ribulose-P 3-epimerase | |
| 2.2.1.1 | Transketolase | 5.3.1.9 | Glc-6-P isomerase | |
| 2.2.1.2 | Transaldolase | 1.1.1.49 | Glc-6-P dehydrogenase | |
| 5.3.1.9 | Glc-6-P isomerase | |||
| 3.1.3.11 | Fru-bisP | |||
| TCA cycle | 1.1.1.37 | Malate dehydrogenase | ||
| 2.3.3.1 | Citrate (Si)-synthase | |||
| 1.2.4.2 | Oxoglutarate dehydrogenase | |||
| 1.1.1.41 | Isocitrate dehydrogenase (NAD+) | |||
| 4.2.1.2 | Fumarate hydratase | |||
| 6.2.1.5 | Succinyl-CoA synthetase | |||
| 1.3.99.1 | Succinate dehydrogenase | |||
| Glyoxylate cycle | 1.1.1.37 | Malate dehydrogenase | 1.1.1.37 | Malate dehydrogenase |
| 1.1.3.15 | Glycolate oxidase | |||
| 4.2.1.3 | Aconitate hydratase | |||
| 3.1.3.18 | Phosphoglycolate phosphatase | |||
| 1.1.1.29 | Glycerate dehydrogenase | |||
| 2.7.1.31 | Glycerate kinase | |||
| GS/GOGAT | 1.18.1.2 | Ferredoxin-NADP+ reductase | 1.4.1.14 | Glutamate synthase (NADH) |
Relative flux to photosynthesis.
Respiration in Nonphotosynthetic Cells
AraGEM was also used to contrast metabolism in nonphotosynthetic cells (Table V, case 3) to that in photosynthetic cells (Table V, case 1). Under normal conditions, Suc is the main respiratory and growth substrate of higher plants (Dennis and Miernyk, 1982), and the optimal flux distribution for nonphotosynthetic cells was defined as the flux distribution that minimizes Suc uptake for a fixed rate of biomass synthesis. The energy derived from respiration is used for growth and maintenance processes and is higher for roots than for tops (Hansen and Jensen, 1977).
Flux map for respiration of a nonphotosynthetic plant cell. Flux comparison is shown between a nonphotosynthetic cell (respiration) and a photosynthetic cell (photosynthesis). Green and red lines represent increased and decreased flux, respectively, of a nonphotosynthetic cell compared with a photosynthetic cell. Gray lines represent flux values that have not changed significantly compared with photosynthesis. Boxes A and B represent the metabolic activity of some isoenzymes typically activated in nongreen tissues. Box A in plastidic glycolysis (glycolysis II) represents the plastid form of GAPDH [NAD(H)-dependent] activity. Box B represents the metabolic activity of NADH-GOGAT involved in ammonia assimilation in nongreen plastids.
AraGEM predicts an increase in metabolic activity of key isoenzymes responsible for the supply of redox equivalents in plastids during the respiratory cycle. In silico analysis suggests an increase in metabolic activity of the plastid form of NAD-glyceraldehyde-3-phosphate dehydrogenase (GAPDH [GapCp]), an isoenzyme closely related to the cytosolic NAD-GAPDH (Fig. 3, boxes A and B). This prediction is consistent with the observation that this gene in normally expressed in nongreen tissues (Petersen et al., 2003).
AraGEM also predicts that the use of NADP(H) by GAPDH [and not NAD(H)] is energetically favorable for photosynthetic cells during the light period, indicating a preferential coenzyme use under that condition (Fig. 2). Consistent with this, it has been observed that the bispecific chloroplast form of GAPDH (EC 1.2.1.13) uses NADP(H) as a coenzyme under physiological conditions during the photoperiod (leaves; Wolosiuk and Buchanan, 1978; Backhausen et al., 1998).
Another hypothesis derived from the model is that, during light induction (in leaf), the reducing power generated by the photosynthetic transport chain is used in the reduction of nitrite to ammonia and in the Gln synthetase/GOGAT cycle (Fd-GOGAT; ferredoxin-NADP+ reductase dependent). In the dark (no light induction or for nonphotosynthetic cells), on the other hand, the plastidic PPP supplies reducing power to the Gln synthetase/GOGAT cycle. Again, this prediction is in agreement with published observations (Abrol et al., 1983).
Optimality Criterion
The agreement between experimental observations and in silico predictions based on optimization suggests that plant cell metabolism could be regulated to achieve resource efficiency (photons in photosynthetic tissue and Suc in nonphotosynthetic tissue) in individual tissues. This, however, may only be true in a macroscopic sense (i.e. the metabolic flexibility encoded by the genome has evolved to ensure resource efficiency under the highly divergent conditions seen in different tissues [e.g. photosynthetic leaf tissue versus respiring root tissue]).
In order to establish if resource utilization is indeed optimized in individual tissues, it is necessary to compare predicted fluxes with actual fluxes. The current flux estimates are necessarily low estimates, since no value has been assigned to maintenance requirements. Moreover, tissues have been considered in isolation, whereas in reality there is an exchange of material between tissues and photosynthetic tissue must capture carbon for the whole plant.
The modeling framework is readily extended to scenarios where multiple tissues, each represented by a subset of the full model, exchange resources, and we are currently collating the growth parameters necessary to explore whole plant metabolism.
CONCLUSION
AraGEM is an extensively curated, compartmentalized, genome-scale model of plant cell primary metabolism. The reconstruction process led to the identification of 75 reactions essential for primary metabolism for which genes have yet to be identified. Continued curation efforts will focus on refining lipid metabolism, closing gaps in secondary metabolism, and resolving gene product targeting, where this has yet to be established.
The use of AraGEM for in silico flux predictions illustrates the potential of using genome-scale models to explore complex, compartmentalized networks and develop nontrivial hypotheses. The optimality criterion itself is one such hypothesis.
MATERIALS AND METHODS
Flux Balance Analysis
Information flow from a genome-scale model repository (Excel file) to a model document (SBML) to a flux map (Excel file). An initial list of components was compiled from the Arabidopsis genome, and the network was manually curated. The Java parsing program separates reactions into reactants and product stoichiometry and removes repeated reactions. Calculation output from MATLAB was used to update the flux map.
The stoichiometric matrix, S, as well as reversibility constraints (defining v min) were extracted from the SBML database in MATLAB (version 7.3; The MathWorks), and the relevant linear programming problems (see below) were solved using the MOSEK Optimization Toolbox for MATLAB (version 4). Finally, flux simulations were visualized on a metabolic flux map (which represents the central metabolism of a compartmentalized plant cell) drawn in Excel.
Manual Curation
In addition to compartmentalization, the manual curation process consisted of checking the model for reaction consistency, introducing biomass drains and transport reactions, and closing network gaps.
While KEGG is generally logically coherent, a number of inconsistencies make it impossible to use the automatically generated database directly. One is the use of multiple labels to describe the same compound (e.g. the use of nonspecific and specific references to sugar stereoisomers [d-Glc versus α-d-Glc]). Each such multiplicity was resolved as described previously (Quek and Nielsen, 2008). Another is the presence of nonbalanced reactions, typically for (1) the synthesis or breakdown of polymers (e.g. DNA + nucleotide = DNA), (2) the use of generic groups “R”, and (3) the consumption or production of H2O, H+, and redox equivalents [e.g. NAD(P)H]. In AraGEM, polymers were described in the form of their corresponding monomers, and the use of the generic atom R was avoided.
Biomass drain reactions are incorporated into AraGEM as the accumulation terms of the biomass precursors. It is useful to describe these accumulation terms individually (e.g. “Sucrose = Sucrose_biomass”) in order to simplify the task of uncovering the pathway gaps in each of the biosynthetic routes separately. The list of biomass components considered is shown in Table III and includes major structural and storage components as well as trace elements such as vitamins. Once the network gaps were filled (see below), these individual accumulation terms were combined into an overall biomass synthesis equation, with the appropriate coefficients assigned to each precursor to define the composition of biomass. The overall biomass synthesis equation is tissue specific; therefore, the composition of biomass has been estimated to represent photosynthetic or nonphotosynthetic tissues (e.g. leaf, stem, and root; Poorter and Bergkotte, 1992; Niemann et al., 1995). Trace elements were not included in the biomass equation, since their contribution to overall flux is trivial.
Exchange equations are used to describe the intercompartmental exchange of metabolites between cytoplasm and extracellular space and between cytoplasm and the organelles: plastids, peroxisomes, mitochondria, and vacuole. Annotation of transporters is far less developed than annotation of metabolic enzymes, and the annotation was largely manual based on normal metabolic functions described in the literature and transport requirements predicted from network gap filling.
Model Simulations
(i.e. the distributions that minimize the use of the key energy substrate [photons or Suc] while achieving a specified growth rate).
Supplemental Data
The following materials are available in the online version of this article.
Supplemental Table S1. Modified reactions and gene-enzyme association.
Supplemental Table S2. List of biomass drains and transporters.
Supplemental Table S3. Reactions with no ORF association.
Supplemental Table S4. List of singletons.
Supplemental Data File S1. AraGEM_vs1.0, SBML format.
Supplemental Data File S2. FluxMapC3, Excel file.
Supplemental Data File S3. c3.m, MATLAB file.
ACKNOWLEDGMENTS
We thank Simon Boyes for complementing data curation analysis in AraGEM. We also acknowledge our partners, BSES Ltd. and Metabolix, Inc.
LITERATURE CITED
Abrol YP, Sawhney SK, Naik MS (
Backhausen JE, Vetter S, Baalmann E, Kitzmann C, Scheibe R (
Bonarius HPJ, Schmid G, Tramper J (
Boyle NR, Morgan J (
Cakir T, Patil KR, Onsan ZI, Ulgen KO, Kirdar B, Nielsen J (
Coschigano KT, Melo-Oliveira R, Lim J, Coruzzi GM (
Cui J, Li P, Li G, Xu F, Zhao C, Li YH, Yang ZN, Wang G, Yu QB, Li YX, et al (
Dai SJ, Chen TT, Chong K, Xue YB, Liu SQ, Wang T (
DellaPenna D (
Dennis DT, Miernyk JA (
Duarte NC, Becker SA, Jamshidi N, Thiele I, Mo ML, Vo TD, Srivas R, Palsson BO (
Duarte NC, Herrgard MJ, Palsson BO (
Feist AM, Henry CS, Reed JL, Krummenacker M, Joyce AR, Karp PD, Broadbelt LJ, Hatzimanikatis V, Palsson BO (
Fong SS, Burgard AP, Herring CD, Knight EM, Blattner FR, Maranas CD, Palsson BO (
Fong SS, Marciniak JY, Palsson BO (
Forster J, Famili I, Fu P, Palsson BO, Nielsen J (
Goff SA (
Gutierrez RA, Shasha DE, Coruzzi GM (
Hall RD (
Hansen GK, Jensen CR (
Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, et al (
Kitano H (
Lancien M, Martin M, Hsieh MH, Leustek T, Goodman H, Coruzzi GM (
Linka M, Weber APM (
Minic Z, Jamet E, San-Clemente H, Pelletier S, Renou JP, Rihouey C, Okinyo DPO, Proux C, Lerouge P, Jouanin L (
Morgenthal K, Weckwerth W, Steuer R (
Niemann GJ, Pureveen JBM, Eijkel GB, Poorter H, Boon JJ (
Oliveira AP, Patil KR, Nielsen J (
Paterson AH, Bowers JE, Bruggmann R, Dubchak I, Grimwood J, Gundlach H, Haberer G, Hellsten U, Mitros T, Poliakov A, et al (
Petersen J, Brinkmann H, Cerff R (
Poolman M, Miguet L, Sweetlove LJ, Fell DA (
Poorter H, Bergkotte M (
Price ND, Reed JL, Palsson BO (
Quek LE, Nielsen LK (
Reed JL, Patel TR, Chen KH, Joyce AR, Applebee MK, Herring CD, Bui OT, Knight EM, Fong SS, Palsson BO (
Reumann S, Ma CL, Lemke S, Babujee L (
Sheikh K, Forster J, Nielsen LK (
Sugimoto M, Takahashi K, Kitayama T, Ito D, Tomita M (
Sweetlove LJ, Last RL, Fernie AR (
Tomita M, Hashimoto K, Takahashi K, Shimizu TS, Matsuzaki Y, Miyoshi F, Saito K, Tanida S, Yugi K, Venter JC, et al (
Trepp GB, van de Mortel M, Yoshioka H, Miller SS, Samac DA, Gantt JS, Vance CP (
Turano FJ, Muhitch MJ (
van Dijk JP, Cankar K, Scheffer SJ, Beenen HG, Shepherd LVT, Stewart D, Davies HV, Wilkockson SJ, Lelfert C, Gruden K, et al (
Varma A, Palsson BO (
Wedding RT (
Wingler A, Lea PJ, Quick WP, Leegood RC (
Wise RR, Hoober JK (
Wolosiuk RA, Buchanan BB (
Yonekura-Sakakibara K, Tohge T, Matsuda F, Nakabayashi R, Takayama H, Niida R, Watanabe-Takahashi A, Inoue E, Saito K (
Author notes
This work was supported by the Cooperative Research Centre for Sugar Industry Innovation through Biotechnology.
Corresponding author; e-mail lars.nielsen@uq.edu.au.
The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantphysiol.org) is: Lars Keld Nielsen (lars.nielsen@uq.edu.au).
The online version of this article contains Web-only data.


![Flux map for respiration of a nonphotosynthetic plant cell. Flux comparison is shown between a nonphotosynthetic cell (respiration) and a photosynthetic cell (photosynthesis). Green and red lines represent increased and decreased flux, respectively, of a nonphotosynthetic cell compared with a photosynthetic cell. Gray lines represent flux values that have not changed significantly compared with photosynthesis. Boxes A and B represent the metabolic activity of some isoenzymes typically activated in nongreen tissues. Box A in plastidic glycolysis (glycolysis II) represents the plastid form of GAPDH [NAD(H)-dependent] activity. Box B represents the metabolic activity of NADH-GOGAT involved in ammonia assimilation in nongreen plastids.](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/plphys/152/2/10.1104_pp.109.148817/4/m_plphys_v152_2_579_f3.jpeg?Expires=1686450407&Signature=PhaXE0XOPRP1WS~NKnZbW03-FsimLPSzJi0D0KPaavkRB5NlFx8UtcFi8cf3A7PiLwD1ouRNkTQeSaVnfh2-Ny~bBuP6flUiTlMQBNSFAiCGG5vpASFIedfdzG7olSggkFls3Onwjn40bPucZkd8tdAUS7Rpf7HKk7iWSS1nuKKPt2~9BTE6VIiRSNAQA-65oHF~mG-9gi72tqKa5ZFGs~Badj4TzW-qXlkP-hcQ5cN789AY81GXJCSEmfagb3FmjI61FMdnL-BJ8rauiZmB1iyzRhEpH7nEEXydMduLQ83cX9fVo1yhWUxSWPrmirCafmlf-fxHsF4LmIGN7QrsMA__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
