Gold Open-Access
Refine
Has Fulltext
- no (30)
Language
- English (30)
Is part of the Bibliography
- yes (30)
Keywords
- Network clustering (2)
- Protein complexes (2)
- Species comparison (2)
- 13C labeling (1)
- 3-way coffee hybrids (1)
- A. thaliana (1)
- Arabidopsis thaliana (1)
- Biochemical networks (1)
- Biochemical reaction networks (1)
- Biochemistry (1)
Institute
Mining of metabolite-protein interaction networks facilitates the identification of design principles underlying the regulation of different cellular processes. However, identification and characterization of the regulatory role that metabolites play in interactions with proteins on a genome-scale level remains a pressing task. Based on availability of high-quality metabolite-protein interaction networks and genome-scale metabolic networks, here we propose a supervised machine learning approach, called CIRI that determines whether or not a metabolite is involved in a competitive inhibitory regulatory interaction with an enzyme. First, we show that CIRI outperforms the naive approach based on a structural similarity threshold for a putative competitive inhibitor and the substrates of a metabolic reaction. We also validate the performance of CIRI on several unseen data sets and databases of metabolite-protein interactions not used in the training, and demonstrate that the classifier can be effectively used to predict competitive inhibitory interactions. Finally, we show that CIRI can be employed to refine predictions about metabolite-protein interactions from a recently proposed PROMIS approach that employs metabolomics and proteomics profiles from size exclusion chromatography in E. coli to predict metaboliteprotein interactions. Altogether, CIRI fills a gap in cataloguing metabolite-protein interactions and can be used in directing future machine learning efforts to categorize the regulatory type of these interactions.
Trade-offs between traits are present across different levels of biological systems and ultimately reflect constraints imposed by physicochemical laws and the structure of underlying biochemical networks. Yet, mechanistic explanation of how trade-offs between molecular traits arise and how they relate to optimization of fitness-related traits remains elusive. Here, we introduce the concept of relative flux trade-offs and propose a constraint-based approach, termed FluTOr, to identify metabolic reactions whose fluxes are in relative trade-off with respect to an optimized fitness-related cellular task, like growth. We then employed FluTOr to identify relative flux trade-offs in the genome-scale metabolic networks of Escherichia coli, Saccharomyces cerevisiae, and Arabidopsis thaliana. For the metabolic models of E. coli and S. cerevisiae we showed that: (i) the identified relative flux trade-offs depend on the carbon source used and that (ii) reactions that participated in relative trade-offs in both species were implicated in cofactor biosynthesis. In contrast to the two microorganisms, the relative flux trade-offs for the metabolic model of A. thaliana did not depend on the available nitrogen sources, reflecting the differences in the underlying metabolic network as well as the considered environments. Lastly, the established connection between relative flux trade-offs allowed us to identify overexpression targets that can be used to optimize fitness-related traits. Altogether, our computational approach and findings demonstrate how relative flux trade-offs can shape optimization of metabolic tasks, important in biotechnological applications.
Rhizophagus irregularis is one of the most extensively studied arbuscular mycorrhizal fungi (AMF) that forms symbioses with and improves the performance of many crops. Lack of transformation protocol for R. irregularis renders it challenging to investigate molecular mechanisms that shape the physiology and interactions of this AMF with plants.
Here, we used all published genomics, transcriptomics, and metabolomics resources to gain insights into the metabolic functionalities of R. irregularis by reconstructing its high-quality genome-scale metabolic network that considers enzyme constraints. Extensive validation tests with the enzyme-constrained metabolic model demonstrated that it can be used to (i) accurately predict increased growth of R. irregularis on myristate with minimal medium; (ii) integrate enzyme abundances and carbon source concentrations that yield growth predictions with high and significant Spearman correlation (rS = 0.74) to measured hyphal dry weight; and (iii) simulate growth rate increases with tighter association of this AMF with the host plant across three fungal structures.
Based on the validated model and system-level analyses that integrate data from transcriptomics studies, we predicted that differences in flux distributions between intraradical mycelium and arbuscles are linked to changes in amino acid and cofactor biosynthesis.
Therefore, our results demonstrated that the enzyme-constrained metabolic model can be employed to pinpoint mechanisms driving developmental and physiological responses of R. irregularis to different environmental cues.
In conclusion, this model can serve as a template for other AMF and paves the way to identify metabolic engineering strategies to modulate fungal metabolic traits that directly affect plant performance.
IMPORTANCE Mounting evidence points to the benefits of the symbiotic interactions between the arbuscular mycorrhiza fungus Rhizophagus irregularis and crops; however, the molecular mechanisms underlying the physiological responses of this fungus to different host plants and environments remain largely unknown.
We present a manually curated, enzyme-constrained, genome-scale metabolic model of R. irregularis that can accurately predict experimentally observed phenotypes.
We show that this high-quality model provides an entry point into better understanding the metabolic and physiological responses of this fungus to changing environments due to the availability of different nutrients.
The model can be used to design metabolic engineering strategies to tailor R. irregularis metabolism toward improving the performance of host plants.
The deficiency of a (bio)chemical reaction network can be conceptually interpreted as a measure of its ability to support exotic dynamical behavior and/or multistationarity. The classical definition of deficiency relates to the capacity of a network to permit variations of the complex formation rate vector at steady state, irrespective of the network kinetics. However, the deficiency is by definition completely insensitive to the fine details of the directionality of reactions as well as bounds on reaction fluxes. While the classical definition of deficiency can be readily applied in the analysis of unconstrained, weakly reversible networks, it only provides an upper bound in the cases where relevant constraints on reaction fluxes are imposed. Here we propose the concept of effective deficiency, which provides a more accurate assessment of the network’s capacity to permit steady state variations at the complex level for constrained networks of any reversibility patterns. The effective deficiency relies on the concept of nonstoichiometric balanced complexes, which we have already shown to be present in real-world biochemical networks operating under flux constraints. Our results demonstrate that the effective deficiency of real-world biochemical networks is smaller than the classical deficiency, indicating the effects of reaction directionality and flux bounds on the variation of the complex formation rate vector at steady state.
Introduction Flux phenotypes from different organisms and growth conditions allow better understanding of differential metabolic networks functions. Fluxes of metabolic reactions represent the integrated outcome of transcription, translation, and post-translational modifications, and directly affect growth and fitness. However, fluxes of intracellular metabolic reactions cannot be directly measured, but are estimated via metabolic flux analysis (MFA) that integrates data on isotope labeling patterns of metabolites with metabolic models. While the application of metabolomics technologies in photosynthetic organisms have resulted in unprecedented data from 13CO2-labeling experiments, the bottleneck in flux estimation remains the application of isotopically nonstationary MFA (INST-MFA). INST-MFA entails fitting a (large) system of coupled ordinary differential equations, with metabolite pools and reaction fluxes as parameters. Here, we focus on the Calvin-Benson cycle (CBC) as a key pathway for carbon fixation in photosynthesizing organisms and ask if approaches other than classical INST-MFA can provide reliable estimation of fluxes for reactions comprising this pathway.
Methods First, we show that flux estimation with the labeling patterns of all CBC intermediates can be formulated as a single constrained regression problem, avoiding the need for repeated simulation of time-resolved labeling patterns.
Results We then compare the flux estimates of the simulation-free constrained regression approach with those obtained from the classical INST-MFA based on labeling patterns of metabolites from the microalgae Chlamydomonas reinhardtii, Chlorella sorokiniana and Chlorella ohadii under different growth conditions.
Discussion Our findings indicate that, in data-rich scenarios, simulation-free regression-based approaches provide a suitable alternative for flux estimation from classical INST-MFA since we observe a high qualitative agreement (rs=0.89) to predictions obtained from INCA, a state-of-the-art tool for INST-MFA.
The use of automated tools to reconstruct lipid metabolic pathways is not warranted in plants. Here, the authors construct Plant Lipid Module for Arabidopsis rosette using constraint-based modeling, demonstrate its integration in other plant metabolic models, and use it to dissect the genetic architecture of lipid metabolism.
Lipids play fundamental roles in regulating agronomically important traits. Advances in plant lipid metabolism have until recently largely been based on reductionist approaches, although modulation of its components can have system-wide effects. However, existing models of plant lipid metabolism provide lumped representations, hindering detailed study of component modulation. Here, we present the Plant Lipid Module (PLM) which provides a mechanistic description of lipid metabolism in the Arabidopsis thaliana rosette. We demonstrate that the PLM can be readily integrated in models of A. thaliana Col-0 metabolism, yielding accurate predictions (83%) of single lethal knock-outs and 75% concordance between measured transcript and predicted flux changes under extended darkness. Genome-wide associations with fluxes obtained by integrating the PLM in diel condition- and accession-specific models identify up to 65 candidate genes modulating A. thaliana lipid metabolism. Using mutant lines, we validate up to 40% of the candidates, paving the way for identification of metabolic gene function based on models capturing natural variability in metabolism.
Metabolic engineering of microalgae offers a promising solution for sustainable biofuel production, and rational design of engineering strategies can be improved by employing metabolic models that integrate enzyme turnover numbers. However, the coverage of turnover numbers for Chlamydomonas reinhardtii, a model eukaryotic microalga accessible to metabolic engineering, is 17-fold smaller compared to the heterotrophic cell factory Saccharomyces cerevisiae. Here we generate quantitative protein abundance data of Chlamydomonas covering 2337 to 3708 proteins in various growth conditions to estimate in vivo maximum apparent turnover numbers. Using constrained-based modeling we provide proxies for in vivo turnover numbers of 568 reactions, representing a 10-fold increase over the in vitro data for Chlamydomonas. Integration of the in vivo estimates instead of in vitro values in a metabolic model of Chlamydomonas improved the accuracy of enzyme usage predictions. Our results help in extending the knowledge on uncharacterized enzymes and improve biotechnological applications of Chlamydomonas.
Genomic prediction has revolutionized crop breeding despite remaining issues of transferability of models to unseen environmental conditions and environments. Usage of endophenotypes rather than genomic markers leads to the possibility of building phenomic prediction models that can account, in part, for this challenge. Here, we compare and contrast genomic prediction and phenomic prediction models for 3 growth-related traits, namely, leaf count, tree height, and trunk diameter, from 2 coffee 3-way hybrid populations exposed to a series of treatment-inducing environmental conditions. The models are based on 7 different statistical methods built with genomic markers and ChlF data used as predictors. This comparative analysis demonstrates that the best-performing phenomic prediction models show higher predictability than the best genomic prediction models for the considered traits and environments in the vast majority of comparisons within 3-way hybrid populations. In addition, we show that phenomic prediction models are transferrable between conditions but to a lower extent between populations and we conclude that chlorophyll a fluorescence data can serve as alternative predictors in statistical models of coffee hybrid performance. Future directions will explore their combination with other endophenotypes to further improve the prediction of growth-related traits for crops.
As autotrophic organisms, plants capture light energy to convert carbon dioxide into ATP, nicotinamide adenine dinucleotide phosphate (NADPH), and sugars, which are essential for the biosynthesis of building blocks, storage, and growth. At night, metabolism and growth can be sustained by mobilizing carbon (C) reserves. In response to changing environmental conditions, such as light-dark cycles, the small-molecule regulation of enzymatic activities is critical for reprogramming cellular metabolism. We have recently demonstrated that proteogenic dipeptides, protein degradation products, act as metabolic switches at the interface of proteostasis and central metabolism in both plants and yeast. Dipeptides accumulate in response to the environmental changes and act via direct binding and regulation of critical enzymatic activities, enabling C flux distribution. Here, we provide evidence pointing to the involvement of dipeptides in the metabolic rewiring characteristics for the day-night cycle in plants. Specifically, we measured the abundance of 13 amino acids and 179 dipeptides over short- (SD) and long-day (LD) diel cycles, each with different light intensities. Of the measured dipeptides, 38 and eight were characterized by day-night oscillation in SD and LD, respectively, reaching maximum accumulation at the end of the day and then gradually falling in the night. Not only the number of dipeptides, but also the amplitude of the oscillation was higher in SD compared with LD conditions. Notably, rhythmic dipeptides were enriched in the glucogenic amino acids that can be converted into glucose. Considering the known role of Target of Rapamycin (TOR) signaling in regulating both autophagy and metabolism, we subsequently investigated whether diurnal fluctuations of dipeptides levels are dependent on the TOR Complex (TORC). The Raptor1b mutant (raptor1b), known for the substantial reduction of TOR kinase activity, was characterized by the augmented accumulation of dipeptides, which is especially pronounced under LD conditions. We were particularly intrigued by the group of 16 dipeptides, which, based on their oscillation under SD conditions and accumulation in raptor1b, can be associated with limited C availability or photoperiod. By mining existing protein-metabolite interaction data, we delineated putative protein interactors for a representative dipeptide Pro-Gln. The obtained list included enzymes of C and amino acid metabolism, which are also linked to the TORC-mediated metabolic network. Based on the obtained results, we speculate that the diurnal accumulation of dipeptides contributes to its metabolic adaptation in response to changes in C availability. We hypothesize that dipeptides would act as alternative respiratory substrates and by directly modulating the activity of the focal enzymes.
Identification of protein complexes from protein-protein interaction (PPI) networks is a key problem in PPI mining, solved by parameter-dependent approaches that suffer from small recall rates. Here we introduce GCC-v, a family of efficient, parameter-free algorithms to accurately predict protein complexes using the (weighted) clustering coefficient of proteins in PPI networks. Through comparative analyses with gold standards and PPI networks from Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens, we demonstrate that GCC-v outperforms twelve state-of-the-art approaches for identification of protein complexes with respect to twelve performance measures in at least 85.71% of scenarios. We also show that GCC-v results in the exact recovery of similar to 35% of protein complexes in a pan-plant PPI network and discover 144 new protein complexes in Arabidopsis thaliana, with high support from GO semantic similarity. Our results indicate that findings from GCC-v are robust to network perturbations, which has direct implications to assess the impact of the PPI network quality on the predicted protein complexes. (C) 2021 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.
Selection of high-performance lines with respect to traits of interest is a key step in plant breeding. Genomic prediction allows to determine the genomic estimated breeding values of unseen lines for trait of interest using genetic markers, e.g. single-nucleotide polymorphisms (SNPs), and machine learning approaches, which can therefore shorten breeding cycles, referring to genomic selection (GS). Here, we applied GS approaches in two populations of Solanaceous crops, i.e. tomato and pepper, to predict morphometric and colorimetric traits. The traits were measured by using scoring-based conventional descriptors (CDs) as well as by Tomato Analyzer (TA) tool using the longitudinally and latitudinally cut fruit images. The GS performance was assessed in cross-validations of classification-based and regression-based machine learning models for CD and TA traits, respectively. The results showed the usage of TA traits and tag SNPs provide a powerful combination to predict morphology and color-related traits of Solanaceous fruits. The highest predictability of 0.89 was achieved for fruit width in pepper, with an average predictability of 0.69 over all traits. The multi-trait GS models are of slightly better predictability than single-trait models for some colorimetric traits in pepper. While model validation performs poorly on wild tomato accessions, the usage as many as one accession per wild species in the training set can increase the transferability of models to unseen populations for some traits (e.g. fruit shape for which predictability in unseen scenario increased from zero to 0.6). Overall, GS approaches can assist the selection of high-performance Solanaceous fruits in crop breeding.
Physically interacting proteins form macromolecule complexes that drive diverse cellular processes. Advances in experimental techniques that capture interactions between proteins provide us with protein-protein interaction (PPI) networks from several model organisms. These datasets have enabled the prediction and other computational analyses of protein complexes. Here we provide a systematic review of the state-of-the-art algorithms for protein complex prediction from PPI networks proposed in the past two decades. The existing approaches that solve this problem are categorized into three groups, including: cluster-quality-based, node affinity-based, and network embedding-based approaches, and we compare and contrast the advantages and disadvantages. We further include a comparative analysis by computing the performance of eighteen methods based on twelve well-established performance measures on four widely used benchmark protein-protein interaction networks. Finally, the limitations and drawbacks of both, current data and approaches, along with the potential solutions in this field are discussed, with emphasis on the points that pave the way for future research efforts in this field. (c) 2022 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. This is an open access article under the CC BY license (http://creativecommons. org/licenses/by/4.0/).
COMMIT
(2022)
Composition and functions of microbial communities affect important traits in diverse hosts, from crops to humans. Yet, mechanistic understanding of how metabolism of individual microbes is affected by the community composition and metabolite leakage is lacking. Here, we first show that the consensus of automatically generated metabolic reconstructions improves the quality of the draft reconstructions, measured by comparison to reference models. We then devise an approach for gap filling, termed COMMIT, that considers metabolites for secretion based on their permeability and the composition of the community. By applying COMMIT with two soil communities from the Arabidopsis thaliana culture collection, we could significantly reduce the gap-filling solution in comparison to filling gaps in individual reconstructions without affecting the genomic support. Inspection of the metabolic interactions in the soil communities allows us to identify microbes with community roles of helpers and beneficiaries. Therefore, COMMIT offers a versatile fully automated solution for large-scale modelling of microbial communities for diverse biotechnological applications. <br /> Author summaryMicrobial communities are important in ecology, human health, and crop productivity. However, detailed information on the interactions within natural microbial communities is hampered by the community size, lack of detailed information on the biochemistry of single organisms, and the complexity of interactions between community members. Metabolic models are comprised of biochemical reaction networks based on the genome annotation, and can provide mechanistic insights into community functions. Previous analyses of microbial community models have been performed with high-quality reference models or models generated using a single reconstruction pipeline. However, these models do not contain information on the composition of the community that determines the metabolites exchanged between the community members. In addition, the quality of metabolic models is affected by the reconstruction approach used, with direct consequences on the inferred interactions between community members. Here, we use fully automated consensus reconstructions from four approaches to arrive at functional models with improved genomic support while considering the community composition. We applied our pipeline to two soil communities from the Arabidopsis thaliana culture collection, providing only genome sequences. Finally, we show that the obtained models have 90% genomic support and demonstrate that the derived interactions are corroborated by independent computational predictions.
The reactive oxygen species (ROS) gene network, consisting of both ROS-generating and detoxifying enzymes, adjusts ROS levels in response to various stimuli. We performed a cross-kingdom comparison of ROS gene networks to investigate how they have evolved across all Eukaryotes, including protists, fungi, plants and animals. We included the genomes of 16 extremotolerant Eukaryotes to gain insight into ROS gene evolution in organisms that experience extreme stress conditions. Our analysis focused on ROS genes found in all Eukaryotes (such as catalases, superoxide dismutases, glutathione reductases, peroxidases and glutathione peroxidase/peroxiredoxins) as well as those specific to certain groups, such as ascorbate peroxidases, dehydroascorbate/monodehydroascorbate reductases in plants and other photosynthetic organisms. ROS-producing NADPH oxidases (NOX) were found in most multicellular organisms, although several NOX-like genes were identified in unicellular or filamentous species. However, despite the extreme conditions experienced by extremophile species, we found no evidence for expansion of ROS-related gene families in these species compared to other Eukaryotes. Tardigrades and rotifers do show ROS gene expansions that could be related to their extreme lifestyles, although a high rate of lineage-specific horizontal gene transfer events, coupled with recent tetraploidy in rotifers, could explain this observation. This suggests that the basal Eukaryotic ROS scavenging systems are sufficient to maintain ROS homeostasis even under the most extreme conditions.
Ribosome biogenesis is tightly associated to plant metabolism due to the usage of ribosomes in the synthesis of proteins necessary to drive metabolic pathways. Given the central role of ribosome biogenesis in cell physiology, it is important to characterize the impact of different components involved in this process on plant metabolism. Double mutants of the Arabidopsis thaliana cytosolic 60S maturation factors REIL1 and REIL2 do not resume growth after shift to moderate 10 degrees C chilling conditions. To gain mechanistic insights into the metabolic effects of this ribosome biogenesis defect on metabolism, we developed TC-iReMet2, a constraint-based modelling approach that integrates relative metabolomics and transcriptomics time-course data to predict differential fluxes on a genome-scale level. We employed TC-iReMet2 with metabolomics and transcriptomics data from the Arabidopsis Columbia 0 wild type and the reil1-1 reil2-1 double mutant before and after cold shift. We identified reactions and pathways that are highly altered in a mutant relative to the wild type. These pathways include the Calvin-Benson cycle, photorespiration, gluconeogenesis, and glycolysis. Our findings also indicated differential NAD(P)/NAD(P)H ratios after cold shift. TC-iReMet2 allows for mechanistic hypothesis generation and interpretation of system biology experiments related to metabolic fluxes on a genome-scale level.
Large-scale biochemical models are of increasing sizes due to the consideration of interacting organisms and tissues. Model reduction approaches that preserve the flux phenotypes can simplify the analysis and predictions of steady-state metabolic phenotypes. However, existing approaches either restrict functionality of reduced models or do not lead to significant decreases in the number of modelled metabolites. Here, we introduce an approach for model reduction based on the structural property of balancing of complexes that preserves the steady-state fluxes supported by the network and can be efficiently determined at genome scale. Using two large-scale mass-action kinetic models of Escherichia coli, we show that our approach results in a substantial reduction of 99% of metabolites. Applications to genome-scale metabolic models across kingdoms of life result in up to 55% and 85% reduction in the number of metabolites when arbitrary and mass-action kinetics is assumed, respectively. We also show that predictions of the specific growth rate from the reduced models match those based on the original models. Since steady-state flux phenotypes from the original model are preserved in the reduced, the approach paves the way for analysing other metabolic phenotypes in large-scale biochemical networks.
Reaction lumping in metabolic networks for application with thermodynamic metabolic flux analysis
(2021)
Thermodynamic metabolic flux analysis (TMFA) can narrow down the space of steady-state flux distributions, but requires knowledge of the standard Gibbs free energy for the modelled reactions. The latter are often not available due to unknown Gibbs free energy change of formation ,Delta fG0, of metabolites. To optimize the usage of data on thermodynamics in constraining a model, reaction lumping has been proposed to eliminate metabolites with unknown Delta fG0. However, the lumping procedure has not been formalized nor implemented for systematic identification of lumped reactions. Here, we propose, implement, and test a combined procedure for reaction lumping, applicable to genome-scale metabolic models. It is based on identification of groups of metabolites with unknown Delta fG0 whose elimination can be conducted independently of the others via: (1) group implementation, aiming to eliminate an entire such group, and, if this is infeasible, (2) a sequential implementation to ensure that a maximal number of metabolites with unknown Delta fG0 are eliminated. Our comparative analysis with genome-scale metabolic models of Escherichia coli, Bacillus subtilis, and Homo sapiens shows that the combined procedure provides an efficient means for systematic identification of lumped reactions. We also demonstrate that TMFA applied to models with reactions lumped according to the proposed procedure lead to more precise predictions in comparison to the original models. The provided implementation thus ensures the reproducibility of the findings and their application with standard TMFA.
Trade-offs are inherent to biochemical networks governing diverse cellular functions, from gene expression to metabolism. Yet, trade-offs between fluxes of biochemical reactions in a metabolic network have not been formally studied. Here, we introduce the concept of absolute flux trade-offs and devise a constraint-based approach, termed FluTO, to identify and enumerate flux trade-offs in a given genome-scale metabolic network. By employing the metabolic networks of Escherichia coli and Saccharomyces cerevisiae, we demonstrate that the flux trade-offs are specific to carbon sources provided but that reactions involved in the cofactor and prosthetic group biosynthesis are present in trade-offs across all carbon sources supporting growth. We also show that absolute flux trade-offs depend on the biomass reaction used to model the growth of Arabidopsis thaliana under different carbon and nitrogen conditions. The identified flux trade-offs reflect the tight coupling between nitrogen, carbon, and sulphur metabolisms in leaves of C-3 plants. Altogether, FluTO provides the means to explore the space of alternative metabolic routes reflecting the constraints imposed by inherent flux trade-offs in large-scale metabolic networks.
Understanding the complexity of metabolic networks has implications for manipulation of their functions. The complexity of metabolic networks can be characterized by identifying multireaction dependencies that are challenging to determine due to the sheer number of combinations to consider. Here, we propose the concept of concordant complexes that captures multireaction dependencies and can be efficiently determined from the algebraic structure and operational constraints of metabolic networks. The concordant complexes imply the existence of concordance modules based on which the apparent complexity of 12 metabolic networks of organisms from all kingdoms of life can be reduced by at least 78%. A comparative analysis against an ensemble of randomized metabolic networks shows that the metabolic network of Escherichia coli contains fewer concordance modules and is, therefore, more tightly coordinated than expected by chance. Together, our findings demonstrate that metabolic networks are considerably simpler than what can be perceived from their structure alone.
The current trends of crop yield improvements are not expected to meet the projected rise in demand. Genomic selection uses molecular markers and machine learning to identify superior genotypes with improved traits, such as growth. Plant growth directly depends on rates of metabolic reactions which transform nutrients into the building blocks of biomass. Here, we predict growth of Arabidopsis thaliana accessions by employing genomic prediction of reaction rates estimated from accession-specific metabolic models. We demonstrate that, comparing to classical genomic selection on the available data sets for 67 accessions, our approach improves the prediction accuracy for growth within and across nitrogen environments by 32.6% and 51.4%, respectively, and from optimal nitrogen to low carbon environment by 50.4%. Therefore, integration of molecular markers into metabolic models offers an approach to predict traits directly related to metabolism, and its usefulness in breeding can be examined by gathering matching datasets in crops. An increase in genomic selection (GS) accuracy can accelerate genetic gain by shortening the breeding cycles. Here, the authors introduce a network-based GS method that uses metabolic models and improves the prediction accuracy of Arabidopsis growth within and across environments.