TY - JOUR A1 - Razaghi-Moghadam, Zahra A1 - Nikoloski, Zoran T1 - Supervised learning of gene-regulatory networks based on graph distance profiles of transcriptomics data JF - npj Systems biology and applications N2 - Characterisation of gene-regulatory network (GRN) interactions provides a stepping stone to understanding how genes affect cellular phenotypes. Yet, despite advances in profiling technologies, GRN reconstruction from gene expression data remains a pressing problem in systems biology. Here, we devise a supervised learning approach, GRADIS, which utilises support vector machine to reconstruct GRNs based on distance profiles obtained from a graph representation of transcriptomics data. By employing the data fromEscherichia coliandSaccharomyces cerevisiaeas well as synthetic networks from the DREAM4 and five network inference challenges, we demonstrate that our GRADIS approach outperforms the state-of-the-art supervised and unsupervided approaches. This holds when predictions about target genes for individual transcription factors as well as for the entire network are considered. We employ experimentally verified GRNs fromE. coliandS. cerevisiaeto validate the predictions and obtain further insights in the performance of the proposed approach. Our GRADIS approach offers the possibility for usage of other network-based representations of large-scale data, and can be readily extended to help the characterisation of other cellular networks, including protein-protein and protein-metabolite interactions. Y1 - 2020 U6 - https://doi.org/10.1038/s41540-020-0140-1 SN - 2056-7189 VL - 6 IS - 1 PB - Nature Publ. Group CY - London ER - TY - JOUR A1 - Razaghi-Moghadam, Zahra A1 - Nikoloski, Zoran T1 - GeneReg BT - a constraint-based approach for design of feasible metabolic engineering strategies at the gene level JF - Bioinformatics N2 - Motivation Large-scale metabolic models are widely used to design metabolic engineering strategies for diverse biotechnological applications. However, the existing computational approaches focus on alteration of reaction fluxes and often neglect the manipulations of gene expression to implement these strategies. Results Here, we find that the association of genes with multiple reactions leads to infeasibility of engineering strategies at the flux level, since they require contradicting manipulations of gene expression. Moreover, we identify that all of the existing approaches to design gene knockout strategies do not ensure that the resulting design may also require other gene alterations, such as up- or downregulations, to match the desired flux distribution. To address these issues, we propose a constraint-based approach, termed GeneReg, that facilitates the design of feasible metabolic engineering strategies at the gene level and that is readily applicable to large-scale metabolic networks. We show that GeneReg can identify feasible strategies to overproduce ethanol in Escherichia coli and lactate in Saccharomyces cerevisiae, but overproduction of the TCA cycle intermediates is not feasible in five organisms used as cell factories under default growth conditions. Therefore, GeneReg points at the need to couple gene regulation and metabolism to design rational metabolic engineering strategies. Y1 - 2020 U6 - https://doi.org/10.1093/bioinformatics/btaa996 SN - 1367-4803 SN - 1460-2059 VL - 37 IS - 12 SP - 1717 EP - 1723 PB - Oxford Univ. Press CY - Oxford ER - TY - GEN A1 - Razaghi-Moghadam, Zahra A1 - Nikoloski, Zoran T1 - Supervised learning of gene regulatory networks T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - Identifying the entirety of gene regulatory interactions in a biological system offers the possibility to determine the key molecular factors that affect important traits on the level of cells, tissues, and whole organisms. Despite the development of experimental approaches and technologies for identification of direct binding of transcription factors (TFs) to promoter regions of downstream target genes, computational approaches that utilize large compendia of transcriptomics data are still the predominant methods used to predict direct downstream targets of TFs, and thus reconstruct genome-wide gene-regulatory networks (GRNs). These approaches can broadly be categorized into unsupervised and supervised, based on whether data about known, experimentally verified gene-regulatory interactions are used in the process of reconstructing the underlying GRN. Here, we first describe the generic steps of supervised approaches for GRN reconstruction, since they have been recently shown to result in improved accuracy of the resulting networks? We also illustrate how they can be used with data from model organisms to obtain more accurate prediction of gene regulatory interactions. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1185 KW - gene expression profiles KW - gene regulatory networks KW - supervised learning KW - support vector machine Y1 - 2021 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-516561 SN - 1866-8372 ER - TY - JOUR A1 - Tong, Hao A1 - Küken, Anika A1 - Razaghi-Moghadam, Zahra A1 - Nikoloski, Zoran T1 - Characterization of effects of genetic variants via genome-scale metabolic modelling JF - Cellular and molecular life sciences : CMLS N2 - Genome-scale metabolic networks for model plants and crops in combination with approaches from the constraint-based modelling framework have been used to predict metabolic traits and design metabolic engineering strategies for their manipulation. With the advances in technologies to generate large-scale genotyping data from natural diversity panels and other populations, genome-wide association and genomic selection have emerged as statistical approaches to determine genetic variants associated with and predictive of traits. Here, we review recent advances in constraint-based approaches that integrate genetic variants in genome-scale metabolic models to characterize their effects on reaction fluxes. Since some of these approaches have been applied in organisms other than plants, we provide a critical assessment of their applicability particularly in crops. In addition, we further dissect the inferred effects of genetic variants with respect to reaction rate constants, abundances of enzymes, and concentrations of metabolites, as main determinants of reaction fluxes and relate them with their combined effects on complex traits, like growth. Through this systematic review, we also provide a roadmap for future research to increase the predictive power of statistical approaches by coupling them with mechanistic models of metabolism. KW - Single-nucleotide polymorphisms KW - Metabolic models KW - Genome-wide KW - association studies KW - Genomic selection Y1 - 2021 U6 - https://doi.org/10.1007/s00018-021-03844-4 SN - 1420-682X SN - 1420-9071 VL - 78 IS - 12 SP - 5123 EP - 5138 PB - Springer International Publishing AG CY - Cham ER - TY - JOUR A1 - Pries, Christopher A1 - Razaghi-Moghadam, Zahra A1 - Kopka, Joachim A1 - Nikoloski, Zoran T1 - Integration of relative metabolomics and transcriptomics time-course data in a metabolic model pinpoints effects of ribosome biogenesis defects on Arabidopsis thaliana metabolism JF - Scientific reports N2 - Ribosome biogenesis is tightly associated to plant metabolism due to the usage of ribosomes in the synthesis of proteins necessary to drive metabolic pathways. Given the central role of ribosome biogenesis in cell physiology, it is important to characterize the impact of different components involved in this process on plant metabolism. Double mutants of the Arabidopsis thaliana cytosolic 60S maturation factors REIL1 and REIL2 do not resume growth after shift to moderate 10 degrees C chilling conditions. To gain mechanistic insights into the metabolic effects of this ribosome biogenesis defect on metabolism, we developed TC-iReMet2, a constraint-based modelling approach that integrates relative metabolomics and transcriptomics time-course data to predict differential fluxes on a genome-scale level. We employed TC-iReMet2 with metabolomics and transcriptomics data from the Arabidopsis Columbia 0 wild type and the reil1-1 reil2-1 double mutant before and after cold shift. We identified reactions and pathways that are highly altered in a mutant relative to the wild type. These pathways include the Calvin-Benson cycle, photorespiration, gluconeogenesis, and glycolysis. Our findings also indicated differential NAD(P)/NAD(P)H ratios after cold shift. TC-iReMet2 allows for mechanistic hypothesis generation and interpretation of system biology experiments related to metabolic fluxes on a genome-scale level. Y1 - 2021 U6 - https://doi.org/10.1038/s41598-021-84114-y SN - 2045-2322 VL - 11 IS - 1 PB - Macmillan Publishers Limited, part of Springer Nature CY - London ER - TY - JOUR A1 - Seep, Lea A1 - Razaghi-Moghadam, Zahra A1 - Nikoloski, Zoran T1 - Reaction lumping in metabolic networks for application with thermodynamic metabolic flux analysis JF - Scientific reports N2 - Thermodynamic metabolic flux analysis (TMFA) can narrow down the space of steady-state flux distributions, but requires knowledge of the standard Gibbs free energy for the modelled reactions. The latter are often not available due to unknown Gibbs free energy change of formation ,Delta fG0, of metabolites. To optimize the usage of data on thermodynamics in constraining a model, reaction lumping has been proposed to eliminate metabolites with unknown Delta fG0. However, the lumping procedure has not been formalized nor implemented for systematic identification of lumped reactions. Here, we propose, implement, and test a combined procedure for reaction lumping, applicable to genome-scale metabolic models. It is based on identification of groups of metabolites with unknown Delta fG0 whose elimination can be conducted independently of the others via: (1) group implementation, aiming to eliminate an entire such group, and, if this is infeasible, (2) a sequential implementation to ensure that a maximal number of metabolites with unknown Delta fG0 are eliminated. Our comparative analysis with genome-scale metabolic models of Escherichia coli, Bacillus subtilis, and Homo sapiens shows that the combined procedure provides an efficient means for systematic identification of lumped reactions. We also demonstrate that TMFA applied to models with reactions lumped according to the proposed procedure lead to more precise predictions in comparison to the original models. The provided implementation thus ensures the reproducibility of the findings and their application with standard TMFA. Y1 - 2021 U6 - https://doi.org/10.1038/s41598-021-87643-8 SN - 2045-2322 VL - 11 IS - 1 PB - Macmillan Publishers Limited, part of Springer Nature CY - London ER - TY - JOUR A1 - Hashemi, Seirana A1 - Razaghi-Moghadam, Zahra A1 - Nikoloski, Zoran T1 - Identification of flux trade-offs in metabolic networks JF - Scientific reports N2 - Trade-offs are inherent to biochemical networks governing diverse cellular functions, from gene expression to metabolism. Yet, trade-offs between fluxes of biochemical reactions in a metabolic network have not been formally studied. Here, we introduce the concept of absolute flux trade-offs and devise a constraint-based approach, termed FluTO, to identify and enumerate flux trade-offs in a given genome-scale metabolic network. By employing the metabolic networks of Escherichia coli and Saccharomyces cerevisiae, we demonstrate that the flux trade-offs are specific to carbon sources provided but that reactions involved in the cofactor and prosthetic group biosynthesis are present in trade-offs across all carbon sources supporting growth. We also show that absolute flux trade-offs depend on the biomass reaction used to model the growth of Arabidopsis thaliana under different carbon and nitrogen conditions. The identified flux trade-offs reflect the tight coupling between nitrogen, carbon, and sulphur metabolisms in leaves of C-3 plants. Altogether, FluTO provides the means to explore the space of alternative metabolic routes reflecting the constraints imposed by inherent flux trade-offs in large-scale metabolic networks. Y1 - 2021 U6 - https://doi.org/10.1038/s41598-021-03224-9 SN - 2045-2322 VL - 11 IS - 1 PB - Macmillan Publishers Limited, part of Springer Nature CY - London ER - TY - JOUR A1 - Xu, Rudan A1 - Razaghi-Moghadam, Zahra A1 - Nikoloski, Zoran T1 - Maximization of non-idle enzymes improves the coverage of the estimated maximal in vivo enzyme catalytic rates in Escherichia coli JF - Bioinformatics N2 - Motivation: Constraint-based modeling approaches allow the estimation of maximal in vivo enzyme catalytic rates that can serve as proxies for enzyme turnover numbers. Yet, genome-scale flux profiling remains a challenge in deploying these approaches to catalogue proxies for enzyme catalytic rates across organisms. Results: Here, we formulate a constraint-based approach, termed NIDLE-flux, to estimate fluxes at a genome-scale level by using the principle of efficient usage of expressed enzymes. Using proteomics data from Escherichia coli, we show that the fluxes estimated by NIDLE-flux and the existing approaches are in excellent qualitative agreement (Pearson correlation > 0.9). We also find that the maximal in vivo catalytic rates estimated by NIDLE-flux exhibits a Pearson correlation of 0.74 with in vitro enzyme turnover numbers. However, NIDLE-flux results in a 1.4-fold increase in the size of the estimated maximal in vivo catalytic rates in comparison to the contenders. Integration of the maximum in vivo catalytic rates with publically available proteomics and metabolomics data provide a better match to fluxes estimated by NIDLE-flux. Therefore, NIDLE-flux facilitates more effective usage of proteomics data to estimate proxies for kcatomes. Y1 - 2021 U6 - https://doi.org/10.1093/bioinformatics/btab575 SN - 1367-4803 SN - 1460-2059 VL - 37 IS - 21 SP - 3848 EP - 3855 PB - Oxford Univ. Press CY - Oxford ER -