TY - JOUR A1 - Steinfath, Matthias A1 - Gärtner, Tanja A1 - Lisec, Jan A1 - Meyer, Rhonda Christiane A1 - Altmann, Thomas A1 - Willmitzer, Lothar A1 - Selbig, Joachim T1 - Prediction of hybrid biomass in Arabidopsis thaliana by selected parental SNP and metabolic markers JF - Theoretical and applied genetics : TAG ; international journal of plant breeding research N2 - A recombinant inbred line (RIL) population, derived from two Arabidopsis thaliana accessions, and the corresponding testcrosses with these two original accessions were used for the development and validation of machine learning models to predict the biomass of hybrids. Genetic and metabolic information of the RILs served as predictors. Feature selection reduced the number of variables (genetic and metabolic markers) in the models by more than 80% without impairing the predictive power. Thus, potential biomarkers have been revealed. Metabolites were shown to bear information on inherited macroscopic phenotypes. This proof of concept could be interesting for breeders. The example population exhibits substantial mid-parent biomass heterosis. The results of feature selection could therefore be used to shed light on the origin of heterosis. In this respect, mainly dominance effects were detected. KW - Quantitative Trait Locus KW - feature selection KW - Partial Little Square KW - recombinant inbred line KW - Quantitative Trait Locus analysis Y1 - 2009 U6 - https://doi.org/10.1007/s00122-009-1191-2 SN - 0040-5752 SN - 1432-2242 VL - 120 SP - 239 EP - 247 PB - Springer CY - Berlin ER - TY - JOUR A1 - Steinfath, Matthias A1 - Strehmel, Nadine A1 - Peters, Rolf A1 - Schauer, Nicolas A1 - Groth, Detlef A1 - Hummel, Jan A1 - Steup, Martin A1 - Selbig, Joachim A1 - Kopka, Joachim A1 - Geigenberger, Peter A1 - Dongen, Joost T. van T1 - Discovering plant metabolic biomarkers for phenotype prediction using an untargeted approach N2 - Biomarkers are used to predict phenotypical properties before these features become apparent and, therefore, are valuable tools for both fundamental and applied research. Diagnostic biomarkers have been discovered in medicine many decades ago and are now commonly applied. While this is routine in the field of medicine, it is of surprise that in agriculture this approach has never been investigated. Up to now, the prediction of phenotypes in plants was based on growing plants and assaying the organs of interest in a time intensive process. For the first time, we demonstrate in this study the application of metabolomics to predict agronomic important phenotypes of a crop plant that was grown in different environments. Our procedure consists of established techniques to screen untargeted for a large amount of metabolites in parallel, in combination with machine learning methods. By using this combination of metabolomics and biomathematical tools metabolites were identified that can be used as biomarkers to improve the prediction of traits. The predictive metabolites can be selected and used subsequently to develop fast, targeted and low-cost diagnostic biomarker assays that can be implemented in breeding programs or quality assessment analysis. The identified metabolic biomarkers allow for the prediction of crop product quality. Furthermore, marker-assisted selection can benefit from the discovery of metabolic biomarkers when other molecular markers come to its limitation. The described marker selection method was developed for potato tubers, but is generally applicable to any crop and trait as it functions independently of genomic information. Y1 - 2010 UR - http://www3.interscience.wiley.com/cgi-bin/issn?DESCRIPTOR=PRINTISSN&VALUE=1467-7644 U6 - https://doi.org/10.1111/j.1467-7652.2010.00516.x SN - 1467-7644 ER - TY - JOUR A1 - Steuer, Ralf A1 - Gross, Thilo A1 - Selbig, Joachim A1 - Blasius, Bernd T1 - Structural kinetic modeling of metabolic networks JF - Proceedings of the National Academy of Sciences of the United States of America N2 - To develop and investigate detailed mathematical models of metabolic processes is one of the primary challenges in systems biology. However, despite considerable advance in the topological analysis of metabolic networks, kinetic modeling is still often severely hampered by inadequate knowledge of the enzyme-kinetic rate laws and their associated parameter values. Here we propose a method that aims to give a quantitative account of the dynamical capabilities of a metabolic system, without requiring any explicit information about the functional form of the rate equations. Our approach is based on constructing a local linear model at each point in parameter space, such that each element of the model is either directly experimentally accessible or amenable to a straightforward biochemical interpretation. This ensemble of local linear models, encompassing all possible explicit kinetic models, then allows for a statistical exploration of the comprehensive parameter space. The method is exemplified on two paradigmatic metabolic systems: the glycolytic pathway of yeast and a realistic-scale representation of the photosynthetic Calvin cycle. KW - systems biology KW - computational biochemistry KW - metabolomics KW - metabolic regulation KW - biological robustness Y1 - 2006 U6 - https://doi.org/10.1073/pnas.0600013103 SN - 0027-8424 SN - 1091-6490 VL - 103 IS - 32 SP - 11868 EP - 11873 PB - National Academy of Sciences CY - Washington ER - TY - JOUR A1 - Steuer, Ralf A1 - Humburg, Peter A1 - Selbig, Joachim T1 - Validation and functional annotation of expression-based clusters based on gene ontology JF - BMC bioinformatics N2 - Background: The biological interpretation of large-scale gene expression data is one of the paramount challenges in current bioinformatics. In particular, placing the results in the context of other available functional genomics data, such as existing bio-ontologies, has already provided substantial improvement for detecting and categorizing genes of interest. One common approach is to look for functional annotations that are significantly enriched within a group or cluster of genes, as compared to a reference group. Results: In this work, we suggest the information-theoretic concept of mutual information to investigate the relationship between groups of genes, as given by data-driven clustering, and their respective functional categories. Drawing upon related approaches (Gibbons and Roth, Genome Research 12: 1574-1581, 2002), we seek to quantify to what extent individual attributes are sufficient to characterize a given group or cluster of genes. Conclusion: We show that the mutual information provides a systematic framework to assess the relationship between groups or clusters of genes and their functional annotations in a quantitative way. Within this framework, the mutual information allows us to address and incorporate several important issues, such as the interdependence of functional annotations and combinatorial combinations of attributes. It thus supplements and extends the conventional search for overrepresented attributes within a group or cluster of genes. In particular taking combinations of attributes into account, the mutual information opens the way to uncover specific functional descriptions of a group of genes or clustering result. All datasets and functional annotations used in this study are publicly available. All scripts used in the analysis are provided as additional files. Y1 - 2006 U6 - https://doi.org/10.1186/1471-2105-7-380 SN - 1471-2105 VL - 7 IS - 380 PB - BioMed Central CY - London ER - TY - JOUR A1 - Sulpice, Ronan A1 - Nikoloski, Zoran A1 - Tschoep, Hendrik A1 - Antonio, Carla A1 - Kleessen, Sabrina A1 - Larhlimi, Abdelhalim A1 - Selbig, Joachim A1 - Ishihara, Hirofumi A1 - Gibon, Yves A1 - Fernie, Alisdair R. A1 - Stitt, Mark T1 - Impact of the Carbon and Nitrogen Supply on Relationships and Connectivity between Metabolism and Biomass in a Broad Panel of Arabidopsis Accessions(1[W][OA]) JF - Plant physiology : an international journal devoted to physiology, biochemistry, cellular and molecular biology, biophysics and environmental biology of plants N2 - Natural genetic diversity provides a powerful tool to study the complex interrelationship between metabolism and growth. Profiling of metabolic traits combined with network-based and statistical analyses allow the comparison of conditions and identification of sets of traits that predict biomass. However, it often remains unclear why a particular set of metabolites is linked with biomass and to what extent the predictive model is applicable beyond a particular growth condition. A panel of 97 genetically diverse Arabidopsis (Arabidopsis thaliana) accessions was grown in near-optimal carbon and nitrogen supply, restricted carbon supply, and restricted nitrogen supply and analyzed for biomass and 54 metabolic traits. Correlation-based metabolic networks were generated from the genotype-dependent variation in each condition to reveal sets of metabolites that show coordinated changes across accessions. The networks were largely specific for a single growth condition. Partial least squares regression from metabolic traits allowed prediction of biomass within and, slightly more weakly, across conditions (cross-validated Pearson correlations in the range of 0.27-0.58 and 0.21-0.51 and P values in the range of <0.001-<0.13 and <0.001-<0.023, respectively). Metabolic traits that correlate with growth or have a high weighting in the partial least squares regression were mainly condition specific and often related to the resource that restricts growth under that condition. Linear mixed-model analysis using the combined metabolic traits from all growth conditions as an input indicated that inclusion of random effects for the conditions improves predictions of biomass. Thus, robust prediction of biomass across a range of conditions requires condition-specific measurement of metabolic traits to take account of environment-dependent changes of the underlying networks. Y1 - 2013 U6 - https://doi.org/10.1104/pp.112.210104 SN - 0032-0889 SN - 1532-2548 VL - 162 IS - 1 SP - 347 EP - 363 PB - American Society of Plant Physiologists CY - Rockville ER - TY - JOUR A1 - Sulpice, Ronan A1 - Pyl, Eva-Theresa A1 - Ishihara, Hirofumi A1 - Trenkamp, Sandra A1 - Steinfath, Matthias A1 - Witucka-Wall, Hanna A1 - Gibon, Yves A1 - Usadel, Björn A1 - Poree, Fabien A1 - Piques, Maria Conceicao A1 - von Korff, Maria A1 - Steinhauser, Marie Caroline A1 - Keurentjes, Joost J. B. A1 - Guenther, Manuela A1 - Hoehne, Melanie A1 - Selbig, Joachim A1 - Fernie, Alisdair R. A1 - Altmann, Thomas A1 - Stitt, Mark T1 - Starch as a major integrator in the regulation of plant growth N2 - Rising demand for food and bioenergy makes it imperative to breed for increased crop yield. Vegetative plant growth could be driven by resource acquisition or developmental programs. Metabolite profiling in 94 Arabidopsis accessions revealed that biomass correlates negatively with many metabolites, especially starch. Starch accumulates in the light and is degraded at night to provide a sustained supply of carbon for growth. Multivariate analysis revealed that starch is an integrator of the overall metabolic response. We hypothesized that this reflects variation in a regulatory network that balances growth with the carbon supply. Transcript profiling in 21 accessions revealed coordinated changes of transcripts of more than 70 carbon-regulated genes and identified 2 genes (myo-inositol-1- phosphate synthase, a Kelch-domain protein) whose transcripts correlate with biomass. The impact of allelic variation at these 2 loci was shown by association mapping, identifying them as candidate lead genes with the potential to increase biomass production. Y1 - 2009 UR - http://www.pnas.org/ U6 - https://doi.org/10.1073/pnas.0903478106 SN - 0027-8424 ER - TY - GEN A1 - Szymanski, Jedrzej A1 - Jozefczuk, Szymon A1 - Nikoloski, Zoran A1 - Selbig, Joachim A1 - Nikiforova, Victoria A1 - Catchpole, Gareth A1 - Willmitzer, Lothar T1 - Stability of metabolic correlations under changing environmental conditions in Escherichia coli : a systems approach N2 - Background: Biological systems adapt to changing environments by reorganizing their cellula r and physiological program with metabolites representing one important response level. Different stresses lead to both conserved and specific responses on the metabolite level which should be reflected in the underl ying metabolic network. Methodology/Principal Findings: Starting from experimental data obtained by a GC-MS based high-throughput metabolic profiling technology we here develop an approach that: (1) extracts network representations from metabolic conditiondependent data by using pairwise correlations, (2) determines the sets of stable and condition-dependent correlations based on a combination of statistical significance and homogeneity tests, and (3) can identify metabolites related to the stress response, which goes beyond simple ob servation s about the changes of metabolic concentrations. The approach was tested with Escherichia colias a model organism observed under four different environmental stress conditions (cold stress, heat stress, oxidative stress, lactose diau xie) and control unperturbed conditions. By constructing the stable network component, which displays a scale free topology and small-world characteristics, we demonstrated that: (1) metabolite hubs in this reconstructed correlation networks are significantly enriched for those contained in biochemical networks such as EcoCyc, (2) particular components of the stable network are enriched for functionally related biochemical path ways, and (3) ind ependently of the response scale, based on their importance in the reorganization of the cor relation network a set of metabolites can be identified which represent hypothetical candidates for adjusting to a stress-specific response. Conclusions/Significance: Network-based tools allowed the identification of stress-dependent and general metabolic correlation networks. This correlation-network-ba sed approach does not rely on major changes in concentration to identify metabolites important for st ress adaptation, but rather on the changes in network properties with respect to metabolites. This should represent a useful complementary technique in addition to more classical approaches. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - paper 147 KW - Small-world networks KW - saccharomyces-cerevisiae KW - trehalose synthesis KW - gene-expression KW - stress-response Y1 - 2009 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-45253 ER - TY - JOUR A1 - Timmer, Marco A1 - Theiss, Hans A1 - Jürchott, Katrin A1 - Ries, Christian A1 - Paron, Igor A1 - Franz, W. A1 - Selbig, Joachim A1 - Guo, Ketai A1 - Tonn, Jörg A1 - Schichor, Christian T1 - Stromal-Derived Factor 1a (Sdf-1a), a Homing Factor for Mesenchymal Progenitor Cells, Is Elevated in Tumor Tissue and Plasma of Glioma Patients N2 - Malignant gliomas are a fatal disease lacking sufficient possibilities for early diagnosis and chemical markers to detect remission or relapse. The recruitment of progenitor cells such as mesenchymal stem cells (MSC) is a main feature of gliomas. Stromal cell-derived factor-1 (SDF-1), a chemokine produced in glioma cell lines, enhances migration in MSC and has been associated with cell survival and apoptosis in gliomas. Therefore, this study was performed to evaluate (i) whether SDF-1 and its receptors are expressed in human malignant gliomas in situ and (ii) if SDF-1 might potentially play a role in recruiting MSCs into human glioma. In glioblastoma tissue, immunohistochemistry revealed that SDF-1 and its receptor CXCR4 are expressed in regions of angiogenesis and necrosis, and qPCR showed that SDF-1 is elevated. Public expression data indicated that CXCR4 was upregulated. The latter data also illustrate that SDF-1 could be up- or downregulated in glioma compared to normal brain in a transcript-specific manner. In plasma, SDF-1 is elevated in glioma patients. The level is reduced by both dexamethasone intake and surgery. Dexamethasone also decreased SDF-1 production in cells in vitro. The undirected migration of human MSC (hMSC) was not enhanced by the addition of SDF-1. However, SDF-1 stimulated directed invasion of hMSC in a dose-dependent manner. Taken together, we show that SDF-1 is a potent chemoattractant of progenitor cells such as hMSCs and that its expression is elevated in glioma tissue, which results in elevated SDF-1 levels in the patient's plasma samples with concomittant decrease after tumor resection. The fact that elevated SDF-1 plasma levels are significantly decreased after tumor surgery could be a first hint that SDF-1 might act as tumor marker for malignant gliomas in order to detect disease progression or remission, respectively. Y1 - 2010 UR - http://neuro-oncology.oxfordjournals.org/ SN - 1522-8517 ER -