Potato (Solanum tuberosum L.) is one of the most important food crops worldwide. Current potato varieties are highly susceptible to drought stress. In view of global climate change, selection of cultivars with improved drought tolerance and high yield potential is of paramount importance. Drought tolerance breeding of potato is currently based on direct selection according to yield and phenotypic traits and requires multiple trials under drought conditions. Marker-assisted selection (MAS) is cheaper, faster and reduces classification errors caused by noncontrolled environmental effects. We analysed 31 potato cultivars grown under optimal and reduced water supply in six independent field trials. Drought tolerance was determined as tuber starch yield. Leaf samples from young plants were screened for preselected transcript and nontargeted metabolite abundance using qRT-PCR and GC-MS profiling, respectively. Transcript marker candidates were selected from a published RNA-Seq data set. A Random Forest machine learning approach extracted metabolite and transcript markers for drought tolerance prediction with low error rates of 6% and 9%, respectively. Moreover, by combining transcript and metabolite markers, the prediction error was reduced to 4.3%. Feature selection from Random Forest models allowed model minimization, yielding a minimal combination of only 20 metabolite and transcript markers that were successfully tested for their reproducibility in 16 independent agronomic field trials. We demonstrate that a minimum combination of transcript and metabolite markers sampled at early cultivation stages predicts potato yield stability under drought largely independent of seasonal and regional agronomic conditions.
Motivation: Visualizing and analysing the potential non-linear structure of a dataset is becoming an important task in molecular biology. This is even more challenging when the data have missing values. Results: Here, we propose an inverse model that performs non-linear principal component analysis (NLPCA) from incomplete datasets. Missing values are ignored while optimizing the model, but can be estimated afterwards. Results are shown for both artificial and experimental datasets. In contrast to linear methods, non-linear methods were able to give better missing value estimations for non-linearly structured data. Application: We applied this technique to a time course of metabolite data from a cold stress experiment on the model plant Arabidopsis thaliana, and could approximate the mapping function from any time point to the metabolite responses. Thus, the inverse NLPCA provides greatly improved information for better understanding the complex response to cold stress.
The gene family of subtilisin-like serine proteases (subtilases) in Arabidopsis thaliana comprises 56 members, divided into six distinct subfamilies. Whereas the members of five subfamilies are similar to pyrolysins, two genes share stronger similarity to animal kexins. Mutant screens confirmed 144 T-DNA insertion lines with knockouts for 55 out of the 56 subtilases. Apart from SDD1, none of the confirmed homozygous mutants revealed any obvious visible phenotypic alteration during growth under standard conditions. Apart from this specific case, forward genetics gave us no hints about the function of the individual 54 non-characterized subtilase genes. Therefore, the main objective of our work was to overcome the shortcomings of the forward genetic approach and to infer alternative experimental approaches by using an integrative bioinformatics and biological approach. Computational analyses based on transcriptional co-expression and co-response patterns revealed at least two expression networks, suggesting that functional redundancy may exist among subtilases with limited similarity. Furthermore, two hubs were identified, which may be involved in signalling or may represent higher-order regulatory factors involved in responses to environmental cues. A particular enrichment of co-regulated genes with metabolic functions was observed for four subtilases, possibly representing late responsive elements of environmental stress. The kexin homologs show stronger associations with genes involved in transcriptional regulation. Based on the analyses presented here and in accordance with previously characterized subtilases, we propose three main functions of subtilases: involvement in (i) control of development, (ii) protein turnover, and (iii) action as downstream components of signalling cascades.
The comprehensive systems-biology database (CSB.DB) was used to reveal brassinosteroid (BR)-related genes from expression profiles based on co-response analyses. Genes exhibiting simultaneous changes in transcript levels are candidates for common transcriptional regulation. Combining numerous different experiments in data matrices allows outliers and conditional changes of transcript levels to be ruled out. CSB.DB was queried for transcriptional co-responses with the BR-signalling components BRI1 and BAK1: 301 out of 9694 genes represented in the nasc0271 database showed co-responses with both genes. As expected, these genes comprised pathway-involved genes (e.g. 72 BR-induced genes), because the BRI1 and BAK1 proteins are required for BR responses. Transcript co-response, however, takes the analysis a step further than direct approaches, because BR-related genes that are not themselves BR-responsive were also identified. Insights into networks and the functional context of genes are provided, because factors determining expression patterns are reflected in correlations. Our findings demonstrate that transcript co-response analysis is a valuable resource for uncovering common regulatory patterns of genes. Different data matrices in CSB.DB allow examination of specific biological questions. All matrices are publicly available through CSB.DB. This work presents one possible roadmap for using the CSB.DB resources.
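The co-response query described above can be illustrated with a minimal sketch. It assumes Pearson correlation across experiments as the co-response measure and a cutoff of 0.8; both are assumptions made for illustration, not the measure or threshold used by CSB.DB. The profile values and the names geneX and geneY are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equally long, non-constant profiles."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def co_responders(profiles, baits, threshold=0.8):
    """Return genes whose profile correlates with every bait gene.

    profiles maps gene name -> transcript levels across experiments.
    threshold is a hypothetical cutoff chosen for this example.
    """
    hits = []
    for gene, values in profiles.items():
        if gene in baits:
            continue
        if all(pearson(values, profiles[b]) >= threshold for b in baits):
            hits.append(gene)
    return sorted(hits)


# Hypothetical transcript levels in five experiments.
profiles = {
    "BRI1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "BAK1": [1.1, 2.1, 2.9, 4.2, 4.8],
    "geneX": [0.5, 1.0, 1.6, 2.1, 2.4],
    "geneY": [5.0, 1.0, 4.0, 2.0, 3.0],
}
candidates = co_responders(profiles, baits=("BRI1", "BAK1"))
```

Requiring a correlation with both bait genes mirrors the query for genes that co-respond with BRI1 and BAK1 at the same time; a rank-based coefficient could be substituted for `pearson` without changing the rest of the sketch.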
Biomarkers are used to predict phenotypical properties before these features become apparent and, therefore, are valuable tools for both fundamental and applied research. Diagnostic biomarkers were discovered in medicine many decades ago and are now commonly applied. While this is routine in the field of medicine, it is surprising that in agriculture this approach has never been investigated. Up to now, the prediction of phenotypes in plants was based on growing plants and assaying the organs of interest in a time-intensive process. For the first time, we demonstrate in this study the application of metabolomics to predict agronomically important phenotypes of a crop plant that was grown in different environments. Our procedure combines established techniques for the untargeted, parallel screening of a large number of metabolites with machine learning methods. By using this combination of metabolomics and biomathematical tools, metabolites were identified that can be used as biomarkers to improve the prediction of traits. The predictive metabolites can be selected and used subsequently to develop fast, targeted and low-cost diagnostic biomarker assays that can be implemented in breeding programs or quality assessment analysis. The identified metabolic biomarkers allow for the prediction of crop product quality. Furthermore, marker-assisted selection can benefit from the discovery of metabolic biomarkers when other molecular markers reach their limits. The described marker selection method was developed for potato tubers, but is generally applicable to any crop and trait as it functions independently of genomic information.
We applied a top-down systems biology approach to understand how Chlamydomonas reinhardtii acclimates to long-term heat stress (HS) and recovers from it. For this, we shifted cells from 25 to 42 °C for 24 h and back to 25 °C for ≥8 h and monitored abundances of 1856 proteins/protein groups, 99 polar and 185 lipophilic metabolites, and cytological and photosynthesis parameters. Our data indicate that acclimation of Chlamydomonas to long-term HS consists of a temporally ordered, orchestrated implementation of response elements at various system levels. These comprise (1) cell cycle arrest; (2) catabolism of larger molecules to generate compounds with roles in stress protection; (3) accumulation of molecular chaperones to restore protein homeostasis together with compatible solutes; (4) redirection of photosynthetic energy and reducing power from the Calvin cycle to the de novo synthesis of saturated fatty acids to replace polyunsaturated ones in membrane lipids, which are deposited in lipid bodies; and (5) when sinks for photosynthetic energy and reducing power are depleted, resumption of Calvin cycle activity associated with increased photorespiration, accumulation of reactive oxygen species scavengers, and throttling of linear electron flow by antenna uncoupling. During recovery from HS, cells appear to focus on processes allowing rapid resumption of growth rather than restoring pre-HS conditions.
Two-dimensional gas chromatography coupled to time-of-flight mass spectrometry (GCxGC-TOF-MS) is a promising technique to overcome limits of complex metabolome analysis using one-dimensional GC-TOF-MS. Especially at the stage of data export and data mining, however, convenient procedures to cope with the complexity of GCxGC-TOF-MS data are still in development. Here, we present a high-sample-throughput protocol exploiting the first and second retention index for spectral library search and subsequent construction of a high-dimensional data matrix useful for statistical analysis. The method was applied to the analysis of ¹³C-labelling experiments in the unicellular green alga Chlamydomonas reinhardtii. We developed a rapid sampling and extraction procedure for the Chlamydomonas reinhardtii laboratory strain CC503, a cell wall-deficient mutant. By testing all published quenching protocols, we observed dramatic leakage rates for certain metabolites. To circumvent metabolite leakage, samples were directly quenched and analyzed without separation of the medium. The growth medium was adapted to this rapid sampling protocol to avoid interference with GCxGC-TOF-MS analysis. To analyse batches of samples, a new software tool, MetMax, was implemented, which extracts the isotopomer matrix from stable isotope labelling experiments together with the first and second retention index (RI1 and RI2). To exploit RI1 and RI2 for metabolite identification, we used the Golm Metabolome Database (GMD) [1] with RI1/RI2 reference spectra and new search algorithms. Using those techniques, we analysed the dynamics of ¹³CO₂ and ¹³C-acetate uptake in Chlamydomonas reinhardtii cells in two different steady states, namely photoautotrophic and mixotrophic growth conditions.
Background: Recent studies using transcript and metabolite profiles of wild-type and gene deletion mutants revealed that photorespiratory pathways are essential for the growth of Synechocystis sp. PCC 6803 under atmospheric conditions. Pool size changes of primary metabolites, such as glycine and glycolate, indicated a link to photorespiration.
Methodology/Principal Findings: The ¹³C labelling kinetics of primary metabolites were analysed in photoautotrophically grown cultures of Synechocystis sp. PCC 6803 by gas chromatography-mass spectrometry (GC-MS) to demonstrate the link with photorespiration. Cells pre-acclimated to high CO₂ (5%, HC) or limited CO₂ (0.035%, LC) conditions were pulse-labelled under very high (2% w/w) ¹³C-NaHCO₃ (VHC) conditions followed by treatment with ambient ¹²C at HC and LC conditions, respectively. The ¹³C enrichment, relative changes in pool size, and ¹³C flux of selected metabolites were evaluated. We demonstrate two major paths of CO₂ assimilation via Rubisco in Synechocystis, i.e., from 3PGA via PEP to aspartate, malate and citrate or, to a lesser extent, from 3PGA via glucose-6-phosphate to sucrose. The results reveal evidence of carbon channelling from 3PGA to the PEP pool. Furthermore, ¹³C labelling of glycolate was observed under conditions thought to suppress photorespiration. Using the glycolate-accumulating ΔglcD1 mutant, we demonstrate enhanced ¹³C partitioning into the glycolate pool under conditions favouring photorespiration and enhanced ¹³C partitioning into the glycine pool of the glycine-accumulating ΔgcvT mutant. Under LC conditions, the photorespiratory mutants ΔglcD1 and ΔgcvT showed enhanced activity of the additional carbon-fixing PEP carboxylase pathway.
Conclusions/Significance: With our approach of non-steady-state ¹³C labelling and analysis of metabolite pool sizes with respective ¹³C enrichments, we identify the use and modulation of major pathways of carbon assimilation in Synechocystis in the presence of high and low inorganic carbon supplies.
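As an illustration of what a ¹³C enrichment value expresses, the sketch below computes the fractional enrichment of a mass fragment from its isotopologue intensities (M+0, M+1, ...). This is the commonly used weighted-average formula, not necessarily the calculation used in the study; it assumes the intensities have already been corrected for natural isotope abundance, and the numbers are hypothetical.

```python
def fractional_enrichment(intensities):
    """Fraction of carbon positions carrying 13C in a mass fragment.

    intensities[i] is the abundance of the isotopologue M+i, so a
    fragment with n carbon atoms is described by n + 1 values.
    Assumes natural-abundance correction has been applied upstream.
    """
    n_carbons = len(intensities) - 1
    total = sum(intensities)
    if n_carbons < 1 or total <= 0:
        raise ValueError("need at least M+0 and M+1 with a positive total")
    labelled = sum(i * m for i, m in enumerate(intensities))
    return labelled / (n_carbons * total)


# Hypothetical three-carbon fragment before and after a 13C pulse.
before_pulse = fractional_enrichment([100.0, 0.0, 0.0, 0.0])
after_pulse = fractional_enrichment([40.0, 20.0, 20.0, 20.0])
```

Tracking this value over the pulse and chase periods, together with the pool size of the same metabolite, gives the kind of labelling kinetics the abstract refers to.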
Climate models predict an increased likelihood of seasonal droughts for many areas of the world. Breeding for drought tolerance could be accelerated by marker-assisted selection. As a basis for marker identification, we studied the genetic variance, predictability of field performance and potential costs of tolerance in potato (Solanum tuberosum L.). Potato produces many calories per unit of water invested, but is drought-sensitive. In 14 independent pot or field trials, 34 potato cultivars were grown under optimal and reduced water supply to determine starch yield. In an artificial dataset, we tested several stress indices for their power to distinguish tolerant and sensitive genotypes independent of their yield potential. We identified the deviation of relative starch yield from the experimental median (DRYM) as the most efficient index. DRYM corresponded qualitatively to the partial least squares model-based metric of drought stress tolerance in a stress effect model. The DRYM identified tolerance variation in the European potato cultivar population that is significant enough to allow tolerance breeding and marker identification. Tolerance results from pot trials correlated with those from field trials but predicted field performance worse than field growth parameters. Drought tolerance correlated negatively with yield under optimal conditions in the field. The distribution of yield data versus DRYM indicated that tolerance can be combined with average yield potentials, thus circumventing potential yield penalties in tolerance breeding.
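The DRYM index can be sketched from its name alone. The version below assumes that relative starch yield is the yield under reduced water supply divided by the yield under optimal supply for the same cultivar, and that the median is taken over all cultivars in one experiment; the abstract does not spell out either detail, and the cultivar names and yields are hypothetical.

```python
from statistics import median


def drym(drought_yield, control_yield):
    """Deviation of relative starch yield from the experimental median.

    Both arguments map cultivar name -> starch yield in one experiment
    (same units). Positive values indicate above-median tolerance.
    """
    # Relative starch yield: drought yield as a fraction of control yield.
    relative = {c: drought_yield[c] / control_yield[c] for c in drought_yield}
    experiment_median = median(relative.values())
    return {c: r - experiment_median for c, r in relative.items()}


# Hypothetical starch yields for three cultivars in one trial.
control = {"A": 100.0, "B": 80.0, "C": 120.0}
drought = {"A": 70.0, "B": 64.0, "C": 60.0}
scores = drym(drought, control)
```

In this toy trial, cultivar B has the lowest yield under optimal supply but the highest score, which illustrates how an index based on relative yield can separate tolerance from yield potential.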