Refine
Year of publication
Document Type
- Article (16)
- Habilitation Thesis (1)
- Postprint (1)
Language
- English (18)
Is part of the Bibliography
- yes (18)
Keywords
- Arabidopsis (2)
- Metabolomics (2)
- drought tolerance (2)
- machine learning (2)
- metabolite markers (2)
- potato (Solanum tuberosum) (2)
- prediction models (2)
- transcript markers (2)
- Biomass (1)
- Brassinosteroids (1)
Potato (Solanum tuberosum L.) is one of the most important food crops worldwide. Current potato varieties are highly susceptible to drought stress. In view of global climate change, selection of cultivars with improved drought tolerance and high yield potential is of paramount importance. Drought tolerance breeding of potato is currently based on direct selection according to yield and phenotypic traits and requires multiple trials under drought conditions. Marker-assisted selection (MAS) is cheaper, faster and reduces classification errors caused by noncontrolled environmental effects. We analysed 31 potato cultivars grown under optimal and reduced water supply in six independent field trials. Drought tolerance was determined as tuber starch yield. Leaf samples from young plants were screened for preselected transcript and nontargeted metabolite abundance using qRT-PCR and GC-MS profiling, respectively. Transcript marker candidates were selected from a published RNA-Seq data set. A Random Forest machine learning approach extracted metabolite and transcript markers for drought tolerance prediction with low error rates of 6% and 9%, respectively. Moreover, by combining transcript and metabolite markers, the prediction error was reduced to 4.3%. Feature selection from Random Forest models allowed model minimization, yielding a minimal combination of only 20 metabolite and transcript markers that were successfully tested for their reproducibility in 16 independent agronomic field trials. We demonstrate that a minimum combination of transcript and metabolite markers sampled at early cultivation stages predicts potato yield stability under drought largely independent of seasonal and regional agronomic conditions.
Motivation: Visualizing and analysing the potential non-linear structure of a dataset is becoming an important task in molecular biology. This is even more challenging when the data have missing values. Results: Here, we propose an inverse model that performs non-linear principal component analysis (NLPCA) from incomplete datasets. Missing values are ignored while optimizing the model, but can be estimated afterwards. Results are shown for both artificial and experimental datasets. In contrast to linear methods, non-linear methods were able to give better missing value estimations for non-linear structured data. Application: We applied this technique to a time course of metabolite data from a cold stress experiment on the model plant Arabidopsis thaliana, and could approximate the mapping function from any time point to the metabolite responses. Thus, the inverse NLPCA provides greatly improved information for better understanding the complex response to cold stress
The gene family of subtilisin-like serine proteases (subtilases) in Arabidopsis thaliana comprises 56 members, divided into six distinct subfamilies. Whereas the members of five subfamilies are similar to pyrolysins, two genes share stronger similarity to animal kexins. Mutant screens confirmed 144 T-DNA insertion lines with knockouts for 55 out of the 56 subtilases. Apart from SDD1, none of the confirmed homozygous mutants revealed any obvious visible phenotypic alteration during growth under standard conditions. Apart from this specific case, forward genetics gave us no hints about the function of the individual 54 non-characterized subtilase genes. Therefore, the main objective of our work was to overcome the shortcomings of the forward genetic approach and to infer alternative experimental approaches by using an integrative biolinformatics and biological approach. Computational analyses based on transcriptional co-expression and co-response pattern revealed at least two expression networks, suggesting that functional redundancy may exist among subtilases with limited similarity. Furthermore, two hubs were identified, which may be involved in signalling or may represent higher-order regulatory factors involved in responses to environmental cues. A particular enrichment of co- regulated genes with metabolic functions was observed for four subtilases possibly representing late responsive elements of environmental stress. The kexin homologs show stronger associations with genes of transcriptional regulation context. Based on the analyses presented here and in accordance with previously characterized subtilases, we propose three main functions of subtilases: involvement in (i) control of development, (ii) protein turnover, and (iii) action as downstream components of signalling cascades
The comprehensive systems-biology database (CSB.DB) was used to reveal brassinosteroid (BR)-related genes from expression profiles based on co-response analyses. Genes exhibiting simultaneous changes in transcript levels are candidates of common transcriptional regulation. Combining numerous different experiments in data matrices allows ruling out outliers and conditional changes of transcript levels. CSB.DB was queried for transcriptional co-responses with the BR-signalling components BRI1 and BAK1: 301 out of 9694 genes represented in the nasc0271 database showed co-responses with both genes. As expected, these genes comprised pathway-involved genes (e.g. 72 BR-induced genes), because the BRI1 and BAK1 proteins are required for BR-responses. But transcript co-response takes the analysis a step further compared with direct approaches because BR-related non BR-responsive genes were identified. Insights into networks and the functional context of genes are provided, because factors determining expression patterns are reflected in correlations. Our findings demonstrate that transcript co-response analysis presents a valuable resource to uncover common regulatory patterns of genes. Different data matrices in CSB.DB allow examination of specific biological questions. All matrices are publicly available through CSB.DB. This work presents one possible roadmap to use the CSB.DB resources
Two dimensional gas chromatography coupled to time-of-flight mass spectrometry (GCxGC-TOF-MS) is a promising technique to overcome limits of complex metabolome analysis using one dimensional GC-TOF-MS. Especially at the stage of data export and data mining, however, convenient procedures to cope with the complexity of GCxGC-TOF-MS data are still in development. Here, we present a high sample throughput protocol exploiting first and second retention index for spectral library search and subsequent construction of a high dimensional data matrix useful for statistical analysis. The method was applied to the analysis of 13 C-labelling experiments in the unicellular green alga Chlamydomonas reinhardtii. We developed a rapid sampling and extraction procedure for Chlamydomonas reinhardtii laboratory strain (CC503), a cell wall deficient mutant. By testing all published quenching protocols we observed dramatic metabolite leakage rates for certain metabolites. To circumvent metabolite leakage, samples were directly quenched and analyzed without separation of the medium. The growth medium was adapted to this rapid sampling protocol to avoid interference with GCxGC-TOF-MS analysis. To analyse batches of samples a new software tool, MetMax, was implemented which extracts the isotopomer matrix from stable isotope labelling experiments together with the first and second retention index (RI1 and RI2). To exploit RI1 and RI2 for metabolite identification we used the Golm metabolome database (GMD [1] with RI1/ RI2-reference spectra and new search algorithms. Using those techniques we analysed the dynamics of (CO2)-C-13 and C-13- acetate uptake in Chlamydomonas reinhardtii cells in two different steady states namely photoautotrophic and mixotrophic growth conditions.
Background: The identification of brassinosteroid (BR) deficient and BR insensitive mutants provided conclusive evidence that BR is a potent growth-promoting phytohormone. Arabidopsis mutants are characterized by a compact rosette structure, decreased plant height and reduced root system, delayed development, and reduced fertility. Cell expansion, cell division, and multiple developmental processes depend on BR. The molecular and physiological basis of BR action is diverse. The BR signalling pathway controls the activity of transcription factors, and numerous BR responsive genes have been identified. The analysis of dwarf mutants, however, may to some extent reveal phenotypic changes that are an effect of the altered morphology and physiology. This restriction holds particularly true for the analysis of established organs such as rosette leaves. Results: In this study, the mode of BR action was analysed in established leaves by means of two approaches. First, an inhibitor of BR biosynthesis (brassinazole) was applied to 21-day-old wild-type plants. Secondly, BR complementation of BR deficient plants, namely CPD (constitutive photomorphogenic dwarf)-antisense and cbb1 (cabbage1) mutant plants was stopped after 21 days. BR action in established leaves is associated with stimulated cell expansion, an increase in leaf index, starch accumulation, enhanced CO2 release by the tricarboxylic acid cycle, and increased biomass production. Cell number and protein content were barely affected. Conclusion: Previous analysis of BR promoted growth focused on genomic effects. However, the link between growth and changes in gene expression patterns barely provided clues to the physiological and metabolic basis of growth. Our study analysed comprehensive metabolic data sets of leaves with altered BR levels. The data suggest that BR promoted growth may depend on the increased provision and use of carbohydrates and energy. BR may stimulate both anabolic and catabolic pathways.
Background: Recent studies using transcript and metabolite profiles of wild-type and gene deletion mutants revealed that photorespiratory pathways are essential for the growth of Synechocystis sp. PCC 6803 under atmospheric conditions. Pool size changes of primary metabolites, such as glycine and glycolate, indicated a link to photorespiration.
Methodology/Principal Findings: The (13)C labelling kinetics of primary metabolites were analysed in photoautotrophically grown cultures of Synechocystis sp. PCC 6803 by gas chromatography-mass spectrometry (GC-MS) to demonstrate the link with photorespiration. Cells pre-acclimated to high CO(2) (5%, HC) or limited CO(2) (0.035%, LC) conditions were pulse-labelled under very high (2% w/w) (13)C-NaHCO(3) (VHC) conditions followed by treatment with ambient (12)C at HC and LC conditions, respectively. The (13)C enrichment, relative changes in pool size, and (13)C flux of selected metabolites were evaluated. We demonstrate two major paths of CO(2) assimilation via Rubisco in Synechocystis, i.e., from 3PGA via PEP to aspartate, malate and citrate or, to a lesser extent, from 3PGA via glucose-6-phosphate to sucrose. The results reveal evidence of carbon channelling from 3PGA to the PEP pool. Furthermore, (13)C labelling of glycolate was observed under conditions thought to suppress photorespiration. Using the glycolate-accumulating Delta glcD1 mutant, we demonstrate enhanced (13)C partitioning into the glycolate pool under conditions favouring photorespiration and enhanced (13)C partitioning into the glycine pool of the glycine-accumulating Delta gcvT mutant. Under LC conditions, the photorespiratory mutants Delta glcD1 and Delta gcvT showed enhanced activity of the additional carbon-fixing PEP carboxylase pathway.
Conclusions/Significance: With our approach of non-steady-state (13)C labelling and analysis of metabolite pool sizes with respective (13)C enrichments, we identify the use and modulation of major pathways of carbon assimilation in Synechocystis in the presence of high and low inorganic carbon supplies.
Potato (Solanum tuberosum L.) is one of the most important food crops worldwide. Current potato varieties are highly susceptible to drought stress. In view of global climate change, selection of cultivars with improved drought tolerance and high yield potential is of paramount importance. Drought tolerance breeding of potato is currently based on direct selection according to yield and phenotypic traits and requires multiple trials under drought conditions. Marker‐assisted selection (MAS) is cheaper, faster and reduces classification errors caused by noncontrolled environmental effects. We analysed 31 potato cultivars grown under optimal and reduced water supply in six independent field trials. Drought tolerance was determined as tuber starch yield. Leaf samples from young plants were screened for preselected transcript and nontargeted metabolite abundance using qRT‐PCR and GC‐MS profiling, respectively. Transcript marker candidates were selected from a published RNA‐Seq data set. A Random Forest machine learning approach extracted metabolite and transcript markers for drought tolerance prediction with low error rates of 6% and 9%, respectively. Moreover, by combining transcript and metabolite markers, the prediction error was reduced to 4.3%. Feature selection from Random Forest models allowed model minimization, yielding a minimal combination of only 20 metabolite and transcript markers that were successfully tested for their reproducibility in 16 independent agronomic field trials. We demonstrate that a minimum combination of transcript and metabolite markers sampled at early cultivation stages predicts potato yield stability under drought largely independent of seasonal and regional agronomic conditions.
Climate models predict an increased likelihood of seasonal droughts for many areas of the world. Breeding for drought tolerance could be accelerated by marker-assisted selection. As a basis for marker identification, we studied the genetic variance, predictability of field performance and potential costs of tolerance in potato (Solanum tuberosum L.). Potato produces high calories per unit of water invested, but is drought-sensitive. In 14 independent pot or field trials, 34 potato cultivars were grown under optimal and reduced water supply to determine starch yield. In an artificial dataset, we tested several stress indices for their power to distinguish tolerant and sensitive genotypes independent of their yield potential. We identified the deviation of relative starch yield from the experimental median (DRYM) as the most efficient index. DRYM corresponded qualitatively to the partial least square model-based metric of drought stress tolerance in a stress effect model. The DRYM identified significant tolerance variation in the European potato cultivar population to allow tolerance breeding and marker identification. Tolerance results from pot trials correlated with those from field trials but predicted field performance worse than field growth parameters. Drought tolerance correlated negatively with yield under optimal conditions in the field. The distribution of yield data versus DRYM indicated that tolerance can be combined with average yield potentials, thus circumventing potential yield penalties in tolerance breeding.
The uptake of nutrients and their subsequent chemical conversion by reactions which provide energy and building blocks for growth and propagation is a fundamental property of life. This property is termed metabolism. In the course of evolution life has been dependent on chemical reactions which generate molecules that are common and indispensable to all life forms. These molecules are the so-called primary metabolites. In addition, life has evolved highly diverse biochemical reactions. These reactions allow organisms to produce unique molecules, the so-called secondary metabolites, which provide a competitive advantage for survival. The sum of all metabolites produced by the complex network of reactions within an organism has since 1998 been called the metabolome. The size of the metabolome can only be estimated and may range from less than 1,000 metabolites in unicellular organisms to approximately 200,000 in the whole plant kingdom. In current biology, three additional types of molecules are thought to be important to the understanding of the phenomena of life: (1) the proteins, in other words the proteome, including enzymes which perform the metabolic reactions, (2) the ribonucleic acids (RNAs) which constitute the so-called transcriptome, and (3) all genes of the genome which are encoded within the double strands of desoxyribonucleic acid (DNA). Investigations of each of these molecular levels of life require analytical technologies which should best enable the comprehensive analysis of all proteins, RNAs, et cetera. At the beginning of this thesis such analytical technologies were available for DNA, RNA and proteins, but not for metabolites. Therefore, this thesis was dedicated to the implementation of the gas chromatography – mass spectrometry technology, in short GC-MS, for the in-parallel analysis of as many metabolites as possible. Today GC-MS is one of the most widely applied technologies and indispensable for the efficient profiling of primary metabolites. The main achievements and research topics of this work can be divided into technological advances and novel insights into the metabolic mechanisms which allow plants to cope with environmental stresses. Firstly, the GC-MS profiling technology has been highly automated and standardized. The major technological achievements were (1) substantial contributions to the development of automated and, within the limits of GC-MS, comprehensive chemical analysis, (2) contributions to the implementation of time of flight mass spectrometry for GC-MS based metabolite profiling, (3) the creation of a software platform for reproducible GC-MS data processing, named TagFinder, and (4) the establishment of an internationally coordinated library of mass spectra which allows the identification of metabolites in diverse and complex biological samples. In addition, the Golm Metabolome Database (GMD) has been initiated to harbor this library and to cope with the increasing amount of generated profiling data. This database makes publicly available all chemical information essential for GC-MS profiling and has been extended to a global resource of GC-MS based metabolite profiles. Querying the concentration changes of hundreds of known and yet non-identified metabolites has recently been enabled by uploading standardized, TagFinder-processed data. Long-term technological aims have been pursued with the central aims (1) to enhance the precision of absolute and relative quantification and (2) to enable the combined analysis of metabolite concentrations and metabolic flux. In contrast to concentrations which provide information on metabolite amounts, flux analysis provides information on the speed of biochemical reactions or reaction sequences, for example on the rate of CO2 conversion into metabolites. This conversion is an essential function of plants which is the basis of life on earth. Secondly, GC-MS based metabolite profiling technology has been continuously applied to advance plant stress physiology. These efforts have yielded a detailed description of and new functional insights into metabolic changes in response to high and low temperatures as well as common and divergent responses to salt stress among higher plants, such as Arabidopsis thaliana, Lotus japonicus and rice (Oryza sativa). Time course analysis after temperature stress and investigations into salt dosage responses indicated that metabolism changed in a gradual manner rather than by stepwise transitions between fixed states. In agreement with these observations, metabolite profiles of the model plant Lotus japonicus, when exposed to increased soil salinity, were demonstrated to have a highly predictive power for both NaCl accumulation and plant biomass. Thus, it may be possible to use GC-MS based metabolite profiling as a breeding tool to support the selection of individual plants that cope best with salt stress or other environmental challenges.