Refine
Year of publication
Document Type
- Article (10)
- Doctoral Thesis (10)
- Postprint (6)
- Review (1)
Language
- English (27)
Is part of the Bibliography
- yes (27)
Keywords
- metabolomics (27) (remove)
The inclusion of exotic germplasm serves as a crucial means to enhance allelic and
consequently phenotypic diversity in inbred crop species. Such species have experienced a reduction in diversity due to artificial selection focused on a limited set of traits. The natural biodiversity within ecosystems presents an opportunity to explore various traits influencing plant survival, reproductive fitness and yield potential. In agricultural research, the study of wild species closely related to cultivated plants serves as a means to comprehend the genetic foundations of past domestication events and the polymorphisms essential for future breeding efforts to develop superior varieties. In order to examine the metabolic composition, pinpoint quantitative trait loci (QTL) and facilitate their resolution an extensive large-scale analysis of metabolic QTL (mQTL) was conducted on tomato backcross inbred lines (BILs) derived from a cross between the wild species S. pennellii (5240) incorporated into the background of S. lycopersicum cv. LEA determinate inbred which can be grown in open fields and cv. TOP indeterminate which can be grown in greenhouse conditions. A large number of mQTL associated with primary secondary and lipid metabolism in fruit were identified across the two BIL populations. Epistasis, the interactions between genes at different loci, has been an interest in molecular and quantitative genetics for many decades. The study of epistasis requires the analysis of very large populations with multiple independent genotypes that carry specific genomic regions. In order to understand the genetic basis of tomato fruit metabolism, I extended the work to investigate epistatic interactions of the genomic regions. In addition, two candidate genes were identified through quantitative trait loci underlying fruit-specific sucrose and jasmonic acid derivatives. Finally, in this study, I assessed the genetic framework of fruit metabolic traits with a high level of detail, utilizing the newly created Solanum pennellii (5240) backcrossed introgression lines (n=3000). This investigation resulted in the discovery of promising candidate loci associated with significant fruit quality traits, including those to the abundance of glutamic acid and aspartic acid crucial elements contributing to the development of acidity and flavors.
Aging is a complex process characterized by several factors, including loss of genetic and epigenetic information, accumulation of chronic oxidative stress, protein damage and aggregates and it is becoming an emergent drug target. Therefore, it is the utmost importance to study aging and agerelated diseases, to provide treatments to develop a healthy aging process. Skeletal muscle is one of the earliest tissues affected by age-related changes with progressive loss of muscle mass and function from 30 years old, effect known as sarcopenia. Several studies have shown the accumulation of protein aggregates in different animal models, as well as in humans, suggesting impaired proteostasis, a hallmark of aging, especially regarding degradation systems. Thus, different publications have explored the role of the main proteolytic systems in skeletal muscle from rodents and humans, like ubiquitin proteasomal system (UPS) and autophagy lysosomal system (ALS), however with contradictory results. Yet, most of the published studies are performed in muscles that comprise more than one fiber type, that means, muscles composed by slow and fast fibers. These fiber types, exhibit different metabolism and contraction speed; the slow fibers or type I display an oxidative metabolism, while fast fibers function towards a glycolytic metabolism ranging from fast oxidative to fast glycolytic fibers. To this extent, the aim of this thesis sought to understand on how aging impacts both fiber types not only regarding proteostasis but also at a metabolome and transcriptome network levels. Therefore, the first part of this thesis, presents the differences between slow oxidative (from Soleus muscle) and fast glycolytic fibers (Extensor digitorum longus, EDL) in terms of degradation systems and how they cope with oxidative stress during aging, while the second part explores the differences between young and old EDL muscle transcriptome and metabolome, unraveling molecular features. More specifically, the results from the present work show that slow oxidative muscle performs better at maintaining the function of UPS and ALS during aging than EDL muscle, which is clearly affected, accounting for the decline in the catalytic activity rates and accumulation of autophagy-related proteins. Strinkingly, transcriptome and metabolome analyses reveal that fast glycolytic muscle evidences significant downregulation of mitochondrial related processes and damaged mitochondria morphology during aging, despite of having a lower oxidative metabolism compared to oxidative fibers. Moreover, predictive analyses reveal a negative association between aged EDL gene signature and lifespan extending interventions such as caloric restriction (CR). Although, CR intervention does not alter the levels of mitochondrial markers in aged EDL muscle, it can reverse the higher mRNA levels of muscle damage markers. Together, the results from this thesis give new insights about how different metabolic muscle fibers cope with age-related changes and why fast glycolytic fibers are more susceptible to aging than slow oxidative fibers.
Plant metabolism serves as the primary mechanism for converting assimilated carbon into essential compounds crucial for plant growth and ultimately, crop yield. This renders it a focal point of research with significant implications. Despite notable strides in comprehending the genetic principles underpinning metabolism and yield, there remains a dearth of knowledge regarding the genetic factors responsible for trait variation under varying environmental conditions. Given the burgeoning global population and the advancing challenges posed by climate change, unraveling the intricacies of metabolic and yield responses to water scarcity became increasingly important in safeguarding food security.
Our research group has recently started to work on the genetic resources of legume species. To this end, the study presented here investigates the metabolic diversity across five different legume species at a tissue level, identifying species-specific biosynthesis of alkaloids as well as iso-/flavonoids with diverse functional groups, namely prenylation, phenylacylation as well as methoxylation, to create a resource for follow up studies investigation the metabolic diversity in natural diverse populations of legume species.
Following this, the second study investigates the genetic architecture of drought-induced changes in a global common bean population. Here, a plethora of quantitative trait loci (QTL) associated with various traits are identified by performing genome-wide association studies (GWAS), including for lipid signaling. On this site, overexpression of candidates highlighted the induction of several oxylipins reported to be pivotal in coping with harsh environmental conditions such as water scarcity.
Diverging from the common bean and GWAS, the following study focuses on identifying drought-related QTL in tomato using a bi-parental breeding population. This descriptive study highlights novel multi-omic QTL, including metabolism, photosynthesis as well as fruit setting, some of which are uniquely assigned under drought. Compared to conventional approaches using the bi-parental IL population, the study presented improves the resolution by assessing further backcrossed ILs, named sub-ILs.
In the final study, a photosynthetic gene, namely a PetM subunit of the cytochrome b6f complex encoding gene, involved in electron flow is characterized in an horticultural important crop. While several advances have been made in model organisms, this study highlights the transition of this fundamental knowledge to horticultural important crops, such as tomato, and investigates its function under differing light conditions. Overall, the presented thesis combines different strategies in unveiling the genetic components in multi-omic traits under drought using conventional breeding populations as well as a diverse global population. To this end, it allows a comparison of either approach and highlights their strengths and weaknesses.
Cells are built from a variety of macromolecules and metabolites. Both, the proteome and the metabolome are highly dynamic and responsive to environmental cues and developmental processes. But it is not their bare numbers, but their interactions that enable life. The protein-protein (PPI) and protein-metabolite interactions (PMI) facilitate and regulate all aspects of cell biology, from metabolism to mitosis. Therefore, the study of PPIs and PMIs and their dynamics in a cell-wide context is of great scientific interest. In this dissertation, I aim to chart a map of the dynamic PPIs and PMIs across metabolic and cellular transitions. As a model system, I study the shift from the fermentative to the respiratory growth, known as the diauxic shift, in the budding yeast Saccharomyces cerevisiae. To do so, I am applying a co-fractionation mass spectrometry (CF-MS) based method, dubbed protein metabolite interactions using size separation (PROMIS). PROMIS, as well as comparable methods, will be discussed in detail in chapter 1.
Since PROMIS was developed originally for Arabidopsis thaliana, in chapter 2, I will describe the adaptation of PROMIS to S. cerevisiae. Here, the obtained results demonstrated a wealth of protein-metabolite interactions, and experimentally validated 225 previously predicted PMIs. Applying orthogonal, targeted approaches to validate the interactions of a proteogenic dipeptide, Ser-Leu, five novel protein-interactors were found. One of those proteins, phosphoglycerate kinase, is inhibited by Ser-Leu, placing the dipeptide at the regulation of glycolysis.
In chapter 3, I am presenting PROMISed, a novel web-tool designed for the analysis of PROMIS- and other CF-MS-datasets. Starting with raw fractionation profiles, PROMISed enables data pre-processing, profile deconvolution, scores differences in fractionation profiles between experimental conditions, and ultimately charts interaction networks. PROMISed comes with a user-friendly graphic interface, and thus enables the routine analysis of CF-MS data by non-computational biologists.
Finally, in chapter 4, I applied PROMIS in combination with the isothermal shift assay to the diauxic shift in S. cerevisiae to study changes in the PPI and PMI landscape across this metabolic transition. I found a major rewiring of protein-protein-metabolite complexes, exemplified by the disassembly of the proteasome in the respiratory phase, the loss of interaction of an enzyme involved in amino acid biosynthesis and its cofactor, as well as phase and structure specific interactions between dipeptides and enzymes of central carbon metabolism.
In chapter 5, I am summarizing the presented results, and discuss a strategy to unravel the potential patterns of dipeptide accumulation and binding specificities. Lastly, I recapitulate recently postulated guidelines for CF-MS experiments, and give an outlook of protein interaction studies in the near future.
Omics and male infertility
(2022)
Male infertility is a multifaceted disorder affecting approximately 50% of male partners in infertile couples.
Over the years, male infertility has been diagnosed mainly through semen analysis, hormone evaluations, medical records and physical examinations, which of course are fundamental, but yet inefficient, because 30% of male infertility cases remain idiopathic. This dilemmatic status of the unknown needs to be addressed with more sophisticated and result-driven technologies and/or techniques.
Genetic alterations have been linked with male infertility, thereby unveiling the practicality of investigating this disorder from the "omics" perspective.
Omics aims at analyzing the structure and functions of a whole constituent of a given biological function at different levels, including the molecular gene level (genomics), transcript level (transcriptomics), protein level (proteomics) and metabolites level (metabolomics). In the current study, an overview of the four branches of omics and their roles in male infertility are briefly discussed; the potential usefulness of assessing transcriptomic data to understand this pathology is also elucidated.
After assessing the publicly obtainable transcriptomic data for datasets on male infertility, a total of 1385 datasets were retrieved, of which 10 datasets met the inclusion criteria and were used for further analysis.
These datasets were classified into groups according to the disease or cause of male infertility.
The groups include non-obstructive azoospermia (NOA), obstructive azoospermia (OA), non-obstructive and obstructive azoospermia (NOA and OA), spermatogenic dysfunction, sperm dysfunction, and Y chromosome microdeletion.
Findings revealed that 8 genes (LDHC, PDHA2, TNP1, TNP2, ODF1, ODF2, SPINK2, PCDHB3) were commonly differentially expressed between all disease groups.
Likewise, 56 genes were common between NOA versus NOA and OA (ADAD1, BANF2, BCL2L14, C12orf50, C20orf173, C22orf23, C6orf99, C9orf131, C9orf24, CABS1, CAPZA3, CCDC187, CCDC54, CDKN3, CEP170, CFAP206, CRISP2, CT83, CXorf65, FAM209A, FAM71F1, FAM81B, GALNTL5, GTSF1, H1FNT, HEMGN, HMGB4, KIF2B, LDHC, LOC441601, LYZL2, ODF1, ODF2, PCDHB3, PDHA2, PGK2, PIH1D2, PLCZ1, PROCA1, RIMBP3, ROPN1L, SHCBP1L, SMCP, SPATA16, SPATA19, SPINK2, TEX33, TKTL2, TMCO2, TMCO5A, TNP1, TNP2, TSPAN16, TSSK1B, TTLL2, UBQLN3).
These genes, particularly the above-mentioned 8 genes, are involved in diverse biological processes such as germ cell development, spermatid development, spermatid differentiation, regulation of proteolysis, spermatogenesis and metabolic processes.
Owing to the stage-specific expression of these genes, any mal-expression can ultimately lead to male infertility.
Therefore, currently available data on all branches of omics relating to male fertility can be used to identify biomarkers for diagnosing male infertility, which can potentially help in unravelling some idiopathic cases.
Due to global climate change providing food security for an increasing world population is a big challenge. Especially abiotic stressors have a strong negative effect on crop yield. To develop climate-adapted crops a comprehensive understanding of molecular alterations in the response of varying levels of environmental stresses is required. High throughput or ‘omics’ technologies can help to identify key-regulators and pathways of abiotic stress responses. In addition to obtain omics data also tools and statistical analyses need to be designed and evaluated to get reliable biological results.
To address these issues, I have conducted three different studies covering two omics technologies. In the first study, I used transcriptomic data from the two polymorphic Arabidopsis thaliana accessions, namely Col-0 and N14, to evaluate seven computational tools for their ability to map and quantify Illumina single-end reads. Between 92% and 99% of the reads were mapped against the reference sequence. The raw count distributions obtained from the different tools were highly correlated. Performing a differential gene expression analysis between plants exposed to 20 °C or 4°C (cold acclimation), a large pairwise overlap between the mappers was obtained. In the second study, I obtained transcript data from ten different Oryza sativa (rice) cultivars by PacBio Isoform sequencing that can capture full-length transcripts. De novo reference transcriptomes were reconstructed resulting in 38,900 to 54,500 high-quality isoforms per cultivar. Isoforms were collapsed to reduce sequence redundancy and evaluated, e.g. for protein completeness level (BUSCO), transcript length, and number of unique transcripts per gene loci. For the heat and drought tolerant aus cultivar N22, I identified around 650 unique and novel transcripts of which 56 were significantly differentially expressed in developing seeds during combined drought and heat stress. In the last study, I measured and analyzed the changes in metabolite profiles of eight rice cultivars exposed to high night temperature (HNT) stress and grown during the dry and wet season on the field in the Philippines. Season-specific changes in metabolite levels, as well as for agronomic parameters, were identified and metabolic pathways causing a yield decline at HNT conditions suggested.
In conclusion, the comparison of mapper performances can help plant scientists to decide on the right tool for their data. The de novo reconstruction of rice cultivars without a genome sequence provides a targeted, cost-efficient approach to identify novel genes responding to stress conditions for any organism. With the metabolomics approach for HNT stress in rice, I identified stress and season-specific metabolites which might be used as molecular markers for crop improvement in the future.
We recently demonstrated that the sympathetic nervous system can be voluntarily activated following a training program consisting of cold exposure, breathing exercises, and meditation. This resulted in profound attenuation of the systemic inflammatory response elicited by lipopolysaccharide (LPS) administration. Herein, we assessed whether this training program affects the plasma metabolome and if these changes are linked to the immunomodulatory effects observed. A total of 224 metabolites were identified in plasma obtained from 24 healthy male volunteers at six timepoints, of which 98 were significantly altered following LPS administration. Effects of the training program were most prominent shortly after initiation of the acquired breathing exercises but prior to LPS administration, and point towards increased activation of the Cori cycle. Elevated concentrations of lactate and pyruvate in trained individuals correlated with enhanced levels of anti-inflammatory interleukin (IL)-10. In vitro validation experiments revealed that co-incubation with lactate and pyruvate enhances IL-10 production and attenuates the release of pro-inflammatory IL-1 beta and IL-6 by LPS-stimulated leukocytes. Our results demonstrate that practicing the breathing exercises acquired during the training program results in increased activity of the Cori cycle. Furthermore, this work uncovers an important role of lactate and pyruvate in the anti-inflammatory phenotype observed in trained subjects.
We recently demonstrated that the sympathetic nervous system can be voluntarily activated following a training program consisting of cold exposure, breathing exercises, and meditation. This resulted in profound attenuation of the systemic inflammatory response elicited by lipopolysaccharide (LPS) administration. Herein, we assessed whether this training program affects the plasma metabolome and if these changes are linked to the immunomodulatory effects observed. A total of 224 metabolites were identified in plasma obtained from 24 healthy male volunteers at six timepoints, of which 98 were significantly altered following LPS administration. Effects of the training program were most prominent shortly after initiation of the acquired breathing exercises but prior to LPS administration, and point towards increased activation of the Cori cycle. Elevated concentrations of lactate and pyruvate in trained individuals correlated with enhanced levels of anti-inflammatory interleukin (IL)-10. In vitro validation experiments revealed that co-incubation with lactate and pyruvate enhances IL-10 production and attenuates the release of pro-inflammatory IL-1 beta and IL-6 by LPS-stimulated leukocytes. Our results demonstrate that practicing the breathing exercises acquired during the training program results in increased activity of the Cori cycle. Furthermore, this work uncovers an important role of lactate and pyruvate in the anti-inflammatory phenotype observed in trained subjects.
Neuroinflammatory and neurodegenerative diseases such as Parkinson's (PD) and multiple sclerosis (MS) often result in a severe impairment of the patient´s quality of life. Effective therapies for the treatment are currently not available, which results in a high socio-economic burden. Due to the heterogeneity of the disease subtypes, stratification is particularly difficult in the early phase of the disease and is mainly based on clinical parameters such as neurophysiological tests and central nervous imaging. Due to good accessibility and stability, blood and cerebrospinal fluid metabolite markers could serve as surrogates for neurodegenerative processes. This can lead to an improved mechanistic understanding of these diseases and further be used as "treatment response" biomarkers in preclinical and clinical development programs. Therefore, plasma and CSF metabolite profiles will be identified that allow differentiation of PD from healthy controls, association of PD with dementia (PDD) and differentiation of PD subtypes such as akinetic rigid and tremor dominant PD patients. In addition, plasma metabolites for the diagnosis of primary progressive MS (PPMS) should be investigated and tested for their specificity to relapsing-remitting MS (RRMS) and their development during PPMS progression.
By applying untargeted high-resolution metabolomics of PD patient samples and in using random forest and partial least square machine learning algorithms, this study identified 20 plasma metabolites and 14 CSF metabolite biomarkers. These differentiate against healthy individuals with an AUC of 0.8 and 0.9 in PD, respectively. We also identify ten PDD specific serum metabolites, which differentiate against healthy individuals and PD patients without dementia with an AUC of 1.0, respectively. Furthermore, 23 akinetic-rigid specific plasma markers were identified, which differentiate against tremor-dominant PD patients with an AUC of 0.94 and against healthy individuals with an AUC of 0.98. These findings also suggest more severe disease pathology in the akinetic-rigid PD than in tremor dominant PD. In the analysis of MS patient samples a partial least square analysis yielded predictive models for the classification of PPMS and resulted in 20 PPMS specific metabolites. In another MS study unknown changes in human metabolism were identified after administration of the multiple sclerosis drug dimethylfumarate, which is used for the treatment of RRMS. These results allow to describe and understand the hitherto completely unknown mechanism of action of this new drug and to use these findings for the further development of new drugs and targets against RRMS.
In conclusion, these results have the potential for improved diagnosis of these diseases and improvement of mechanistic understandings, as multiple deregulated pathways were identified. Moreover, novel Dimethylfumarate targets can be used to aid drug development and treatment efficiency. Overall, metabolite profiling in combination with machine learning identified as a promising approach for biomarker discovery and mode of action elucidation.
Background/Aims: Gestational diabetes (GDM) might be associated with alterations in the metabolomic profile of affected mothers and their offspring. Until now, there is a paucity of studies that investigated both, the maternal and the fetal serum metabolome in the setting of GDM. Mounting evidence suggests that the fetus is not just passively affected by gestational disease but might play an active role in it. Metabolomic studies performed in maternal blood and fetal cord blood could help to better discern distinct fetal from maternal disease interactions. Methods: At the time of birth, serum samples from mothers and newborns (cord blood samples) were collected and screened for 163 metabolites utilizing tandem mass spectrometry. The cohort consisted of 412 mother/child pairs, including 31 cases of maternal GDM. Results: An initial non-adjusted analysis showed that eight metabolites in the maternal blood and 54 metabolites in the cord blood were associated with GDM. After Benjamini-Hochberg (BH) procedure and adjustment for confounding factors for GDM, fetal phosphatidylcholine acyl-alkyl C 32:1 and proline still showed an independent association with GDM. Conclusions: This study found metabolites in cord blood which were associated with GDM, even after adjustment for established risk factors of GDM. To the best of our knowledge, this is the first study demonstrating an independent association between fetal serum metabolites and maternal GDM. Our findings might suggest a potential effect of the fetal metabolome on maternal GDM. (c) 2018 The Author(s) Published by S. Karger AG, Basel
Background/Aims: Impaired birth outcomes, like low birth weight, have consistently been associated with increased disease susceptibility to hypertension in later life. Alterations in the maternal or fetal metabolism might impact on fetal growth and influence birth outcomes. Discerning associations between the maternal and fetal metabolome and surrogate parameters of fetal growth could give new insight into the complex relationship between intrauterine conditions, birth outcomes, and later life disease susceptibility. Methods: Using flow injection tandem mass spectrometry, targeted metabolomics was performed in serum samples obtained from 226 mother/child pairs at delivery. Associations between neonatal birth weight and concentrations of 163 maternal and fetal metabolites were analyzed. Results: After FDR adjustment using the Benjamini-Hochberg procedure lysophosphatidylcholines (LPC) 14:0, 16:1, and 18:1 were strongly positively correlated with birth weight. In a stepwise linear regression model corrected for established confounding factors of birth weight, LPC 16: 1 showed the strongest independent association with birth weight (CI: 93.63 - 168.94; P = 6.94x10(-11)). The association with birth weight was stronger than classical confounding factors such as offspring sex (CI: - 258.81- -61.32; P = 0.002) and maternal smoking during pregnancy (CI: -298.74 - -29.51; P = 0.017). Conclusions: After correction for multiple testing and adjustment for potential confounders, LPC 16:1 showed a very strong and independent association with birth weight. The underlying molecular mechanisms linking fetal LPCs with birth weight need to be addressed in future studies. (c) 2018 The Author(s) Published by S. Karger AG, Basel
The availability of high-throughput data from transcriptomics and metabolomics technologies provides the opportunity to characterize the transcriptional effects on metabolism. Here we propose and evaluate two computational approaches rooted in data reduction techniques to identify and categorize transcriptional effects on metabolism by combining data on gene expression and metabolite levels. The approaches determine the partial correlation between two metabolite data profiles upon control of given principal components extracted from transcriptomics data profiles. Therefore, they allow us to investigate both data types with all features simultaneously without doing preselection of genes. The proposed approaches allow us to categorize the relation between pairs of metabolites as being under transcriptional or post-transcriptional regulation. The resulting classification is compared to existing literature and accumulated evidence about regulatory mechanism of reactions and pathways in the cases of Escherichia coil, Saccharomycies cerevisiae, and Arabidopsis thaliana.
Systems biology aims at investigating biological systems in its entirety by gathering and analyzing large-scale data sets about the underlying components. Computational systems biology approaches use these large-scale data sets to create models at different scales and cellular levels. In addition, it is concerned with generating and testing hypotheses about biological processes. However, such approaches are inevitably leading to computational challenges due to the high dimensionality of the data and the differences in the dimension of data from different cellular layers.
This thesis focuses on the investigation and development of computational approaches to analyze metabolite profiles in the context of cellular networks. This leads to determining what aspects of the network functionality are reflected in the metabolite levels. With these methods at hand, this thesis aims to answer three questions: (1) how observability of biological systems is manifested in metabolite profiles and if it can be used for phenotypical comparisons; (2) how to identify couplings of reaction rates from metabolic profiles alone; and (3) which regulatory mechanism that affect metabolite levels can be distinguished by integrating transcriptomics and metabolomics read-outs.
I showed that sensor metabolites, identified by an approach from observability theory, are more correlated to each other than non-sensors. The greater correlations between sensor metabolites were detected both with publicly available metabolite profiles and synthetic data simulated from a medium-scale kinetic model. I demonstrated through robustness analysis that correlation was due to the position of the sensor metabolites in the network and persisted irrespectively of the experimental conditions. Sensor metabolites are therefore potential candidates for phenotypical comparisons between conditions through targeted metabolic analysis.
Furthermore, I demonstrated that the coupling of metabolic reaction rates can be investigated from a purely data-driven perspective, assuming that metabolic reactions can be described by mass action kinetics. Employing metabolite profiles from domesticated and wild wheat and tomato species, I showed that the process of domestication is associated with a loss of regulatory control on the level of reaction rate coupling. I also found that the same metabolic pathways in Arabidopsis thaliana and Escherichia coli exhibit differences in the number of reaction rate couplings.
I designed a novel method for the identification and categorization of transcriptional effects on metabolism by combining data on gene expression and metabolite levels. The approach determines the partial correlation of metabolites with control by the principal components of the transcript levels. The principle components contain the majority of the transcriptomic information allowing to partial out the effect of the transcriptional layer from the metabolite profiles. Depending whether the correlation between metabolites persists upon controlling for the effect of the transcriptional layer, the approach allows us to group metabolite pairs into being associated due to post-transcriptional or transcriptional regulation, respectively. I showed that the classification of metabolite pairs into those that are associated due to transcriptional or post-transcriptional regulation are in agreement with existing literature and findings from a Bayesian inference approach.
The approaches developed, implemented, and investigated in this thesis open novel ways to jointly study metabolomics and transcriptomics data as well as to place metabolic profiles in the network context. The results from these approaches have the potential to provide further insights into the regulatory machinery in a biological system.
Metabolites and lipids are the final products of enzymatic processes, distinguishing the different cellular functions and activities of single cells or whole tissues. Understanding these cellular functions within a well-established model system requires a systemic collection of molecular and physiological information. In the current report, the green alga Chlamydomonas reinhardtii was selected to establish a comprehensive workflow for the detailed multi-omics analysis of a synchronously growing cell culture system. After implementation and benchmarking of the synchronous cell culture, a two-phase extraction method was adopted for the analysis of proteins, lipids, metabolites and starch from a single sample aliquot of as little as 10-15million Chlamydomonas cells. In a proof of concept study, primary metabolites and lipids were sampled throughout the diurnal cell cycle. The results of these time-resolved measurements showed that single compounds were not only coordinated with each other in different pathways, but that these complex metabolic signatures have the potential to be used as biomarkers of various cellular processes. Taken together, the developed workflow, including the synchronized growth of the photoautotrophic cell culture, in combination with comprehensive extraction methods and detailed metabolic phenotyping has the potential for use in in-depth analysis of complex cellular processes, providing essential information for the understanding of complex biological systems.
Background: Consumption of whole-grain, coffee, and red meat were consistently related to the risk of developing type 2 diabetes in prospective cohort studies, but potentially underlying biological mechanisms are not well understood. Metabolomics profiles were shown to be sensitive to these dietary exposures, and at the same time to be informative with respect to the risk of type 2 diabetes. Moreover, graphical network-models were demonstrated to reflect the biological processes underlying high-dimensional metabolomics profiles.
Aim: The aim of this study was to infer hypotheses on the biological mechanisms that link consumption of whole-grain bread, coffee, and red meat, respectively, to the risk of developing type 2 diabetes. More specifically, it was aimed to consider network models of amino acid and lipid profiles as potential mediators of these risk-relations.
Study population: Analyses were conducted in the prospective EPIC-Potsdam cohort (n = 27,548), applying a nested case-cohort design (n = 2731, including 692 incident diabetes cases). Habitual diet was assessed with validated semiquantitative food-frequency questionnaires. Concentrations of 126 metabolites (acylcarnitines, phosphatidylcholines, sphingomyelins, amino acids) were determined in baseline-serum samples. Incident type 2 diabetes cases were assed and validated in an active follow-up procedure. The median follow-up time was 6.6 years.
Analytical design: The methodological approach was conceptually based on counterfactual causal inference theory. Observations on the network-encoded conditional independence structure restricted the space of possible causal explanations of observed metabolomics-data patterns. Given basic directionality assumptions (diet affects metabolism; metabolism affects future diabetes incidence), adjustment for a subset of direct neighbours was sufficient to consistently estimate network-independent direct effects. Further model-specification, however, was limited due to missing directionality information on the links between metabolites. Therefore, a multi-model approach was applied to infer the bounds of possible direct effects. All metabolite-exposure links and metabolite-outcome links, respectively, were classified into one of three categories: direct effect, ambiguous (some models indicated an effect others not), and no-effect.
Cross-sectional and longitudinal relations were evaluated in multivariable-adjusted linear regression and Cox proportional hazard regression models, respectively. Models were comprehensively adjusted for age, sex, body mass index, prevalence of hypertension, dietary and lifestyle factors, and medication.
Results: Consumption of whole-grain bread was related to lower levels of several lipid metabolites with saturated and monounsaturated fatty acids. Coffee was related to lower aromatic and branched-chain amino acids, and had potential effects on the fatty acid profile within lipid classes. Red meat was linked to lower glycine levels and was related to higher circulating concentrations of branched-chain amino acids. In addition, potential marked effects of red meat consumption on the fatty acid composition within the investigated lipid classes were identified.
Moreover, potential beneficial and adverse direct effects of metabolites on type 2 diabetes risk were detected. Aromatic amino acids and lipid metabolites with even-chain saturated (C14-C18) and with specific polyunsaturated fatty acids had adverse effects on type 2 diabetes risk. Glycine, glutamine, and lipid metabolites with monounsaturated fatty acids and with other species of polyunsaturated fatty acids were classified as having direct beneficial effects on type 2 diabetes risk.
Potential mediators of the diet-diabetes links were identified by graphically overlaying this information in network models. Mediation analyses revealed that effects on lipid metabolites could potentially explain about one fourth of the whole-grain bread effect on type 2 diabetes risk; and that effects of coffee and red meat consumption on amino acid and lipid profiles could potentially explain about two thirds of the altered type 2 diabetes risk linked to these dietary exposures.
Conclusion: An algorithm was developed that is capable to integrate single external variables (continuous exposures, survival time) and high-dimensional metabolomics-data in a joint graphical model. Application to the EPIC-Potsdam cohort study revealed that the observed conditional independence patterns were consistent with the a priori mediation hypothesis: Early effects on lipid and amino acid metabolism had the potential to explain large parts of the link between three of the most widely discussed diabetes-related dietary exposures and the risk of developing type 2 diabetes.
Background:
First metabolomics studies have indicated that metabolic fingerprints from accessible tissues might
be useful to better understand the etiological links between metabolism and cancer. However, there is still a lack
of prospective metabolomics studies on pre-diagnostic metabolic alterations and cancer risk.
Methods:
Associations between pre-diagnostic levels of 120 circulating metabolites (acylcarnitines, amino acids,
biogenic amines, phosphatidylcholines, sphingolipids, and hexoses) and the risks of breast, prostate, and colorectal
cancer were evaluated by Cox regression analyses using data of a prospective case-cohort study including 835
incident cancer cases.
Results:
The median follow-up duration was 8.3 years among non-cases and 6.5 years among incident cases of
cancer. Higher levels of lysophosphatidylcholines (lysoPCs), and especially lysoPC a C18:0, were consistently related
to lower risks of breast, prostate, and colorectal cancer, independent of background factors. In contrast, higher
levels of phosphatidylcholine PC ae C30:0 were associated with increased cancer risk. There was no heterogeneity
in the observed associations by lag time between blood draw and cancer diagnosis.
Conclusion:
Changes in blood lipid composition precede the diagnosis of common malignancies by several years.
Considering the consistency of the present results across three cancer types the observed alterations point to a
global metabolic shift in phosphatidylcholine metabolism that may drive tumorigenesis.
Continuing advances in 'omics methodologies and instrumentation is enhancing the understanding of how plants cope with the dynamic nature of their growing environment. 'Omics platforms have been only recently extended to cover horticultural crop species. Many of the most widely cultivated vegetable crops belong to the genus Brassica: these include plants grown for their root (turnip, rutabaga/swede), their swollen stem base (kohlrabi), their leaves (cabbage, kale, pak choi) and their inflorescence (cauliflower, broccoli). Characterization at the genome, transcript, protein and metabolite levels has illustrated the complexity of the cellular response to a whole series of environmental stresses, including nutrient deficiency, pathogen attack, heavy metal toxicity, cold acclimation, and excessive and sub optimal irradiation. This review covers recent applications of omics technologies to the brassicaceous vegetables, and discusses future scenarios in achieving improvements in crop end-use quality.
Continuing advances in 'omics methodologies and instrumentation is enhancing the understanding of how plants cope with the dynamic nature of their growing environment. 'Omics platforms have been only recently extended to cover horticultural crop species. Many of the most widely cultivated vegetable crops belong to the genus Brassica: these include plants grown for their root (turnip, rutabaga/swede), their swollen stem base (kohlrabi), their leaves (cabbage, kale, pak choi) and their inflorescence (cauliflower, broccoli). Characterization at the genome, transcript, protein and metabolite levels has illustrated the complexity of the cellular response to a whole series of environmental stresses, including nutrient deficiency, pathogen attack, heavy metal toxicity, cold acclimation, and excessive and sub optimal irradiation. This review covers recent applications of omics technologies to the brassicaceous vegetables, and discusses future scenarios in achieving improvements in crop end-use quality.
Leaf senescence is a developmentally controlled process, which is additionally modulated by a number of adverse environmental conditions. Nitrogen shortage is a well-known trigger of precocious senescence in many plant species including crops, generally limiting biomass and seed yield. However, leaf senescence induced by nitrogen starvation may be reversed when nitrogen is resupplied at the onset of senescence. Here, the transcriptomic, hormonal, and global metabolic rearrangements occurring during nitrogen resupply-induced reversal of senescence in Arabidopsis thaliana were analysed. The changes induced by senescence were essentially in keeping with those previously described; however, these could, by and large, be reversed. The data thus indicate that plants undergoing senescence retain the capacity to sense and respond to the availability of nitrogen nutrition. The combined data are discussed in the context of the reversibility of the senescence programme and the evolutionary benefit afforded thereby. Future prospects for understanding and manipulating this process in both Arabidopsis and crop plants are postulated.
Background
High blood glucose and diabetes are amongst the conditions causing the greatest losses in years of healthy life worldwide. Therefore, numerous studies aim to identify reliable risk markers for development of impaired glucose metabolism and type 2 diabetes. However, the molecular basis of impaired glucose metabolism is so far insufficiently understood. The development of so called 'omics' approaches in the recent years promises to identify molecular markers and to further understand the molecular basis of impaired glucose metabolism and type 2 diabetes. Although univariate statistical approaches are often applied, we demonstrate here that the application of multivariate statistical approaches is highly recommended to fully capture the complexity of data gained using high-throughput methods.
Methods
We took blood plasma samples from 172 subjects who participated in the prospective Metabolic Syndrome Berlin Potsdam follow-up study (MESY-BEPO Follow-up). We analysed these samples using Gas Chromatography coupled with Mass Spectrometry (GC-MS), and measured 286 metabolites. Furthermore, fasting glucose levels were measured using standard methods at baseline, and after an average of six years. We did correlation analysis and built linear regression models as well as Random Forest regression models to identify metabolites that predict the development of fasting glucose in our cohort.
Results
We found a metabolic pattern consisting of nine metabolites that predicted fasting glucose development with an accuracy of 0.47 in tenfold cross-validation using Random Forest regression. We also showed that adding established risk markers did not improve the model accuracy. However, external validation is eventually desirable. Although not all metabolites belonging to the final pattern are identified yet, the pattern directs attention to amino acid metabolism, energy metabolism and redox homeostasis.
Conclusions
We demonstrate that metabolites identified using a high-throughput method (GC-MS) perform well in predicting the development of fasting plasma glucose over several years. Notably, not single, but a complex pattern of metabolites propels the prediction and therefore reflects the complexity of the underlying molecular mechanisms. This result could only be captured by application of multivariate statistical approaches. Therefore, we highly recommend the usage of statistical methods that seize the complexity of the information given by high-throughput methods.