Refine
Year of publication
- 2018 (2) (remove)
Document Type
- Doctoral Thesis (2) (remove)
Language
- English (2)
Is part of the Bibliography
- yes (2)
Keywords
- metabolomics (2) (remove)
Institute
Systems biology aims at investigating biological systems in its entirety by gathering and analyzing large-scale data sets about the underlying components. Computational systems biology approaches use these large-scale data sets to create models at different scales and cellular levels. In addition, it is concerned with generating and testing hypotheses about biological processes. However, such approaches are inevitably leading to computational challenges due to the high dimensionality of the data and the differences in the dimension of data from different cellular layers.
This thesis focuses on the investigation and development of computational approaches to analyze metabolite profiles in the context of cellular networks. This leads to determining what aspects of the network functionality are reflected in the metabolite levels. With these methods at hand, this thesis aims to answer three questions: (1) how observability of biological systems is manifested in metabolite profiles and if it can be used for phenotypical comparisons; (2) how to identify couplings of reaction rates from metabolic profiles alone; and (3) which regulatory mechanism that affect metabolite levels can be distinguished by integrating transcriptomics and metabolomics read-outs.
I showed that sensor metabolites, identified by an approach from observability theory, are more correlated to each other than non-sensors. The greater correlations between sensor metabolites were detected both with publicly available metabolite profiles and synthetic data simulated from a medium-scale kinetic model. I demonstrated through robustness analysis that correlation was due to the position of the sensor metabolites in the network and persisted irrespectively of the experimental conditions. Sensor metabolites are therefore potential candidates for phenotypical comparisons between conditions through targeted metabolic analysis.
Furthermore, I demonstrated that the coupling of metabolic reaction rates can be investigated from a purely data-driven perspective, assuming that metabolic reactions can be described by mass action kinetics. Employing metabolite profiles from domesticated and wild wheat and tomato species, I showed that the process of domestication is associated with a loss of regulatory control on the level of reaction rate coupling. I also found that the same metabolic pathways in Arabidopsis thaliana and Escherichia coli exhibit differences in the number of reaction rate couplings.
I designed a novel method for the identification and categorization of transcriptional effects on metabolism by combining data on gene expression and metabolite levels. The approach determines the partial correlation of metabolites with control by the principal components of the transcript levels. The principle components contain the majority of the transcriptomic information allowing to partial out the effect of the transcriptional layer from the metabolite profiles. Depending whether the correlation between metabolites persists upon controlling for the effect of the transcriptional layer, the approach allows us to group metabolite pairs into being associated due to post-transcriptional or transcriptional regulation, respectively. I showed that the classification of metabolite pairs into those that are associated due to transcriptional or post-transcriptional regulation are in agreement with existing literature and findings from a Bayesian inference approach.
The approaches developed, implemented, and investigated in this thesis open novel ways to jointly study metabolomics and transcriptomics data as well as to place metabolic profiles in the network context. The results from these approaches have the potential to provide further insights into the regulatory machinery in a biological system.
Neuroinflammatory and neurodegenerative diseases such as Parkinson's (PD) and multiple sclerosis (MS) often result in a severe impairment of the patient´s quality of life. Effective therapies for the treatment are currently not available, which results in a high socio-economic burden. Due to the heterogeneity of the disease subtypes, stratification is particularly difficult in the early phase of the disease and is mainly based on clinical parameters such as neurophysiological tests and central nervous imaging. Due to good accessibility and stability, blood and cerebrospinal fluid metabolite markers could serve as surrogates for neurodegenerative processes. This can lead to an improved mechanistic understanding of these diseases and further be used as "treatment response" biomarkers in preclinical and clinical development programs. Therefore, plasma and CSF metabolite profiles will be identified that allow differentiation of PD from healthy controls, association of PD with dementia (PDD) and differentiation of PD subtypes such as akinetic rigid and tremor dominant PD patients. In addition, plasma metabolites for the diagnosis of primary progressive MS (PPMS) should be investigated and tested for their specificity to relapsing-remitting MS (RRMS) and their development during PPMS progression.
By applying untargeted high-resolution metabolomics of PD patient samples and in using random forest and partial least square machine learning algorithms, this study identified 20 plasma metabolites and 14 CSF metabolite biomarkers. These differentiate against healthy individuals with an AUC of 0.8 and 0.9 in PD, respectively. We also identify ten PDD specific serum metabolites, which differentiate against healthy individuals and PD patients without dementia with an AUC of 1.0, respectively. Furthermore, 23 akinetic-rigid specific plasma markers were identified, which differentiate against tremor-dominant PD patients with an AUC of 0.94 and against healthy individuals with an AUC of 0.98. These findings also suggest more severe disease pathology in the akinetic-rigid PD than in tremor dominant PD. In the analysis of MS patient samples a partial least square analysis yielded predictive models for the classification of PPMS and resulted in 20 PPMS specific metabolites. In another MS study unknown changes in human metabolism were identified after administration of the multiple sclerosis drug dimethylfumarate, which is used for the treatment of RRMS. These results allow to describe and understand the hitherto completely unknown mechanism of action of this new drug and to use these findings for the further development of new drugs and targets against RRMS.
In conclusion, these results have the potential for improved diagnosis of these diseases and improvement of mechanistic understandings, as multiple deregulated pathways were identified. Moreover, novel Dimethylfumarate targets can be used to aid drug development and treatment efficiency. Overall, metabolite profiling in combination with machine learning identified as a promising approach for biomarker discovery and mode of action elucidation.