Manuela Hische, Abdelhalim Larhlimi, Franziska Schwarz, Antje Fischer-Rosinský, Thomas Bobbert, Anke Assmann, Gareth S. Catchpole, Andreas F. H. Pfeiffer, Lothar Willmitzer, Joachim Selbig, Joachim Spranger
Background
High blood glucose and diabetes are among the conditions causing the greatest losses in years of healthy life worldwide. Therefore, numerous studies aim to identify reliable risk markers for the development of impaired glucose metabolism and type 2 diabetes. However, the molecular basis of impaired glucose metabolism is still insufficiently understood. The development of so-called 'omics' approaches in recent years promises to identify molecular markers and to improve understanding of the molecular basis of impaired glucose metabolism and type 2 diabetes. Although univariate statistical approaches are often applied, we demonstrate here that multivariate statistical approaches are highly recommended to fully capture the complexity of data obtained with high-throughput methods.
Methods
We took blood plasma samples from 172 subjects who participated in the prospective Metabolic Syndrome Berlin Potsdam follow-up study (MESY-BEPO Follow-up). We analysed these samples using gas chromatography coupled with mass spectrometry (GC-MS) and measured 286 metabolites. Furthermore, fasting glucose levels were measured using standard methods at baseline and after an average of six years. We performed correlation analysis and built linear regression models as well as Random Forest regression models to identify metabolites that predict the development of fasting glucose in our cohort.
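The workflow described in the Methods (screen a metabolite by correlation, fit a regression model, judge it by tenfold cross-validation) can be illustrated with a minimal sketch. This is not the authors' code: the data are synthetic, the names `metabolite` and `glucose_change` are hypothetical, a one-variable least-squares line stands in for the Random Forest regression used in the study, and scoring by the correlation between held-out predictions and observed values is an assumption, since the abstract does not define its accuracy measure.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def fit_line(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    slope = sxy / sxx
    return slope, my - slope * mx


def kfold_indices(n, k, seed=0):
    """Shuffle the indices 0..n-1 and split them into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def cross_validated_predictions(x, y, k=10, seed=0):
    """Predict every sample from a model fitted without that sample's fold."""
    preds = [None] * len(x)
    for fold in kfold_indices(len(x), k, seed):
        held_out = set(fold)
        train_x = [x[i] for i in range(len(x)) if i not in held_out]
        train_y = [y[i] for i in range(len(y)) if i not in held_out]
        slope, intercept = fit_line(train_x, train_y)
        for i in fold:
            preds[i] = slope * x[i] + intercept
    return preds


# Synthetic stand-in data: 172 subjects, one metabolite, one outcome.
rng = random.Random(42)
n_subjects = 172
metabolite = [rng.gauss(0.0, 1.0) for _ in range(n_subjects)]
glucose_change = [0.5 * m + rng.gauss(0.0, 1.0) for m in metabolite]

r = pearson(metabolite, glucose_change)              # univariate screening
preds = cross_validated_predictions(metabolite, glucose_change, k=10)
cv_r = pearson(preds, glucose_change)                # held-out performance
print(f"univariate r = {r:.2f}, cross-validated r = {cv_r:.2f}")
```

Because each prediction comes from a model that never saw the corresponding subject, the cross-validated score estimates out-of-sample performance rather than goodness of fit on the training data.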
Results
We found a metabolic pattern consisting of nine metabolites that predicted fasting glucose development with an accuracy of 0.47 in tenfold cross-validation using Random Forest regression. We also showed that adding established risk markers did not improve the model accuracy. External validation nevertheless remains desirable. Although not all metabolites belonging to the final pattern have been identified yet, the pattern directs attention to amino acid metabolism, energy metabolism and redox homeostasis.
Conclusions
We demonstrate that metabolites identified using a high-throughput method (GC-MS) perform well in predicting the development of fasting plasma glucose over several years. Notably, not a single metabolite but a complex pattern of metabolites drives the prediction and therefore reflects the complexity of the underlying molecular mechanisms. This result could only be captured by applying multivariate statistical approaches. Therefore, we highly recommend the use of statistical methods that capture the complexity of the information provided by high-throughput methods.
Metadata
Author details: | Manuela Hische, Abdelhalim Larhlimi, Franziska Schwarz, Antje Fischer-Rosinský, Thomas Bobbert, Anke Assmann, Gareth S. Catchpole, Andreas F. H. Pfeiffer, Lothar Willmitzer, Joachim Selbig, Joachim Spranger |
URN: | urn:nbn:de:kobv:517-opus4-427400 |
DOI: | https://doi.org/10.25932/publishup-42740 |
ISSN: | 1866-8372 |
Title of parent work (German): | Postprints der Universität Potsdam : Mathematisch Naturwissenschaftliche Reihe |
Publication series (Volume number): | Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe (850) |
Publication type: | Postprint |
Language: | English |
Date of first publication: | 2020/03/16 |
Publication year: | 2012 |
Publishing institution: | Universität Potsdam |
Release date: | 2020/03/16 |
Tag: | biomarker; fasting glucose; metabolite; metabolomics; plasma; prediction; random forest; regression; type 2 diabetes |
Issue: | 850 |
Number of pages: | 12 |
Source: | Journal of Clinical Bioinformatics 2 (2012) 3 DOI: 10.1186/2043-9113-2-3 |
Organizational units: | Mathematisch-Naturwissenschaftliche Fakultät |
DDC classification: | 5 Natural sciences and mathematics / 54 Chemistry / 540 Chemistry and allied sciences |
Peer review: | Peer-reviewed |
Publishing method: | Open Access |
License (English): | Creative Commons - Attribution 2.0 Generic |