570 Biowissenschaften; Biologie
Refine
Year of publication
- 2023 (63) (remove)
Document Type
- Doctoral Thesis (47)
- Article (13)
- Postprint (3)
Is part of the Bibliography
- yes (63) (remove)
Keywords
- Arabidopsis thaliana (3)
- photosynthesis (3)
- 5-methoxycarbonylmethyl-2-thiouridine (2)
- H2S biosynthesis (2)
- Immunoassay (2)
- Klimawandel (2)
- Moco biosynthesis (2)
- Photosynthese (2)
- Pipistrellus nathusii (2)
- Stärke (2)
Sulfur is an important element that is incorporated into many biomolecules in humans. The incorporation and transfer of sulfur into biomolecules is, however, facilitated by a series of different sulfurtransferases. Among these sulfurtransferases is the human mercaptopyruvate sulfurtransferase (MPST) also designated as tRNA thiouridine modification protein (TUM1). The role of the human TUM1 protein has been suggested in a wide range of physiological processes in the cell among which are but not limited to involvement in Molybdenum cofactor (Moco) biosynthesis, cytosolic tRNA thiolation and generation of H2S as signaling molecule both in mitochondria and the cytosol. Previous interaction studies showed that TUM1 interacts with the L-cysteine desulfurase NFS1 and the Molybdenum cofactor biosynthesis protein 3 (MOCS3). Here, we show the roles of TUM1 in human cells using CRISPR/Cas9 genetically modified Human Embryonic Kidney cells. Here, we show that TUM1 is involved in the sulfur transfer for Molybdenum cofactor synthesis and tRNA thiomodification by spectrophotometric measurement of the activity of sulfite oxidase and liquid chromatography quantification of the level of sulfur-modified tRNA. Further, we show that TUM1 has a role in hydrogen sulfide production and cellular bioenergetics.
In late summer, migratory bats of the temperate zone face the challenge of accomplishing two energy-demanding tasks almost at the same time: migration and mating. Both require information and involve search efforts, such as localizing prey or finding potential mates. In non-migrating bat species, playback studies showed that listening to vocalizations of other bats, both con-and heterospecifics, may help a recipient bat to find foraging patches and mating sites. However, we are still unaware of the degree to which migrating bats depend on con-or heterospecific vocalizations for identifying potential feeding or mating opportunities during nightly transit flights. Here, we investigated the vocal responses of Nathusius’ pipistrelle bats, Pipistrellus nathusii, to simulated feeding and courtship aggregations at a coastal migration corridor. We presented migrating bats either feeding buzzes or courtship calls of their own or a heterospecific migratory species, the common noctule, Nyctalus noctula. We expected that during migratory transit flights, simulated feeding opportunities would be particularly attractive to bats, as well as simulated mating opportunities which may indicate suitable roosts for a stopover. However, we found that when compared to the natural silence of both pre-and post-playback phases, bats called indifferently during the playback of conspecific feeding sounds, whereas P. nathusii echolocation call activity increased during simulated feeding of N. noctula. In contrast, the call activity of P. nathusii decreased during the playback of conspecific courtship calls, while no response could be detected when heterospecific call types were broadcasted. Our results suggest that while on migratory transits, P. nathusii circumnavigate conspecific mating aggregations, possibly to save time or to reduce the risks associated with social interactions where aggression due to territoriality might be expected. This avoidance behavior could be a result of optimization strategies by P. nathusii when performing long-distance migratory flights, and it could also explain the lack of a response to simulated conspecific feeding. However, the observed increase of activity in response to simulated feeding of N. noctula, suggests that P. nathusii individuals may be eavesdropping on other aerial hawking insectivorous species during migration, especially if these occupy a slightly different foraging niche.
The African weakly electric fish genus Campylomormyrus includes 15 described species mostly native to the Congo River and its tributaries. They are considered sympatric species, because their distribution area overlaps. These species generate species-specific electric organ discharges (EODs) varying in waveform characteristics, including duration, polarity, and phase number. They exhibit also pronounced divergence in their snout, i.e. the length, thickness, and curvature. The diversifications in these two phenotypical traits (EOD and snout) have been proposed as key factors promoting adaptive radiation in Campylomormyrus. The role of EODs as a pre-zygotic isolation mechanism driving sympatric speciation by promoting assortative mating has been examined using behavioral, genetical, and histological approaches. However, the evolutionary effects of the snout morphology and its link to species divergence have not been closely examined. Hence, the main objective of this study is to investigate the effect of snout morphology diversification and its correlated EOD to better understand their sympatric speciation and evolutionary drivers. Moreover, I aim to utilize the intragenus and intergenus hybrids of Campylomormyrus to better understand trait divergence as well as underlying molecular/genetic mechanisms involved in the radiation scenario. To this end, I utilized three different approaches: feeding behavior analysis, diet assessment, and geometric morphometrics analysis. I performed feeding behavior experiments to evaluate the concept of the phenotype-environment correlation by testing whether Campylomormyrus species show substrate preferences. The behavioral experiments showed that the short snout species exhibits preference to sandy substrate, the long snout species prefers a stone substrate, and the species with intermediate snout size does not exhibit any substrate preference. The experiments suggest that the diverse feeding apparatus in the genus Campylomormyrus may have evolved in adaptation to their microhabitats. I also performed diet assessments of sympatric Campylomormyrus species and a sister genus species (Gnathonemus petersii) with markedly different snout morphologies and EOD using NGS-based DNA metabarcoding of their stomach contents. The diet of each species was documented showing that aquatic insects such as dipterans, coleopterans and trichopterans represent the major diet component. The results showed also that all species are able to exploit diverse food niches in their habitats. However, comparing the diet overlap indices showed that different snout morphologies and the associated divergence in the EOD translated into different prey spectra. These results further support the idea that the EOD could be a ‘magic trait’ triggering both adaptation and reproductive isolation. Geometric morphometrics method was also used to compare the phenotypical shape traits of the F1 intragenus (Campylomormyrus) and intergenus (Campylomormyrus species and Gnathonemus petersii) hybrids relative to their parents. The hybrids of these species were well separated based on the morphological traits, however the hybrid phenotypic traits were closer to the short-snouted species. In addition, the likelihood that the short snout expressed in the hybrids increases with increasing the genetic distance of the parental species. The results confirmed that additive effects produce intermediate phenotypes in F1-hybrids. It seems, therefore, that morphological shape traits in hybrids, unlike the physiological traits, were not expressed straightforward.
Ribosomes decode mRNA to synthesize proteins. Ribosomes, once considered static, executing machines, are now viewed as dynamic modulators of translation. Increasingly detailed analyses of structural ribosome heterogeneity led to a paradigm shift toward ribosome specialization for selective translation. As sessile organisms, plants cannot escape harmful environments and evolved strategies to withstand. Plant cytosolic ribosomes are in some respects more diverse than those of other metazoans. This diversity may contribute to plant stress acclimation. The goal of this thesis was to determine whether plants use ribosome heterogeneity to regulate protein synthesis through specialized translation. I focused on temperature acclimation, specifically on shifts to low temperatures. During cold acclimation, Arabidopsis ceases growth for seven days while establishing the responses required to resume growth. Earlier results indicate that ribosome biogenesis is essential for cold acclimation. REIL mutants (reil-dkos) lacking a 60S maturation factor do not acclimate successfully and do not resume growth. Using these genotypes, I ascribed cold-induced defects of ribosome biogenesis to the assembly of the polypeptide exit tunnel (PET) by performing spatial statistics of rProtein changes mapped onto the plant 80S structure. I discovered that growth cessation and PET remodeling also occurs in barley, suggesting a general cold response in plants. Cold triggered PET remodeling is consistent with the function of Rei-1, a REIL homolog of yeast, which performs PET quality control. Using seminal data of ribosome specialization, I show that yeast remodels the tRNA entry site of ribosomes upon change of carbon sources and demonstrate that spatially constrained remodeling of ribosomes in metazoans may modulate protein synthesis. I argue that regional remodeling may be a form of ribosome specialization and show that heterogeneous cytosolic polysomes accumulate after cold acclimation, leading to shifts in the translational output that differs between wild-type and reil-dkos. I found that heterogeneous complexes consist of newly synthesized and reused proteins. I propose that tailored ribosome complexes enable free 60S subunits to select specific 48S initiation complexes for translation. Cold acclimated ribosomes through ribosome remodeling synthesize a novel proteome consistent with known mechanisms of cold acclimation. The main hypothesis arising from my thesis is that heterogeneous/ specialized ribosomes alter translation preferences, adjust the proteome and thereby activate plant programs for successful cold acclimation.
Background: The role of fatty acid (FA) intake and metabolism in type 2 diabetes (T2D) incidence is controversial. Some FAs are not synthesised endogenously and, therefore, these circulating FAs reflect dietary intake, for example, the trans fatty acids (TFAs), saturated odd chain fatty acids (OCFAs), and linoleic acid, an n-6 polyunsaturated fatty acids (PUFA). It remains unclear if intake of TFA influence T2D risk and whether industrial TFAs (iTFAs) and ruminant TFAs (rTFAs) exert the same effect. Unlike even chain saturated FAs, the OCFAs have been inversely associated with T2D risk, but this association is poorly understood. Furthermore, the associations of n-6 PUFAs intake with T2D risk are still debated, while delta-5 desaturase (D5D), a key enzyme in the metabolism of PUFAs, has been consistently related to T2D risk. To better understand these relationships, the FA composition in circulating lipid fractions can be used as biomarkers of dietary intake and metabolism. The exploration of TFAs subtypes in plasma phospholipids and OCFAs and n-6 PUFAs within a wide range of lipid classes may give insights into the pathophysiology of T2D.
Aim: This thesis aimed mainly to analyse the association of TFAs, OCFAs and n-6 PUFAs with self-reported dietary intake and prospective T2D risk, using seven types of TFAs in plasma phospholipids and deep lipidomics profiling data from fifteen lipid classes.
Methods: A prospective case-cohort study was designed within the European Prospective Investigation into Cancer and Nutrition (EPIC)-Potsdam study, including all the participants who developed T2D (median follow-up 6.5 years) and a random subsample of the full cohort (subcohort: n=1248; T2D cases: n=820). The main analyses included two lipid profiles. The first was an assessment of seven TFA in plasma phospholipids, with a modified method for analysis of FA with very low abundances. The second lipid profile was derived from a high-throughout lipid profiling technology, which identified 940 distinct molecular species and allowed to quantify OCFAs and PUFAs composition across 15 lipid classes. Delta-5 desaturase (D5D) activity was estimated as 20:4/20:3-ratio. Using multivariable Cox regression models, we examined the associations of TFA subtypes with incident T2D and class-specific associations of OCFA and n-6 PUFAs with T2D risk.
Results: 16:1n-7t, 18:1n-7t, and c9t11-CLA were positively correlated with the intake of fat-rich dairy foods. iTFA 18:1 isomers were positively correlated with margarine. After adjustment for confounders and other TFAs, higher plasma phospholipid concentrations of two rTFAs were associated with a lower incidence of T2D: 18:1n-7t and t10c12-CLA. In contrast, the rTFA c9t11-CLA was associated with a higher incidence of T2D. rTFA 16:1n-7t and iTFAs (18:1n-6t, 18:1n-9t, 18:2n-6,9t) were not statistically significantly associated with T2D risk.
We observed heterogeneous integration of OCFA in different lipid classes, and the contribution of 15:0 versus 17:0 to the total OCFA abundance differed across lipid classes. Consumption of fat-rich dairy and fiber-rich foods were positively and red meat inversely correlated to OCFA abundance in plasma phospholipid classes. In women only, higher abundances of 15:0 in phosphatidylcholines (PC) and diacylglycerols (DG), and 17:0 in PC, lysophosphatidylcholines (LPC), and cholesterol esters (CE) were inversely associated with T2D risk. In men and women, a higher abundance of 15:0 in monoacylglycerols (MG) was also inversely associated with T2D. Conversely, a higher 15:0 concentration in LPC and triacylglycerols (TG) was associated with higher T2D risk in men. Women with a higher concentration of 17:0 as free fatty acids (FFA) also had higher T2D incidence.
The integration of n-6 PUFAs in lipid classes was also heterogeneous. 18:2 was highly abundant in phospholipids (particularly PC), CE, and TG; 20:3 represented a small fraction of FA in most lipid classes, and 20:4 accounted for a large proportion of circulating phosphatidylinositol (PI) and phosphatidylethanolamines (PE). Higher concentrations of 18:2 were inversely associated with T2D risk, especially within DG, TG, and LPC. However, 18:2 as part of MG was positively associated with T2D risk. Higher concentrations of 20:3 in phospholipids (PC, PE, PI), FFA, CE, and MG were linked to higher T2D incidence. 20:4 was unrelated to risk in most lipid classes, except positive associations were observed for 20:4 enriched in FFA and PE. The estimated D5D activities in PC, PE, PI, LPC, and CE were inversely associated with T2D and explained variance of estimated D5D activity by genomic variation in the FADS locus was only substantial in those lipid classes.
Conclusion: The TFAs' conformation is essential in their relationship to diabetes risk, as indicated by plasma rTFA subtypes concentrations having opposite directions of associations with diabetes risk. Plasma OCFA concentration is linked to T2D risk in a lipid class and sex-specific manner. Plasma n-6 PUFA concentrations are associated differently with T2D incidence depending on the specific FA and the lipid class. Overall, these results highlight the complexity of circulating FAs and their heterogeneous association with T2D risk depending on the specific FA structure, lipid class, and sex. My results extend the evidence of the relationship between diet, lipid metabolism, and subsequent T2D risk. In addition, my work generated several potential new biomarkers of dietary intake and prospective T2D risk.
Mitochondria and plastids are organelles with an endosymbiotic origin. During evolution, many genes are lost from the organellar genomes and get integrated in the nuclear genome, in what is known as intracellular/endosymbiotic gene transfer (IGT/EGT). IGT has been reproduced experimentally in Nicotiana tabacum at a gene transfer rate (GTR) of 1 event in 5 million cells, but, despite its centrality to eukaryotic evolution, there are no genetic factors known to influence the frequency of IGT in higher eukaryotes. The focus of this work was to determine the role of different DNA repair pathways of double strand break repair (DSBR) in the integration step of organellar DNA in the nuclear genome during IGT. Here, a CRISPR/Cas9 mutagenesis strategy was implemented in N. tabacum, with the aim of generating mutants in nuclear genes without expected visible phenotypes. This strategy led to the generation of a collection of independent mutants in the LIG4 (necessary for non-homologous end joining, NHEJ) and POLQ genes (necessary for microhomology mediated end joining, MMEJ). Targeting of other DSBR genes (KU70, KU80, RPA1C) generated mutants with unexpectedly strong developmental phenotypes.. These factors have telomeric roles, hinting towards a possible relationship between telomere length, and strength of developmental disruption upon loss of telomere structure in plants. The mutants were made in a genetic background encoding a plastid-encoded IGT reporter, that confers kanamycin resistance upon transfer to the nucleus. Through large scale independent experiments, increased IGT from the chloroplast to the nucleus was observed in lig4 mutants, as well as lines encoding a POLQ gene with a defective polymerase domain (polqΔPol). This shows that NHEJ or MMEJ have a double-sided relationship with IGT: while transferred genes may integrate using either pathway, the presence of both pathways suppresses IGT in wild-type somatic cells, thus demonstrating for the first time the extent on which nuclear genes control IGT frequency in plants. The IGT frequency increases in the mutants are likely mediated by increased availability of double strand breaks for integration. Additionally, kinetic analysis reveals that gene transfer (GT) events accumulate linearly as a function of time spent under antibiotic selection in the experiment, demonstrating that, contrary to what was previously thought, there is no such thing as a single GTR in somatic IGT experiments. Furthermore, IGT in tissue culture experiments appears to be the result of a "race against the clock" for integration in the nuclear genome, that starts when the organellar DNA arrives to the nucleus granting transient antibiotic resistance. GT events and escapes of kanamycin selection may be two possible outcomes from this race: those instances where the organellar DNA gets to integrate are recovered as GT events, and in those cases where timely integration fails, antibiotic resistance cannot be sustained, and end up considered as escapes. In the mutants, increased opportunities for integration in the nuclear genome change the overall ratio between IGT and escape events. The resources generated here are promising starting points for future research: (1) the mutant collection, for the further study of processes that depend on DNA repair in plants (2) the collection of GT lines obtained from these experiments, for the study of the effect of DSBR pathways over integration patterns and stability of transferred genes and (3) the developed CRISPR/Cas9 workflow for mutant generation, to make N. tabacum meet its potential as an attractive model for answering complex biological questions.
The G protein-coupled estrogen receptor (GPER1) is acknowledged as an important mediator of estrogen signaling. Given the ubiquitous expression of GPER1, it is likely that the receptor plays a role in a variety of malignancies, not only in the classic hormonally regulated tissues (e.g., breast, ovary, and prostate), but also in the colon. As colorectal cancer (CRC) is the third most common cancer in both men and women worldwide and environmental factors and dietary habits are important risk factors, it is increasingly recognized that natural and synthetic hormones and their associated receptors might play a role in CRC. Through oral consumption, environmental contaminants with endocrine activity are in contact with the gastrointestinal mucosa, where they might exert their toxic effects. Although GPER1 has been shown to be engaged in physiological and pathophysiological processes, its role in CRC remains poorly understood. Thus, pro- as well as anti-tumorigenic effects are described in the literature. This thesis has uncovered novel roles of GPER1 in mediating major CRC-associated phenotypes in transformed and non-transformed colon cell lines. Exposure to the estrogens 17β-estradiol (E2), bisphenol-A (BPA) and diethylstilbestrol (DES) but also the androgen dihydrotestosterone (DHT) resulted in GPER1-dependent induction of supernumerary centrosomes, whole chromosomal instability (w-CIN) and aneuploidy. Indeed, both knockdown and inhibition of GPER1 attenuated the generation of (xeno)hormone-driven supernumerary centrosomes and karyotype instability. Mechanistically, (xeno)hormone-induced centrosome amplification was associated with transient multipolar mitosis and the generation of so called anaphase “lagging” chromosomes. The results of this thesis propose a GPER1/PKA/AKAP9-pathway in regulating centrosome numbers in colorectal cancer cells and the involvement of the centriolar protein centrin. Remarkably, exposure to (xeno)hormones resulted in atypical enlargement and unexpected phosphorylation of the centriole marker centrin in interphase. These findings provide a novel role for GPER1 in key CRC-prone lesions and shed light on underlying mechanisms that involve GPER1 function in the colon. Elucidating to what extent centrosomal proteins are involved in the GPER1-mediated aneugenic effect will be an important task for future studies. The present study was intended to lay a first foundation to understand the molecular basis and potential risk factors of CRC which might help to reduce the use of laboratory animals. Since numerous animal experiments are conducted in biomedical research, the development of alternative methods is indispensable. The Federal Institute for Risk Assessment (BfR) as the German Center for the Protection of Laboratory Animals (Bf3R) addresses this issue by uncovering underlying mechanisms leading to colorectal cancer as necessary prerequisite in order to develop alternative methods.
Photosynthesis converts light into metabolic energy which fuels plant growth. In nature, many factors influence light availability for photosynthesis on different time scales, from shading by leaves within seconds up to seasonal changes over months. Variability of light energy supply for photosynthesis can limit a plant´s biomass accumulation. Plants have evolved multiple strategies to cope with strongly fluctuation light (FL). These range from long-term optimization of leaf morphology and physiology and levels of pigments and proteins in a process called light acclimation, to rapid changes in protein activity within seconds. Therefore, uncovering how plants deal with FL on different time scales may provide key ideas for improving crop yield. Photosynthesis is not an isolated process but tightly integrates with metabolism through mutual regulatory interactions. We thus require mechanistic understanding of how long-term light acclimation shapes both, dynamic photosynthesis and its interactions with downstream metabolism. To approach this, we analyzed the influence of growth light on i) the function of known rapid photosynthesis regulators KEA3 and VCCN1 in dynamic photosynthesis (Chapter 2-3) and ii) the interconnection of photosynthesis with photorespiration (PR; Chapter 4).
We approached topic (i) by quantifying the effect of different growth light regimes on photosynthesis and photoprotection by using kea3 and vccn1 mutants. Firstly, we found that, besides photosynthetic capacity, the activities of VCCN1 and KEA3 during a sudden high light phase also correlated with growth light intensity. This finding suggests regulation of both proteins by the capacity of downstream metabolism. Secondly, we showed that KEA3 accelerated photoprotective non-photochemical quenching (NPQ) kinetics in two ways: Directly via downregulating the lumen proton concentration and thereby de-activating pH-dependent NPQ, and indirectly via suppressing accumulation of the photoprotective pigment zeaxanthin.
For topic (ii), we analyzed the role of PR, a process which recycles a toxic byproduct of the carbon fixation reactions, in metabolic flexibility in a dynamically changing light environment. For this we employed the mutants hpr1 and ggt1 with a partial block in PR. We characterized the function of PR during light acclimation by tracking molecular and physiological changes of the two mutants. Our data, in contrast to previous reports, disprove a generally stronger physiological relevance of PR under dynamic light conditions. Additionally, the two different mutants showed pronounced and distinct metabolic changes during acclimation to a condition inducing higher photosynthetic activity. This underlines that PR cannot be regarded purely as a cyclic detoxification pathway for 2PG. Instead, PR is highly interconnected with plant metabolism, with GGT1 and HPR1 representing distinct metabolic modulators.
In summary, the presented work provides further insight into how energetic and metabolic flexibility is ensured by short-term regulators and PR during long-term light acclimation.
The light reactions of photosynthesis are carried out by a series of multiprotein complexes embedded in thylakoid membranes. Among them, photosystem I (PSI), acting as plastocyanin-ferderoxin oxidoreductase, catalyzes the final reaction. Together with light-harvesting antenna I, PSI forms a high-molecular-weight supercomplex of ~600 kDa, consisting of eighteen subunits and nearly two hundred co-factors. Assembly of the various components into a functional thylakoid membrane complex requires precise coordination, which is provided by the assembly machinery. Although this includes a small number of proteins (PSI assembly factors) that have been shown to play a role in the formation of PSI, the process as a whole, as well as the intricacy of its members, remains largely unexplored.
In the present work, two approaches were used to find candidate PSI assembly factors. First, EnsembleNet was used to select proteins thought to be functionally related to known PSI assembly factors in Arabidopsis thaliana (approach I), and second, co-immunoprecipitation (Co-IP) of tagged PSI assembly factors in Nicotiana tabacum was performed (approach II).
Here, the novel PSI assembly factors designated CO-EXPRESSED WITH PSI ASSEMBLY 1 (CEPA1) and Ycf4-INTERACTING PROTEIN 1 (Y4IP1) were identified. A. thaliana null mutants for CEPA1 and Y4IP1 showed a growth phenotype and pale leaves compared with the wild type. Biophysical experiments using pulse amplitude modulation (PAM) revealed insufficient electron transport on the PSII acceptor side. Biochemical analyses revealed that both CEPA1 and Y4IP1 are specifically involved in PSI accumulation in A. thaliana at the post-translational level but are not essential. Consistent with their roles as factors in the assembly of a thylakoid membrane protein complex, the two proteins localize to thylakoid membranes. Remarkably, cepa1 y4ip1 double mutants exhibited lethal phenotypes in early developmental stages under photoautotrophic growth. Finally, co-IP and native gel experiments supported a possible role for CEPA1 and Y4IP1 in mediating PSI assembly in conjunction with other PSI assembly factors (e.g., PPD1- and PSA3-CEPA1 and Ycf4-Y4IP1). The fact that CEPA1 and Y4IP1 are found exclusively in green algae and higher plants suggests eukaryote-specific functions. Although the specific mechanisms need further investigation, CEPA1 and Y4IP1 are two novel assembly factors that contribute to PSI formation.
Following the extinction of dinosaurs, the great adaptive radiation of mammals occurred, giving rise to an astonishing ecological and phenotypic diversity of mammalian species. Even closely related species often inhabit vastly different habitats, where they encounter diverse environmental challenges and are exposed to different evolutionary pressures. As a response, mammals evolved various adaptive phenotypes over time, such as morphological, physiological and behavioural ones. Mammalian genomes vary in their content and structure and this variation represents the molecular mechanism for the long-term evolution of phenotypic variation. However, understanding this molecular basis of adaptive phenotypic variation is usually not straightforward.
The recent development of sequencing technologies and bioinformatics tools has enabled a better insight into mammalian genomes. Through these advances, it was acknowledged that mammalian genomes differ more, both within and between species, as a consequence of structural variation compared to single-nucleotide differences. Structural variant types investigated in this thesis - such as deletion, duplication, inversion and insertion, represent a change in the structure of the genome, impacting the size, copy number, orientation and content of DNA sequences. Unlike short variants, structural variants can span multiple genes. They can alter gene dosage, and cause notable gene expression differences and subsequently phenotypic differences. Thus, they can lead to a more dramatic effect on the fitness (reproductive success) of individuals, local adaptation of populations and speciation.
In this thesis, I investigated and evaluated the potential functional effect of structural variations on the genomes of mustelid species. To detect the genomic regions associated with phenotypic variation I assembled the first reference genome of the tayra (Eira barbara) relying on linked-read sequencing technology to achieve a high level of genome completeness important for reliable structural variant discovery. I then set up a bioinformatics pipeline to conduct a comparative genomic analysis and explore variation between mustelid species living in different environments. I found numerous genes associated with species-specific phenotypes related to diet, body condition and reproduction among others, to be impacted by structural variants.
Furthermore, I investigated the effects of artificial selection on structural variants in mice selected for high fertility, increased body mass and high endurance. Through selective breeding of each mouse line, the desired phenotypes have spread within these populations, while maintaining structural variants specific to each line. In comparison to the control line, the litter size has doubled in the fertility lines, individuals in the high body mass lines have become considerably larger, and mice selected for treadmill performance covered substantially more distance. Structural variants were found in higher numbers in these trait-selected lines than in the control line when compared to the mouse reference genome. Moreover, we have found twice as many structural variants spanning protein-coding genes (specific to each line) in trait-selected lines. Several of these variants affect genes associated with selected phenotypic traits. These results imply that structural variation does indeed contribute to the evolution of the selected phenotypes and is heritable.
Finally, I suggest a set of critical metrics of genomic data that should be considered for a stringent structural variation analysis as comparative genomic studies strongly rely on the contiguity and completeness of genome assemblies. Because most of the available data used to represent reference genomes of mammalian species is generated using short-read sequencing technologies, we may have incomplete knowledge of genomic features. Therefore, a cautious structural variation analysis is required to minimize the effect of technical constraints.
The impact of structural variants on the adaptive evolution of mammalian genomes is slowly gaining more focus but it is still incorporated in only a small number of population studies. In my thesis, I advocate the inclusion of structural variants in studies of genomic diversity for a more comprehensive insight into genomic variation within and between species, and its effect on adaptive evolution.
The musculoskeletal system provides support and enables movement to the body, and its deterioration is a crucial aspect of age-related functional decline. Mesenchymal stromal cells (MSCs) play an important role in musculoskeletal homeostasis due to their broad differentiation potentials and their ability to support osteogenic and myogenic tissue maintenance and regeneration. In the bone, MSCs differentiate either into osteochondrogenic progenitors to form osteocytes and chondrocytes, or increasingly with age into adipogenic progenitors which give rise to bone-resident adipocytes. In skeletal muscle, during healthy regeneration MSCs provide regulatory signals that activate local, tissue-specific stem cells, known as satellite cells, which regenerate contractile myofibres. This process involves a significant cross-talk to immune cells stemming from both lymphoid and myeloid lineages. During ageing, muscle-resident MSCs undergo increased adipogenic lineage commitment, causing niche changes that contribute to fatty infiltration in muscles. These shifts in cell populations in bone lead to the loss of osteogenic cells and subsequently osteoporosis, or in muscle to impaired regeneration and to the development of sarcopenia. However, the signals that drive transition of MSCs into their respective cellular fates remain elusive.
This thesis aims to elucidate the transcriptional shifts modulating cell states and cell types in musculoskeletal MSC fate determination. Single-cell RNA-sequencing (scRNA-seq) was used to characterise cell type-specific transcript regulation. State-of-the-art bioinformatics tools were combined with different analytical platforms that include both droplet-based scRNA-seq for large heterogeneous populations, and microfluidics-based scRNA-seq to assess small, rare subpopulations. For each platform, distinct computational pipelines were established including filtering steps to exclude low-quality cells, and data visualisation was performed by dimensionality reduction. Downstream analysis included clustering, cell type annotation, and differential gene expression to investigate transcriptional states in defined cell types during ageing and injury in the muscle and bone. Finally, a novel tool to assess publication activities in defined areas of research for the identified marker genes was developed.
The results in the bone indicate that ageing MSCs increasingly commit towards an adipogenic fate at the expense of osteogenic specialisation. The data also suggests that significant cell population shifts of MSC-type fibro-adipogenic progenitors during muscle ageing underlie the pathologies observed in homeostatic and post-injury regenerative conditions. High-throughput visualisation of publication activity for candidate genes enabled more effective biological evaluation of scRNA-seq data. These results expose critical age-related changes in the stem cell niches of skeletal muscle and bone, highlight their respective sensitivity to nutrition and pathology, and elucidate novel factors that modulate stem cell-based regeneration. Targeting these processes might improve musculoskeletal health in the context of ageing and prevent the negative effects of pathological lineage determination.
Selenium (Se) is an essential trace element that is ubiquitously present in the environment in small concentrations. Essential functions of Se in the human body are manifested through the wide range of proteins, containing selenocysteine as their active center. Such proteins are called selenoproteins which are found in multiple physiological processes like antioxidative defense and the regulation of thyroid hormone functions. Therefore, Se deficiency is known to cause a broad spectrum of physiological impairments, especially in endemic regions with low Se content. Nevertheless, being an essential trace element, Se could exhibit toxic effects, if its intake exceeds tolerable levels. Accordingly, this range between deficiency and overexposure represents optimal Se supply. However, this range was found to be narrower than for any other essential trace element. Together with significantly varying Se concentrations in soil and the presence of specific bioaccumulation factors, this represents a noticeable difficulty in the assessment of Se
epidemiological status. While Se is acting in the body through multiple selenoproteins, its intake occurs mainly in form of small organic or inorganic molecular mass species. Thus, Se exposure not only depends on daily intake but also on the respective chemical form, in which it is present.
The essential functions of selenium have been known for a long time and its primary forms in different food sources have been described. Nevertheless, analytical capabilities for a comprehensive investigation of Se species and their derivatives have been introduced only in the last decades. A new Se compound was identified in 2010 in the blood and tissues of bluefin tuna. It was called selenoneine (SeN) since it is an isologue of naturally occurring antioxidant ergothioneine (ET), where Se replaces sulfur. In the following years, SeN was identified in a number of edible fish species and attracted attention as a new dietary Se source and potentially strong antioxidant. Studies in populations whose diet largely relies on fish revealed that SeN
represents the main non-protein bound Se pool in their blood. First studies, conducted with enriched fish extracts, already demonstrated the high antioxidative potential of SeN and its possible function in the detoxification of methylmercury in fish. Cell culture studies demonstrated, that SeN can utilize the same transporter as ergothioneine, and SeN metabolite was found in human urine.
Until recently, studies on SeN properties were severely limited due to the lack of ways to obtain the pure compound. As a predisposition to this work was firstly a successful approach to SeN synthesis in the University of Graz, utilizing genetically modified yeasts. In the current study, by use of HepG2 liver carcinoma cells, it was demonstrated, that SeN does not cause toxic effectsup to 100 μM concentration in hepatocytes. Uptake experiments showed that SeN is not bioavailable to the used liver cells.
In the next part a blood-brain barrier (BBB) model, based on capillary endothelial cells from the porcine brain, was used to describe the possible transfer of SeN into the central nervous system (CNS). The assessment of toxicity markers in these endothelial cells and monitoring of barrier conditions during transfer experiments demonstrated the absence of toxic effects from SeN on the BBB endothelium up to 100 μM concentration. Transfer data for SeN showed slow but substantial transfer. A statistically significant increase was observed after 48 hours following SeN incubation from the blood-facing side of the barrier. However, an increase in Se content was clearly visible already after 6 hours of incubation with 1 μM of SeN. While the transfer rate of SeN after application of 0.1 μM dose was very close to that for 1 μM, incubation with 10 μM of SeN resulted in a significantly decreased transfer rate. Double-sided application of SeN caused no side-specific transfer of SeN, thus suggesting a passive diffusion mechanism of SeN across the BBB. This data is in accordance with animal studies, where ET accumulation was observed in the rat brain, even though rat BBB does not have the primary ET transporter – OCTN1. Investigation of capillary endothelial cell monolayers after incubation with SeN and reference selenium compounds showed no significant increase of intracellular selenium concentration. Speciesspecific Se measurements in medium samples from apical and basolateral compartments, as good as in cell lysates, showed no SeN metabolization. Therefore, it can be concluded that SeN may reach the brain without significant transformation.
As the third part of this work, the assessment of SeN antioxidant properties was performed in Caco-2 human colorectal adenocarcinoma cells. Previous studies demonstrated that the intestinal epithelium is able to actively transport SeN from the intestinal lumen to the blood side and accumulate SeN. Further investigation within current work showed a much higher antioxidant potential of SeN compared to ET. The radical scavenging activity after incubation with SeN was close to the one observed for selenite and selenomethionine. However, the SeN effect on the viability of intestinal cells under oxidative conditions was close to the one caused by ET. To answer the question if SeN is able to be used as a dietary Se source and induce the activity of selenoproteins, the activity of glutathione peroxidase (GPx) and the secretion of selenoprotein P (SelenoP) were measured in Caco-2 cells, additionally. As expected, reference selenium compounds selenite and selenomethionine caused efficient induction of GPx activity. In contrast to those SeN had no effect on GPx activity. To examine the possibility of SeN being embedded into the selenoproteome, SelenoP was measured in a culture medium. Even though Caco-2 cells effectively take up SeN in quantities much higher than selenite or selenomethionine, no secretion of SelenoP was observed after SeN incubation.
Summarizing, we can conclude that SeN can hardly serve as a Se source for selenoprotein synthesis. However, SeN exhibit strong antioxidative properties, which appear when sulfur in ET is exchanged by Se. Therefore, SeN is of particular interest for research not as part of Se metabolism, but important endemic dietary antioxidant.
Hantaviruses (HVs) are a group of zoonotic viruses that infect human beings primarily through aerosol transmission of rodent excreta and urine samplings. HVs are classified geographically into: Old World HVs (OWHVs) that are found in Europe and Asia, and New World HVs (NWHVs) that are observed in the Americas. These different strains can cause severe hantavirus diseases with pronounced renal syndrome or severe cardiopulmonary system distress. HVs can be extremely lethal, with NWHV infections reaching up to 40 % mortality rate. HVs are known to generate epidemic outbreaks in many parts of the world including Germany, which has seen periodic HV infections over the past decade. HV has a trisegmented genome. The small segment (S) encodes the nucleocapsid protein (NP), the middle segment (M) encodes the glycoproteins (GPs) Gn and Gc which forms up to tetramers and primarily monomers \& dimers upon independent expression respectively and large segment (L) encodes RNA dependent RNA polymerase (RdRp). Interactions between these viral proteins are crucial in providing mechanistic insights into HV virion development. Despite best efforts, there continues to be lack of quantification of these associations in living cells. This is required in developing the mechanistic models for HV viral assembly. This dissertation focuses on three key questions pertaining to the initial steps of virion formation that primarily involves the GPs and NP.
The research investigations in this work were completed using Fluorescence Correlation Spectroscopy (FCS) approaches. FCS is frequently used in assessing the biophysical features of bio-molecules including protein concentration and diffusion dynamics and circumvents the requirement of protein overexpression. FCS was primarily applied in this thesis to evaluate protein multimerization, at single cell resolution.
The first question addressed which GP spike formation model proposed by Hepojoki et al.(2010) appropriately describes the evidence in living cells. A novel in cellulo assay was developed to evaluate the amount of fluorescently labelled and unlabeled GPs upon co-expression. The results clearly showed that Gn and Gc initially formed a heterodimeric Gn:Gc subunit. This sub-unit then multimerizes with congruent Gn:Gc subunits to generate the final GP spike. Based on these interactions, models describing the formation of GP complex (with multiple GP spike subunits) were additionally developed.
HV GP assembly primarily takes place in the Golgi apparatus (GA) of infected cells. Interestingly, NWHV GPs are hypothesized to assemble at the plasma membrane (PM). This led to the second research question in this thesis, in which a systematic comparison between OWHV and NWHV GPs was conducted to validate this hypothesis. Surprisingly, GP localization at the PM was congruently observed with OWHV and NWHV GPs. Similar results were also discerned with OWHV and NWHV GP localization in the absence of cytoskeletal factors that regulate HV trafficking in cells.
The final question focused on quantifying the NP-GP interactions and understanding their influence of NP and GP multimerization. Gc mutlimers were detected in the presence of NP and complimented by the presence of localized regions of high NP-Gc interactions in the perinuclear region of living cells. Gc-CT domain was shown to influence NP-Gc associations. Gn, on the other hand, formed up to tetrameric complexes, independent from the presence of NP.
The results in this dissertation sheds light on the initial steps of HV virion formation by quantifying homo and heterotypic interactions involving NP and GPs, which otherwise are very difficult to perform. Finally, the in cellulo methodologies implemented in this work can be potentially extended to understand other key interactions involved in HV virus assembly.
Pichia pastoris (syn. Komagataella phaffi) is a distinguished expression system widely used in industrial production processes. Recent molecular research has focused on numerous approaches to increase recombinant protein yield in P. pastoris. For example, the design of expression vectors and synthetic genetic elements, gene copy number optimization, or co-expression of helper proteins
(transcription factors, chaperones, etc.). However, high clonal variability of transformants and low screening throughput have hampered significant success.
To enhance screening capacities, display-based methodologies inherit the potential for efficient isolation of producer clones via fluorescence-activated cell sorting (FACS). Therefore, this study focused on developing a novel clone selection method that is based on the non-covalent attachment of Fab fragments on the P. pastoris cell surface to be applicable for FACS.
Initially, a P. pastoris display system was developed, which is a prerequisite for the surface capture of secreted Fabs. A Design of Experiments approach was applied to analyze the influence of various genetic elements on antibody fragment display. The combined P. pastoris formaldehyde dehydrogenase promoter (PFLD1), Saccharomyces cerevisiae invertase 2 signal peptide (ScSUC2), - agglutinin (ScSAG1) anchor protein, and the ARS of Kluyveromyces lactis (panARS) conferred highest display levels.
Subsequently, eight single-chain variable fragments (scFv) specific for the constant part of the Fab heavy or light chain were individually displayed in P. pastoris. Among the tested scFvs, the anti-human CH1 IgG domain scFv allowed the most efficient Fab capture detected by flow cytometry.
Irrespective of the Fab sequence, exogenously added as well as simultaneously secreted Fabs were successfully captured on the cell surface. Furthermore, Fab secretion capacities were shown to correlate to the level of surface-bound Fabs as demonstrated for characterized producer clones.
Flow-sorted clones presenting high amounts of Fabs showed an increase in median Fab titers (factor of 21 to 49) compared to unsorted clones when screened in deep-well plates. For selected candidates, improved functional Fab yields of sorted cells vs. unsorted cells were confirmed in an upscaled shake flask production. Since the scFv capture matrix was encoded on an episomal plasmid with inherently unstable autonomously replicating sequences (ARS), efficient plasmid curing was observed after removing the selective pressure. Hence, sorted clones could be immediately used for production without the need to modify the expression host or vector. The resulting switchable display/secretion system provides a streamlined approach for the isolation of Fab producers and subsequent Fab production.
Development of electrochemical antibody-based and enzymatic assays for mycotoxin analysis in food
(2023)
Electrochemical methods are promising to meet the demand for easy-to-use devices monitoring key parameters in the food industry. Many companies run own lab procedures for mycotoxin analysis, but it is a major goal to simplify the analysis. The enzyme-linked immunosorbent assay using horseradish peroxidase as enzymatic label, together with 3,3',5,5' tetramethylbenzidine (TMB)/H2O2 as substrates allows sensitive mycotoxin detection with optical detection methods. For the miniaturization of the detection step, an electrochemical system for mycotoxin analysis was developed. To this end, the electrochemical detection of TMB was studied by cyclic voltammetry on different screen-printed electrodes (carbon and gold) and at different pH values (pH 1 and pH 4). A stable electrode reaction, which is the basis for the further construction of the electrochemical detection system, could be achieved at pH 1 on gold electrodes. An amperometric detection method for oxidized TMB, using a custom-made flow cell for screen-printed electrodes, was established and applied for a competitive magnetic bead-based immunoassay for the mycotoxin ochratoxin A. A limit of detection of 150 pM (60 ng/L) could be obtained and the results were verified with optical detection. The applicability of the magnetic bead-based immunoassay was tested in spiked beer using a handheld potentiostat connected via Bluetooth to a smartphone for amperometric detection allowing to quantify ochratoxin A down to 1.2 nM (0.5 µg/L).
Based on the developed electrochemical detection system for TMB, the applicability of the approach was demonstrated with a magnetic bead-based immunoassay for the ergot alkaloid, ergometrine. Under optimized assay conditions a limit of detection of 3 nM (1 µg/L) was achieved and in spiked rye flour samples ergometrine levels in a range from 25 to 250 µg/kg could be quantified. All results were verified with optical detection. The developed electrochemical detection method for TMB gives great promise for the detection of TMB in many other HRP-based assays.
A new sensing approach, based on an enzymatic electrochemical detection system for the mycotoxin fumonisin B1 was established using an Aspergillus niger fumonisin amine oxidase (AnFAO). AnFAO was produced recombinantly in E. coli as maltose-binding protein fusion protein and catalyzes the oxidative deamination of fumonisins, producing hydrogen peroxide. It was found that AnFAO has a high storage and temperature stability. The enzyme was coupled covalently to magnetic particles, and the enzymatically produced H2O2 in the reaction with fumonisin B1 was detected amperometrically in a flow injection system using Prussian blue/carbon electrodes and the custom-made wall-jet flow cell. Fumonisin B1 could be quantified down to 1.5 µM (≈ 1 mg/L). The developed system represents a new approach to detect mycotoxins using enzymes and electrochemical methods.
The genetic structure of Bryde's whale (Balaenoptera brydei) on the central and western North Pacific feeding grounds was investigated using a total of 1195 mitochondrial control region sequences and 1182 microsatellite genotypes at 17 loci in specimens collected from three longitudinal areas, 1W (135 degrees E-165 degrees E), 1E (165 degrees E-180 degrees), and 2 (180 degrees-155 degrees W). Genetic diversities were similar among areas and a haplotype network did not show any geographic structure, while an analysis of molecular variance found evidence of genetic structure in this species. Pairwise FST and G'ST estimates and heterogeneity tests attributed this structure to weak but significant differentiation between areas 1W/1E and 2. A Mantel test and a high-resolution analysis of genetic diversity statistics showed a weak spatial cline of genetic differentiation. These findings could be reconciled by two possible stock structure scenarios: (1) a single population with kin-association affecting feeding ground preference and (2) two populations with feeding ground preference for either area 1W or area 2. An estimated dispersal rate between areas 1W and 2 indicates that both scenarios should be considered as a precautionary principle in stock assessments.
Movement is a mechanism that shapes biodiversity patterns across spatialtemporal scales. Thereby, the movement process affects species interactions, population dynamics and community composition. In this thesis, I disentangled the effects of movement on the biodiversity of zooplankton ranging from the individual to the community level. On the individual movement level, I used video-based analysis to explore the implication of movement behavior on preypredator interactions. My results showed that swimming behavior was of great importance as it determined their survival in the face of predation. The findings also additionally highlighted the relevance of the defense status/morphology of prey, as it not only affected the prey-predator relationship by the defense itself but also by plastic movement behavior. On the community movement level, I used a field mesocosm experiment to explore the role of dispersal (time i.e., from the egg bank into the water body and space i.e., between water bodies) in shaping zooplankton metacommunities. My results revealed that priority effects and taxon-specific dispersal limitation influenced community composition. Additionally, different modes of dispersal also generated distinct community structures. The egg bank and biotic vectors (i.e. mobile links) played significant roles in the colonization of newly available habitat patches. One crucial aspect that influences zooplankton species after arrival in new habitats is the local environmental conditions. By using common garden experiments, I assessed the performance of zooplankton communities in their home vs away environments in a group of ponds embedded within an agricultural landscape. I identified environmental filtering as a driving factor as zooplankton communities from individual ponds developed differently in their home and away environments. On the individual species level, there was no consistent indication of local adaptation. For some species, I found a higher abundance/fitness in their home environment, but for others, the opposite was the case, and some cases were indifferent.
Overall, the thesis highlights the links between movement and biodiversity patterns, ranging from the individual active movement to the community level.
Life on Earth is diverse and ranges from unicellular organisms to multicellular creatures like humans. Although there are theories about how these organisms might have evolved, we understand little about how ‘life’ started from molecules. Bottom-up synthetic biology aims to create minimal cells by combining different modules, such as compartmentalization, growth, division, and cellular communication.
All living cells have a membrane that separates them from the surrounding aqueous medium and helps to protect them. In addition, all eukaryotic cells have organelles that are enclosed by intracellular membranes. Each cellular membrane is primarily made of a lipid bilayer with membrane proteins. Lipids are amphiphilic molecules that assemble into molecular bilayers consisting of two leaflets. The hydrophobic chains of the lipids in the two leaflets face each other, and their hydrophilic headgroups face the aqueous surroundings. Giant unilamellar vesicles (GUVs) are model membrane systems that form large compartments with a size of many micrometers and enclosed by a single lipid bilayer. The size of GUVs is comparable to the size of cells, making them good membrane models which can be studied using an optical microscope. However, after the initial preparation, GUV membranes lack membrane proteins which have to be reconstituted into these membranes by subsequent preparation steps. Depending on the protein, it can be either attached via anchor lipids to one of the membrane leaflets or inserted into the lipid bilayer via its transmembrane domains.
The first step is to prepare the GUVs and then expose them to an exterior solution with proteins. Various protocols have been developed for the initial preparation of GUVs. For the second step, the GUVs can be exposed to a bulk solution of protein or can be trapped in a microfluidic device and then supplied with the protein solution. To minimize the amount of solution and for more precise measurements, I have designed a microfluidic device that has a main channel, and several dead-end side channels that are perpendicular to the main channel. The GUVs are trapped in the dead-end channels. This design exchanges the solution around the GUVs via diffusion from the main channel, thus shielding the GUVs from the flow within the main channel. This device has a small volume of just 2.5 μL, can be used without a pump and can be combined with a confocal microscope, enabling uninterrupted imaging of the GUVs during the experiments. I used this device for most of the experiments on GUVs that are discussed in this thesis.
In the first project of the thesis, a lipid mixture doped with an anchor lipid was used that can bind to a histidine chain (referred to as His-tag(ged) or 6H) via the metal cation Ni2+. This method is widely used for the biofunctionalization of GUVs by attaching proteins without a transmembrane domain. Fluorescently labeled His-tags which are bound to a membrane can be observed in a confocal microscope. Using the same lipid mixture, I prepared the GUVs with different protocols and investigated the membrane composition of the resulting GUVs by evaluating the amount of fluorescently labeled His-tagged molecules bound to their membranes. I used the microfluidic device described above to expose the outer leaflet of the vesicle to a constant concentration of the His-tagged molecules. Two fluorescent molecules with a His-tag were studied and compared: green fluorescent protein (6H-GFP) and fluorescein isothiocyanate (6H-FITC). Although the quantum yield in solution is similar for both molecules, the brightness of the membrane-bound 6H-GFP is higher than the brightness of the membrane-bound 6H-FITC. The observed difference in the brightness reveals that the fluorescence of the 6H-FITC is quenched by the anchor lipid via the Ni2+ ion. Furthermore, my measurements also showed that the fluorescence intensity of the membranebound His-tagged molecules depends on microenvironmental factors such as pH. For both 6H-GFP and 6H-FITC, the interaction with the membrane is quantified by evaluating the equilibrium dissociation constant. The membrane fluorescence is measured as a function of the fluorophores’ molar concentration. Theoretical analysis of these data leads to the equilibrium dissociation constants of (37.5 ± 7.5) nM for 6H-GFP and (18.5 ± 3.7) nM for 6H-FITC.
The anchor lipid mentioned previously used the metal cation Ni2+ to mediate the bond between the anchor lipid and the His-tag. The Ni2+ ion can be replaced by other transition metal ions. Studies have shown that Co3+ forms the strongest bonds with the His-tags attached to proteins. In these studies, strong oxidizing agents were used to oxidize the Co2+ mediated complex with the His-tagged protein to a Co3+ mediated complex. This procedure puts the proteins at risk of being oxidized as well. In this thesis, the vesicles were first prepared with anchor lipids without any metal cation. The Co3+ was added to these anchor lipids and finally the His-tagged protein was added to the GUVs to form the Co3+ mediated bond. This system was also established using the microfluidic device.
The different preparation procedures of GUVs usually lead to vesicles with a spherical morphology. On the other hand, many cell organelles have a more complex architecture with a non spherical topology. One fascinating example is provided by the endoplasmic reticulum (ER) which is made of a continuous membrane and extends throughout the cell in the form of tubes and sheets. The tubes are connected by three-way junctions and form a tubular network of irregular polygons. The formation and maintenance of these reticular networks requires membrane proteins that hydrolyize guanosine triphosphate (GTP). One of these membrane proteins is atlastin. In this thesis, I reconstituted the atlastin protein in GUV membranes using detergent-assisted reconstitution protocols to insert the proteins directly into lipid bilayers.
This thesis focuses on protein reconstitution by binding His-tagged proteins to anchor lipids and by detergent-assisted insertion of proteins with transmembrane domains. It also provides the design of a microfluidic device that can be used in various experiments, one example is the evaluation of the equilibrium dissociation constant for membrane-protein interactions. The results of this thesis will help other researchers to understand the protocols for preparing GUVs, to reconstitute proteins in GUVs, and to perform experiments using the microfluidic device. This knowledge should be beneficial for the long-term goal of combining the different modules of synthetic biology to make a minimal cell.
Gene expression data is analyzed to identify biomarkers, e.g. relevant genes, which serve for diagnostic, predictive, or prognostic use. Traditional approaches for biomarker detection select distinctive features from the data based exclusively on the signals therein, facing multiple shortcomings in regards to overfitting, biomarker robustness, and actual biological relevance. Prior knowledge approaches are expected to address these issues by incorporating prior biological knowledge, e.g. on gene-disease associations, into the actual analysis. However, prior knowledge approaches are currently not widely applied in practice because they are often use-case specific and seldom applicable in a different scope. This leads to a lack of comparability of prior knowledge approaches, which in turn makes it currently impossible to assess their effectiveness in a broader context.
Our work addresses the aforementioned issues with three contributions. Our first contribution provides formal definitions for both prior knowledge and the flexible integration thereof into the feature selection process. Central to these concepts is the automatic retrieval of prior knowledge from online knowledge bases, which allows for streamlining the retrieval process and agreeing on a uniform definition for prior knowledge. We subsequently describe novel and generalized prior knowledge approaches that are flexible regarding the used prior knowledge and applicable to varying use case domains. Our second contribution is the benchmarking platform Comprior. Comprior applies the aforementioned concepts in practice and allows for flexibly setting up comprehensive benchmarking studies for examining the performance of existing and novel prior knowledge approaches. It streamlines the retrieval of prior knowledge and allows for combining it with prior knowledge approaches. Comprior demonstrates the practical applicability of our concepts and further fosters the overall development and comparability of prior knowledge approaches. Our third contribution is a comprehensive case study on the effectiveness of prior knowledge approaches. For that, we used Comprior and tested a broad range of both traditional and prior knowledge approaches in combination with multiple knowledge bases on data sets from multiple disease domains. Ultimately, our case study constitutes a thorough assessment of a) the suitability of selected knowledge bases for integration, b) the impact of prior knowledge being applied at different integration levels, and c) the improvements in terms of classification performance, biological relevance, and overall robustness.
In summary, our contributions demonstrate that generalized concepts for prior knowledge and a streamlined retrieval process improve the applicability of prior knowledge approaches. Results from our case study show that the integration of prior knowledge positively affects biomarker results, particularly regarding their robustness. Our findings provide the first in-depth insights on the effectiveness of prior knowledge approaches and build a valuable foundation for future research.