Refine
Has Fulltext
- yes (44) (remove)
Year of publication
- 2018 (44) (remove)
Document Type
- Postprint (30)
- Doctoral Thesis (14)
Is part of the Bibliography
- yes (44) (remove)
Keywords
- protein (3)
- Biodiversität (2)
- GPS (2)
- adaptation (2)
- biodiversity (2)
- dynamics (2)
- expression (2)
- inheritance (2)
- microplastics (2)
- synthetic biology (2)
Institute
- Institut für Biochemie und Biologie (44) (remove)
Objective
We investigated the potential role of indirect benefits for female mate preferences in a highly promiscuous species of live-bearing fishes, the sailfin molly Poecilia latipinna using an integrative approach that combines methods from animal behavior, life-history evolution, and genetics. Males of this species solely contribute sperm for reproduction, and consequently females do not receive any direct benefits. Despite this, females typically show clear mate preferences. It has been suggested that females can increase their reproductive success through indirect benefits from choosing males of higher quality.
Results
Although preferences for large body size have been recorded as an honest signal for genetic quality, this particular study resulted in female preference being unaffected by male body size. Nonetheless, larger males did sire more offspring, but with no effect on offspring quality. This study presents a methodical innovation by combining preference testing with life history measurements—such as the determination of the dry weight of fish embryos—and paternity analyses on single fish embryos.
Abstract
Background
The unisexual Amazon molly (Poecilia formosa) originated from a hybridization between two sexual species, the sailfin molly (Poecilia latipinna) and the Atlantic molly (Poecilia mexicana). The Amazon molly reproduces clonally via sperm-dependent parthenogenesis (gynogenesis), in which the sperm of closely related species triggers embryogenesis of the apomictic oocytes, but typically does not contribute genetic material to the next generation. We compare for the first time the gonadal transcriptome of the Amazon molly to those of both ancestral species, P. mexicana and P. latipinna.
Results
We sequenced the gonadal transcriptomes of the P. formosa and its parental species P. mexicana and P. latipinna using Illumina RNA-sequencing techniques (paired-end, 100 bp). De novo assembly of about 50 million raw read pairs for each species was performed using Trinity, yielding 106,922 transcripts for P. formosa, 115,175 for P. latipinna, and 133,025 for P. mexicana after eliminating contaminations. On the basis of sequence similarity comparisons to other teleost species and the UniProt databases, functional annotation, and differential expression analysis, we demonstrate the similarity of the transcriptomes among the three species. More than 40% of the transcripts for each species were functionally annotated and about 70% were assigned to orthologous genes of a closely related species. Differential expression analysis between the sexual and unisexual species uncovered 2035 up-regulated and 564 down-regulated genes in P. formosa. This was exemplary validated for six genes by qRT-PCR.
Conclusions
We identified more than 130 genes related to meiosis and reproduction within the apomictically reproducing P. formosa. Overall expression of these genes seems to be down-regulated in the P. formosa transcriptome compared to both ancestral species (i.e., 106 genes down-regulated, 29 up-regulated). A further 35 meiosis and reproduction related genes were not found in the P. formosa transcriptome, but were only expressed in the sexual species. Our data support the hypothesis of general down-regulation of meiosis-related genes in the apomictic Amazon molly. Furthermore, the obtained dataset and identified gene catalog will serve as a resource for future research on the molecular mechanisms behind the reproductive mode of this unisexual species.
Communication is key to a wide variety of animal behaviours and multiple modalities are often involved in this exchange of information from sender to receiver. The communication of African weakly electric fish, however, is thought to be predominantly unimodal and is mediated by their electric sense, in which species-specific electric organ discharges (EODs) are generated in a context-dependent and thus variable sequence of pulse intervals (SPI). While the primary function of the electric sense is considered to be electrolocation, both of its components likely carry information regarding identity of the sender. However, a clear understanding of their contribution to species recognition is incomplete. We therefore analysed these two electrocommunication components (EOD waveform and SPI statistics) in two sympatric mormyrid Campylomormyrus species. In a set of five playback conditions, we further investigated which components may drive interspecific recognition and discrimination. While we found that both electrocommunication components are species-specific, the cues necessary for species recognition differ between the two species studied. While the EOD waveform and SPI were both necessary and sufficient for species recognition in C. compressirostris males, C. tamandua males apparently utilize other, non-electric modalities. Mapped onto a recent phylogeny, our results suggest that discrimination by electric cues alone may be an apomorphic trait evolved during a recent radiation in this taxon.
Coronary artery disease is the most common cause of death globally and is linked to a number of risk factors including serum low density lipoprotein, high density lipoprotein, triglycerides and lipoprotein(a). Recently two proteins, angiopoietin-like protein 3 and 4, have emerged from genetic studies as being factors that significantly modulate plasma triglyceride levels and coronary artery disease. The exact function and mechanism of action of both proteins remains to be elucidated, however, mutations in these proteins results in up to 34% reduction in coronary artery disease and inhibition of function results in reduced plasma triglyceride levels. Here we report the crystal structures of the fibrinogen-like domains of both proteins. These structures offer new insights into the reported loss of function mutations, the mechanisms of action of the proteins and open up the possibility for the rational design of low molecular weight inhibitors for intervention in coronary artery disease.
WRKY23 is a component of the transcriptional network mediating auxin feedback on PIN polarity
(2018)
Auxin is unique among plant hormones due to its directional transport that is mediated by the polarly distributed PIN auxin transporters at the plasma membrane. The canalization hypothesis proposes that the auxin feedback on its polar flow is a crucial, plant-specific mechanism mediating multiple self-organizing developmental processes. Here, we used the auxin effect on the PIN polar localization in Arabidopsis thaliana roots as a proxy for the auxin feedback on the PIN polarity during canalization. We performed microarray experiments to find regulators of this process that act downstream of auxin. We identified genes that were transcriptionally regulated by auxin in an AXR3/IAA17-and ARF7/ARF19-dependent manner. Besides the known components of the PIN polarity, such as PID and PIP5K kinases, a number of potential new regulators were detected, among which the WRKY23 transcription factor, which was characterized in more detail. Gain-and loss-of-function mutants confirmed a role for WRKY23 in mediating the auxin effect on the PIN polarity. Accordingly, processes requiring auxin-mediated PIN polarity rearrangements, such as vascular tissue development during leaf venation, showed a higher WRKY23 expression and required the WRKY23 activity. Our results provide initial insights into the auxin transcriptional network acting upstream of PIN polarization and, potentially, canalization-mediated plant development.
Spotlight on islands
(2018)
Groups of proximate continental islands may conceal more tangled phylogeographic patterns than oceanic archipelagos as a consequence of repeated sea level changes, which allow populations to experience gene flow during periods of low sea level stands and isolation by vicariant mechanisms during periods of high sea level stands. Here, we describe for the first time an ancient and diverging lineage of the Italian wall lizard Podarcis siculus from the western Pontine Islands. We used nuclear and mitochondrial DNA sequences of 156 individuals with the aim of unraveling their phylogenetic position, while microsatellite loci were used to test several a priori insular biogeographic models of migration with empirical data. Our results suggest that the western Pontine populations colonized the islands early during their Pliocene volcanic formation, while populations from the eastern Pontine Islands seem to have been introduced recently. The inter-island genetic makeup indicates an important role of historical migration, probably due to glacial land bridges connecting islands followed by a recent vicariant mechanism of isolation. Moreover, the most supported migration model predicted higher gene flow among islands which are geographically arranged in parallel. Considering the threatened status of small insular endemic populations, we suggest this new evolutionarily independent unit be given priority in conservation efforts.
There are 63 known species of Thecaphora (Glomosporiaceae, Ustilaginomycotina), a third of which occur on Asteraceae. These smut fungi produce yellowish-brown to reddish-brown masses of spore balls in specific, mostly regenerative, plant organs. A species of Thecaphora was collected in the flower heads of Anthemis chia (Anthemideae, Asteraceae) on Rhodes Island, Greece, in 2015 and 2017, which represents the first smut record of a smut fungus on a host plant species in this tribe. Based on its distinctive morphology, host species and genetic divergence, this species is described as Thecaphora anthemidis sp. nov. Molecular barcodes of the ITS region are provided for this and several other species of Thecaphora. A phylogenetic and morphological comparison to closely related species showed that Th. anthemidis differed from other species of Thecaphora. Thecaphora anthemidis produced loose spore balls in the flower heads and peduncles of Anthemis chia unlike other flower-infecting species.
Taxonomy plays a central role in biological sciences. It provides a communication system for scientists as it aims to enable correct identification of the studied organisms. As a consequence, species descriptions should seek to include as much available information as possible at species level to follow an integrative concept of 'taxonomics'. Here, we describe the cryptic species Epimeria frankei sp. nov. from the North Sea, and also redescribe its sister species, Epimeria cornigera. The morphological information obtained is substantiated by DNA barcodes and complete nuclear 18S rRNA gene sequences. In addition, we provide, for the first time, full mitochondrial genome data as part of a metazoan species description for a holotype, as well as the neotype. This study represents the first successful implementation of the recently proposed concept of taxonomics, using data from high-throughput technologies for integrative taxonomic studies, allowing the highest level of confidence for both biodiversity and ecological research.
Epigenetic modifications, of which DNA methylation is the most stable, are a mechanism conveying environmental information to subsequent generations via parental germ lines. The paternal contribution to adaptive processes in the offspring might be crucial, but has been widely neglected in comparison to the maternal one. To address the paternal impact on the offspring's adaptability to changes in diet composition, we investigated if low protein diet (LPD) in F0 males caused epigenetic alterations in their subsequently sired sons. We therefore fed F0 male Wild guinea pigs with a diet lowered in protein content (LPD) and investigated DNA methylation in sons sired before and after their father's LPD treatment in both, liver and testis tissues. Our results point to a 'heritable epigenetic response' of the sons to the fathers' dietary change. Because we detected methylation changes also in the testis tissue, they are likely to be transmitted to the F2 generation. Gene-network analyses of differentially methylated genes in liver identified main metabolic pathways indicating a metabolic reprogramming ('metabolic shift'). Epigenetic mechanisms, allowing an immediate and inherited adaptation may thus be important for the survival of species in the context of a persistently changing environment, such as climate change.
Cells and organelles are not homogeneous but include microcompartments that alter the spatiotemporal characteristics of cellular processes. The effects of microcompartmentation on metabolic pathways are however difficult to study experimentally. The pyrenoid is a microcompartment that is essential for a carbon concentrating mechanism (CCM) that improves the photosynthetic performance of eukaryotic algae. Using Chlamydomonas reinhardtii, we obtained experimental data on photosynthesis, metabolites, and proteins in CCM-induced and CCM-suppressed cells. We then employed a computational strategy to estimate how fluxes through the Calvin-Benson cycle are compartmented between the pyrenoid and the stroma. Our model predicts that ribulose-1,5-bisphosphate (RuBP), the substrate of Rubisco, and 3-phosphoglycerate (3PGA), its product, diffuse in and out of the pyrenoid, respectively, with higher fluxes in CCM-induced cells. It also indicates that there is no major diffusional barrier to metabolic flux between the pyrenoid and stroma. Our computational approach represents a stepping stone to understanding microcompartmentalized CCM in other organisms.
DNA nanostructures enable the attachment of functional molecules to nearly any unique location on their underlying structure. Due to their single-base-pair structural resolution, several ligands can be spatially arranged and closely controlled according to the geometry of their desired target, resulting in optimized binding and/or signaling interactions. Here, the efficacy of SWL, an ephrin-mimicking peptide that binds specifically to EphrinA2 (EphA2) receptors, increased by presenting up to three of these peptides on small DNA nanostructures in an oligovalent manner. Ephrin signaling pathways play crucial roles in tumor development and progression. Moreover, Eph receptors are potential targets in cancer diagnosis and treatment. Here, the quantitative impact of SWL valency on binding, phosphorylation (key player for activation) and phenotype regulation in EphA2-expressing prostate cancer cells was demonstrated. EphA2 phosphorylation was significantly increased by DNA trimers carrying three SWL peptides compared to monovalent SWL. In comparison to one of EphA2’s natural ligands ephrin-A1, which is known to bind promiscuously to multiple receptors, pinpointed targeting of EphA2 by oligovalent DNA-SWL constructs showed enhanced cell retraction. Overall, we show that DNA scaffolds can increase the potency of weak signaling peptides through oligovalent presentation and serve as potential tools for examination of complex signaling pathways.
Plant X-tender
(2018)
Cloning multiple DNA fragments for delivery of several genes of interest into the plant genome is one of the main technological challenges in plant synthetic biology. Despite several modular assembly methods developed in recent years, the plant biotechnology community has not widely adopted them yet, probably due to the lack of appropriate vectors and software tools. Here we present Plant X-tender, an extension of the highly efficient, scarfree and sequence-independent multigene assembly strategy AssemblX,based on overlapdepended cloning methods and rare-cutting restriction enzymes. Plant X-tender consists of a set of plant expression vectors and the protocols for most efficient cloning into the novel vector set needed for plant expression and thus introduces advantages of AssemblX into plant synthetic biology. The novel vector set covers different backbones and selection markers to allow full design flexibility. We have included ccdB counterselection, thereby allowing the transfer of multigene constructs into the novel vector set in a straightforward and highly efficient way. Vectors are available as empty backbones and are fully flexible regarding the orientation of expression cassettes and addition of linkers between them, if required. We optimised the assembly and subcloning protocol by testing different scar-less assembly approaches: the noncommercial SLiCE and TAR methods and the commercial Gibson assembly and NEBuilder HiFi DNA assembly kits. Plant X-tender was applicable even in combination with low efficient homemade chemically competent or electrocompetent Escherichia coli. We have further validated the developed procedure for plant protein expression by cloning two cassettes into the newly developed vectors and subsequently transferred them to Nicotiana benthamiana in a transient expression setup. Thereby we show that multigene constructs can be delivered into plant cells in a streamlined and highly efficient way. Our results will support faster introduction of synthetic biology into plant science.
Global change threatens the maintenance of ecosystem functions that are shaped by the persistence and dynamics of populations. It has been shown that the persistence of species increases if they possess larger trait adaptability. Here, we investigate whether trait adaptability also affects the robustness of population dynamics of interacting species and thereby shapes the reliability of ecosystem functions that are driven by these dynamics. We model co‐adaptation in a predator–prey system as changes to predator offense and prey defense due to evolution or phenotypic plasticity. We investigate how trait adaptation affects the robustness of population dynamics against press perturbations to environmental parameters and against pulse perturbations targeting species abundances and their trait values. Robustness of population dynamics is characterized by resilience, elasticity, and resistance. In addition to employing established measures for resilience and elasticity against pulse perturbations (extinction probability and return time), we propose the warping distance as a new measure for resistance against press perturbations, which compares the shapes and amplitudes of pre‐ and post‐perturbation population dynamics. As expected, we find that the robustness of population dynamics depends on the speed of adaptation, but in nontrivial ways. Elasticity increases with speed of adaptation as the system returns more rapidly to the pre‐perturbation state. Resilience, in turn, is enhanced by intermediate speeds of adaptation, as here trait adaptation dampens biomass oscillations. The resistance of population dynamics strongly depends on the target of the press perturbation, preventing a simple relationship with the adaptation speed. In general, we find that low robustness often coincides with high amplitudes of population dynamics. Hence, amplitudes may indicate the robustness against perturbations also in other natural systems with similar dynamics. Our findings show that besides counteracting extinctions, trait adaptation indeed strongly affects the robustness of population dynamics against press and pulse perturbations.
The centrosome is not only the largest and most sophisticated protein complex within a eukaryotic cell, in the light of evolution, it is also one of its most ancient organelles. This special issue of "Cells" features representatives of three main, structurally divergent centrosome types, i.e., centriole-containing centrosomes, yeast spindle pole bodies (SPBs), and amoebozoan nucleus-associated bodies (NABs). Here, I discuss their evolution and their key-functions in microtubule organization, mitosis, and cytokinesis. Furthermore, I provide a brief history of centrosome research and highlight recently emerged topics, such as the role of centrioles in ciliogenesis, the relationship of centrosomes and centriolar satellites, the integration of centrosomal structures into the nuclear envelope and the involvement of centrosomal components in non-centrosomal microtubule organization.
Polar nuclear migration is crucial during the development of diverse eukaryotes. In plants, root hair growth requires polar nuclear migration into the outgrowing hair. However, knowledge about the dynamics and the regulatory mechanisms underlying nuclear movements in root epidermal cells remains limited. Here, we show that both auxin and Rho-of-Plant (ROP) signaling modulate polar nuclear position at the inner epidermal plasma membrane domain oriented to the cortical cells during cell elongation as well as subsequent polar nuclear movement to the outer domain into the emerging hair bulge in Arabidopsis (Arabidopsis thaliana). Auxin signaling via the nuclear AUXIN RESPONSE FACTOR7 (ARF7)/ARF19 and INDOLE ACETIC ACID7 pathway ensures correct nuclear placement toward the inner membrane domain. Moreover, precise inner nuclear placement relies on SPIKE1 Rho-GEF, SUPERCENTIPEDE1 Rho-GDI, and ACTIN7 (ACT7) function and to a lesser extent on VTI11 vacuolar SNARE activity. Strikingly, the directionality and/or velocity of outer polar nuclear migration into the hair outgrowth along actin strands also are ACT7 dependent, auxin sensitive, and regulated by ROP signaling. Thus, our findings provide a founding framework revealing auxin and ROP signaling of inner polar nuclear position with some contribution by vacuolar morphology and of actin-dependent outer polar nuclear migration in root epidermal hair cells.
Specialized glial subtypes provide support to developing and functioning neural networks. Astrocytes modulate information processing by neurotransmitter recycling and release of neuromodulatory substances, whereas ensheathing glial cells have not been associated with neuromodulatory functions yet. To decipher a possible role of ensheathing glia in neuronal information processing, we screened for glial genes required in the Drosophila central nervous system for normal locomotor behavior. Shopper encodes a mitochondrial sulfite oxidase that is specifically required in ensheathing glia to regulate head bending and peristalsis. shopper mutants show elevated sulfite levels affecting the glutamate homeostasis which then act on neuronal network function. Interestingly, human patients lacking the Shopper homolog SUOX develop neurological symptoms, including seizures. Given an enhanced expression of SUOX by oligodendrocytes, our findings might indicate that in both invertebrates and vertebrates more than one glial cell type may be involved in modulating neuronal activity.
The sequencing of the human genome in the early 2000s led to an increased interest in cheap and fast sequencing technologies. This interest culminated in the advent of next generation sequencing (NGS). A number of different NGS platforms have arisen since then all promising to do the same thing, i.e. produce large amounts of genetic information for relatively low costs compared to more traditional methods such as Sanger sequencing. The capabilities of NGS meant that researchers were no longer bound to species for which a lot of previous work had already been done (e.g. model organisms and humans) enabling a shift in research towards more novel and diverse species of interest. This capability has greatly benefitted many fields within the biological sciences, one of which being the field of evolutionary biology. Researchers have begun to move away from the study of laboratory model organisms to wild, natural populations and species which has greatly expanded our knowledge of evolution. NGS boasts a number of benefits over more traditional sequencing approaches. The main benefit comes from the capability to generate information for drastically more loci for a fraction of the cost. This is hugely beneficial to the study of wild animals as, even when large numbers of individuals are unobtainable, the amount of data produced still allows for accurate, reliable population and species level results from a small selection of individuals.
The use of NGS to study species for which little to no previous research has been carried out on and the production of novel evolutionary information and reference datasets for the greater scientific community were the focuses of this thesis. Two studies in this thesis focused on producing novel mitochondrial genomes from shotgun sequencing data through iterative mapping, bypassing the need for a close relative to serve as a reference sequence. These mitochondrial genomes were then used to infer species level relationships through phylogenetic analyses. The first of these studies involved reconstructing a complete mitochondrial genome of the bat eared fox (Otocyon megalotis). Phylogenetic analyses of the mitochondrial genome confidently placed the bat eared fox as sister to the clade consisting of the raccoon dog and true foxes within the canidae family. The next study also involved reconstructing a mitochondrial genome but in this case from the extinct Macrauchenia of South America. As this study utilised ancient DNA, it involved a lot of parameter testing, quality controls and strict thresholds to obtain a near complete mitochondrial genome devoid of contamination known to plague ancient DNA studies. Phylogenetic analyses confidently placed Macrauchenia as sister to all living representatives of Perissodactyla with a divergence time of ~66 million years ago. The third and final study of this thesis involved de novo assemblies of both nuclear and mitochondrial genomes from brown and striped hyena and focussed on demographic, genetic diversity and population genomic analyses within the brown hyena. Previous studies of the brown hyena hinted at very low levels of genomic diversity and, perhaps due to this, were unable to find any notable population structure across its range. By incorporating a large number of genetic loci, in the form of complete nuclear genomes, population structure within the brown hyena was uncovered. On top of this, genomic diversity levels were compared to a number of other species. Results showed the brown hyena to have the lowest genomic diversity out of all species included in the study which was perhaps caused by a continuous and ongoing decline in effective population size that started about one million years ago and dramatically accelerated towards the end of the Pleistocene.
The studies within this thesis show the power NGS sequencing has and its utility within evolutionary biology. The most notable capabilities outlined in this thesis involve the study of species for which no reference data is available and in the production of large amounts of data, providing evolutionary answers at the species and population level that data produced using more traditional techniques simply could not.
East Africa is a natural laboratory: Studying its unique geological and biological history can help us better inform our theories and models. Studying its present and future can help us protect its globally important biodiversity and ecosystem services. East African vegetation plays a central role in all these aspects, and this dissertation aims to quantify its dynamics through computer simulations.
Computer models help us recreate past settings, forecast into the future or conduct simulation experiments that we cannot otherwise perform in the field. But before all that, one needs to test their performance. The outputs that the model produced using the present day-inputs, agreed well with present-day observations of East African vegetation. Next, I simulated past vegetation for which we have fossil pollen data to compare. With computer models, we can fill the gaps of knowledge between sites where we have fossil pollen data from, and create a more complete picture of the past. Good level of agreement between model and pollen data where they overlapped in space further validated our model performance.
Once the model was tested and validated for the region, it became possible to probe one of the long standing questions regarding East African vegetation: How did East Africa lose its tropical forests? The present-day vegetation in the tropics is mainly characterized by continuous forests worldwide except in tropical East Africa, where forests only occur as patches. In a series of simulation experiments, I was able to show under which conditions these forest patches could have been connected and fragmented in the past. This study showed the sensitivity of East African vegetation to climate change and variability such as those expected under future climate change.
El Niño Southern Oscillation (ENSO) events that result from the fluctuations in temperature between the ocean and atmosphere, bring further variability to East African climate and are predicted to increase in intensity in the future. But climate models are still not good at capturing the pattens of these events. In a study where I quantified the influence of ENSO events on East African vegetation, I showed how different the future vegetation could be from what we currently predict with these climate models that lack accurate ENSO contribution. Consideration of these discrepancies is important for our future global carbon budget calculations and management decisions.
Systems biology aims at investigating biological systems in its entirety by gathering and analyzing large-scale data sets about the underlying components. Computational systems biology approaches use these large-scale data sets to create models at different scales and cellular levels. In addition, it is concerned with generating and testing hypotheses about biological processes. However, such approaches are inevitably leading to computational challenges due to the high dimensionality of the data and the differences in the dimension of data from different cellular layers.
This thesis focuses on the investigation and development of computational approaches to analyze metabolite profiles in the context of cellular networks. This leads to determining what aspects of the network functionality are reflected in the metabolite levels. With these methods at hand, this thesis aims to answer three questions: (1) how observability of biological systems is manifested in metabolite profiles and if it can be used for phenotypical comparisons; (2) how to identify couplings of reaction rates from metabolic profiles alone; and (3) which regulatory mechanism that affect metabolite levels can be distinguished by integrating transcriptomics and metabolomics read-outs.
I showed that sensor metabolites, identified by an approach from observability theory, are more correlated to each other than non-sensors. The greater correlations between sensor metabolites were detected both with publicly available metabolite profiles and synthetic data simulated from a medium-scale kinetic model. I demonstrated through robustness analysis that correlation was due to the position of the sensor metabolites in the network and persisted irrespectively of the experimental conditions. Sensor metabolites are therefore potential candidates for phenotypical comparisons between conditions through targeted metabolic analysis.
Furthermore, I demonstrated that the coupling of metabolic reaction rates can be investigated from a purely data-driven perspective, assuming that metabolic reactions can be described by mass action kinetics. Employing metabolite profiles from domesticated and wild wheat and tomato species, I showed that the process of domestication is associated with a loss of regulatory control on the level of reaction rate coupling. I also found that the same metabolic pathways in Arabidopsis thaliana and Escherichia coli exhibit differences in the number of reaction rate couplings.
I designed a novel method for the identification and categorization of transcriptional effects on metabolism by combining data on gene expression and metabolite levels. The approach determines the partial correlation of metabolites with control by the principal components of the transcript levels. The principle components contain the majority of the transcriptomic information allowing to partial out the effect of the transcriptional layer from the metabolite profiles. Depending whether the correlation between metabolites persists upon controlling for the effect of the transcriptional layer, the approach allows us to group metabolite pairs into being associated due to post-transcriptional or transcriptional regulation, respectively. I showed that the classification of metabolite pairs into those that are associated due to transcriptional or post-transcriptional regulation are in agreement with existing literature and findings from a Bayesian inference approach.
The approaches developed, implemented, and investigated in this thesis open novel ways to jointly study metabolomics and transcriptomics data as well as to place metabolic profiles in the network context. The results from these approaches have the potential to provide further insights into the regulatory machinery in a biological system.
Background
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.
Findings
In order to improve genome contiguity, we have developed Cross-Species Scaffolding—a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico.
Conclusions
We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.