570 Biowissenschaften; Biologie
Refine
Year of publication
Document Type
- Doctoral Thesis (17)
- Article (15)
- Postprint (4)
Is part of the Bibliography
- yes (36)
Keywords
- Arabidopsis thaliana (36) (remove)
Institute
Recent advances in gene function prediction rely on ensemble approaches that integrate results from multiple inference methods to produce superior predictions. Yet, these developments remain largely unexplored in plants. We have explored and compared two methods to integrate 10 gene co-function networks for Arabidopsis thaliana and demonstrate how the integration of these networks produces more accurate gene function predictions for a larger fraction of genes with unknown function. These predictions were used to identify genes involved in mitochondrial complex I formation, and for five of them, we confirmed the predictions experimentally. The ensemble predictions are provided as a user-friendly online database, EnsembleNet. The methods presented here demonstrate that ensemble gene function prediction is a powerful method to boost prediction performance, whereas the EnsembleNet database provides a cutting-edge community tool to guide experimentalists.
Due to global climate change providing food security for an increasing world population is a big challenge. Especially abiotic stressors have a strong negative effect on crop yield. To develop climate-adapted crops a comprehensive understanding of molecular alterations in the response of varying levels of environmental stresses is required. High throughput or ‘omics’ technologies can help to identify key-regulators and pathways of abiotic stress responses. In addition to obtain omics data also tools and statistical analyses need to be designed and evaluated to get reliable biological results.
To address these issues, I have conducted three different studies covering two omics technologies. In the first study, I used transcriptomic data from the two polymorphic Arabidopsis thaliana accessions, namely Col-0 and N14, to evaluate seven computational tools for their ability to map and quantify Illumina single-end reads. Between 92% and 99% of the reads were mapped against the reference sequence. The raw count distributions obtained from the different tools were highly correlated. Performing a differential gene expression analysis between plants exposed to 20 °C or 4°C (cold acclimation), a large pairwise overlap between the mappers was obtained. In the second study, I obtained transcript data from ten different Oryza sativa (rice) cultivars by PacBio Isoform sequencing that can capture full-length transcripts. De novo reference transcriptomes were reconstructed resulting in 38,900 to 54,500 high-quality isoforms per cultivar. Isoforms were collapsed to reduce sequence redundancy and evaluated, e.g. for protein completeness level (BUSCO), transcript length, and number of unique transcripts per gene loci. For the heat and drought tolerant aus cultivar N22, I identified around 650 unique and novel transcripts of which 56 were significantly differentially expressed in developing seeds during combined drought and heat stress. In the last study, I measured and analyzed the changes in metabolite profiles of eight rice cultivars exposed to high night temperature (HNT) stress and grown during the dry and wet season on the field in the Philippines. Season-specific changes in metabolite levels, as well as for agronomic parameters, were identified and metabolic pathways causing a yield decline at HNT conditions suggested.
In conclusion, the comparison of mapper performances can help plant scientists to decide on the right tool for their data. The de novo reconstruction of rice cultivars without a genome sequence provides a targeted, cost-efficient approach to identify novel genes responding to stress conditions for any organism. With the metabolomics approach for HNT stress in rice, I identified stress and season-specific metabolites which might be used as molecular markers for crop improvement in the future.
Functional analysis of selected DOF transcription factors in the model plant Arabidopsis thaliana
(2007)
Transcription factors (TFs) are global regulators of gene expression playing essential roles in almost all biological processes, and are therefore of great scientific and biotechnological interest. This project focused on functional characterisation of three DNA-binding-with-one-zinc-finger (DOF) TFs from the genetic model plant Arabidopsis thaliana, namely OBP1, OBP2 and AtDOF4;2. These genes were selected due to severe growth phenotypes conferred upon their constitutive over-expression. To identify biological processes regulated by OBP1, OBP2 and AtDOF4;2 in detail molecular and physiological characterization of transgenic plants with modified levels of OBP1, OBP2 and AtDOF4;2 expression (constitutive and inducible over-expression, RNAi) was performed using both targeted and profiling technologies. Additionally expression patterns of studied TFs and their target genes were analyzed using promoter-GUS lines and publicly available microarray data. Finally selected target genes were confirmed by chromatin immuno-precipitation and electrophoretic-mobility shift assays. This combinatorial approach revealed distinct biological functions of OBP1, OBP2 and AtDOF4;2. Specifically OBP2 controls indole glucosinolate / auxin homeostasis by directly regulating the enzyme at the branch of these pathways; CYP83B1 (Skirycz et al., 2006). Glucosinolates are secondary compounds important for defence against herbivores and pathogens in the plants order Caparales (e.g. Arabidopsis, canola and broccoli) whilst auxin is an essential plant hormone. Hence OBP2 is important for both response to biotic stress and plant growth. Similarly to OBP2 also AtDOF4;2 is involved in the regulation of plant secondary metabolism and affects production of various phenylpropanoid compounds in a tissue and environmental specific manner. It was found that under certain stress conditions AtDOF4;2 negatively regulates flavonoid biosynthetic genes whilst in certain tissues it activates hydroxycinnamic acid production. It was hypothesized that this dual function is most likely related to specific interactions with other proteins; perhaps other TFs (Skirycz et al., 2007). Finally OBP1 regulates both cell proliferation and cell expansion. It was shown that OBP1 controls cell cycle activity by directly targeting the expression of core cell cycle genes (CYCD3;3 and KRP7), other TFs and components of the replication machinery. Evidence for OBP1 mediated activation of cell cycle during embryogenesis and germination will be presented. Additionally and independently on its effects on cell proliferation OBP1 negatively affects cell expansion via reduced expression of cell wall loosening enzymes. Summing up this work provides an important input into our knowledge on DOF TFs function. Future work will concentrate on establishing exact regulatory networks of OBP1, OBP2 and AtDOF4;2 and their possible biotechnological applications.
Sucrose synthase (Susy) is a key enzyme of sucrose metabolism, catalysing the reversible conversion of sucrose and UDP to UDP-glucose and fructose. Therefore, its activity, localization and function have been studied in various plant species. It has been shown that Susy can play a role in supplying energy in companion cells for phloem loading (Fu and Park, 1995), provides substrates for starch synthesis (Zrenner et al., 1995), and supplies UDP-glucose for cell wall synthesis (Haigler et al., 2001). Analysis of the Arabidopsis genome identifies six Susy isoforms. The expression of these isoforms was investigated using promoter-reporter gene constructs (GUS) and real time RT-PCR. Although these isoforms are closely related at the protein level they have radically different spatial and temporal patterns of expression in the plant with no two isoforms showing the same distribution. More than one isoform is expressed in all organs examined. Some of them have high but specific expression in particular organs or developmental stages whilst others are constantly expressed throughout the whole plant and across various stages of development. The in planta function of the six Susy isoforms were explored through analysis of T-DNA insertion mutants and RNAi lines. Plants without the expression of individual isoforms show no differences in growth and development, and are not significantly different from wild type plants in soluble sugars, starch and cellulose contents under all growth conditions investigated. Analysis of T-DNA insertion mutant lacking Sus3 isoform that was exclusively expressed in stomata cells only had a minor influence on guard cell osmoregulation and/or bioenergetics. Although none of the sucrose synthases appear to be essential for normal growth under our standard growth conditions, they may be necessary for growth under stress conditions. Different isoforms of sucrose synthase respond differently to various abiotic stresses. It has been shown that oxygen deprivation up regulates Sus1 and Sus4 and increases total Susy activity. However, the analysis of the plants with reduced expression of both Sus1 and Sus4 revealed no obvious effects on plant performance under oxygen deprivation. Low temperature up regulates Sus1 expression but the loss of this isoform has no effect on the freezing tolerance of non acclimated and cold acclimated plants. These data provide a comprehensive overview of the expression of this gene family which supports some of the previously reported roles for Susy and indicates the involvement of specific isoforms in metabolism and/or signalling.
The GABI Primary Database, GabiPD (http:// www.gabipd.org/), was established in the frame of the German initiative for Genome Analysis of the Plant Biological System (GABI). The goal of GabiPD is to collect, integrate, analyze and visualize primary information from GABI projects. GabiPD constitutes a repository and analysis platform for a wide array of heterogeneous data from high-throughput experiments in several plant species. Data from different ‘omics’ fronts are incorporated (i.e. genomics, transcriptomics, proteomics and metabolomics), originating from 14 different model or crop species. We have developed the concept of GreenCards for textbased retrieval of all data types in GabiPD (e.g. clones, genes, mutant lines). All data types point to a central Gene GreenCard, where gene information is integrated from genome projects or NCBI UniGene sets. The centralized Gene GreenCard allows visualizing ESTs aligned to annotated transcripts as well as displaying identified protein domains and gene structure. Moreover, GabiPD makes available interactive genetic maps from potato and barley, and protein 2DE gels from Arabidopsis thaliana and Brassica napus. Gene expression and metabolic-profiling data can be visualized through MapManWeb. By the integration of complex data in a framework of existing knowledge, GabiPD provides new insights and allows for new interpretations of the data.
Each organ of a multicellular organism is unique at the level of its tissues and cells. Furthermore, responses to environmental stimuli or developmental signals occur differentially at the single cell or tissue level. This underlines the necessity of precise investigation of the “building block of life” -the individual cell. Although recently large amount of data concerning different aspects of single cell performance was accumulated, our knowledge about development and differentiation of individual cell within specialized tissue are still far from being complete. To get more insight into processes that occur in certain individual cell during its development and differentiation changes in gene expression during life cycle of A. thaliana leaf hair cell (trichome) were explored in this work. After onset of trichome development this cell changes its cell cycle: it starts endoreduplication (a modified cell cycle in which DNA replication continues in the absence of mitosis and cytokinesis). This makes trichomes a suitable model for studying cell cycle regulation, regulation of cell development and differentiation. Cells of interest were sampled by puncturing them with glass microcapillaries. Each sample contained as few as ten single cells. At first time trichomes in initial stage of trichome development were investigated. To allow their sampling they were specifically labelled by green fluorescent protein (GFP). In total three cell types were explored: pavement cells, trichome initials and mature trichomes. Comparison of gene expression profiles of these cells allowed identification of the genes differentially expressed in subsequent stages of trichome development. Bioinformatic analysis of genes preferentially expressed in trichome initials showed their involvement in hormonal, metal, sulphur response and cell-cycle regulation. Expression pattern of three selected candidate genes, involved in hormonal response and early developmental processes was confirmed by independent method. Effects of mutations in these genes on both trichome and plant development as well as on plant metabolism were analysed. As an outcome of this work novel components in the sophisticated machinery of trichome development and cell cycle progression were identified. These factors could integrate hormone stimuli and network interactions between characterized and as yet unknown members of this machinery. I expect findings presented in this work to enhance and complement our current knowledge about cell cycle progression and trichome development, as well as about performance of the individual cell in general.
Genomic and epigenomic determinants of heat stress-induced transcriptional memory in Arabidopsis
(2023)
Background
Transcriptional regulation is a key aspect of environmental stress responses. Heat stress induces transcriptional memory, i.e., sustained induction or enhanced re-induction of transcription, that allows plants to respond more efficiently to a recurrent HS. In light of more frequent temperature extremes due to climate change, improving heat tolerance in crop plants is an important breeding goal. However, not all heat stress-inducible genes show transcriptional memory, and it is unclear what distinguishes memory from non-memory genes. To address this issue and understand the genome and epigenome architecture of transcriptional memory after heat stress, we identify the global target genes of two key memory heat shock transcription factors, HSFA2 and HSFA3, using time course ChIP-seq.
Results
HSFA2 and HSFA3 show near identical binding patterns. In vitro and in vivo binding strength is highly correlated, indicating the importance of DNA sequence elements. In particular, genes with transcriptional memory are strongly enriched for a tripartite heat shock element, and are hallmarked by several features: low expression levels in the absence of heat stress, accessible chromatin environment, and heat stress-induced enrichment of H3K4 trimethylation. These results are confirmed by an orthogonal transcriptomic data set using both de novo clustering and an established definition of memory genes.
Conclusions
Our findings provide an integrated view of HSF-dependent transcriptional memory and shed light on its sequence and chromatin determinants, enabling the prediction and engineering of genes with transcriptional memory behavior.
Plants are the primary producers of biomass and thereby the basis of all life. Many varieties are cultivated, mainly to produce food, but to an increasing amount as a source of renewable energy. Because of the limited acreage available, further improvements of cultivated species both with respect to yield and composition are inevitable. One approach to further progress in developing improved plant cultivars is a systems biology oriented approach. This work aimed to investigate the primary metabolism of the model plant A.thaliana and its relation to plant growth using quantitative genetics methods. A special focus was set on the characterization of heterosis, the deviation of hybrids from their parental means for certain traits, on a metabolic level. More than 2000 samples of recombinant inbred lines (RILs) and introgression lines (ILs) developed from the two accessions Col-0 and C24 were analyzed for 181 metabolic traces using gas-chromatography/ mass-spectrometry (GC-MS). The observed variance allowed the detection of 157 metabolic quantitative trait loci (mQTL), genetic regions carrying genes, which are relevant for metabolite abundance. By analyzing several hundred test crosses of RILs and ILs it was further possible to identify 385 heterotic metabolic QTL (hmQTL). Within the scope of this work a robust method for large scale GC-MS analyses was developed. A highly significant canonical correlation between biomass and metabolic profiles (r = 0.73) was found. A comparable analysis of the results of the two independent experiments using RILs and ILs showed a large agreement. The confirmation rate for RIL QTL in ILs was 56 % and 23 % for mQTL and hmQTL respectively. Candidate genes from available databases could be identified for 67 % of the mQTL. To validate some of these candidates, eight genes were re-sequenced and in total 23 polymorphisms could be found. In the hybrids, heterosis is small for most metabolites (< 20%). Heterotic QTL gave rise to less candidate genes and a lower overlap between both populations than was determined for mQTL. This hints that regulatory loci and epistatic effects contribute to metabolite heterosis. The data described in this thesis present a rich source for further investigation and annotation of relevant genes and may pave the way towards a better understanding of plant biology on a system level.
Primary carbohydrate metabolism in plants includes several sugar and sugar-derivative transport processes. Over recent years, evidences have shown that in starch-related transport processes, in addition to glucose 6-phosphate, maltose, glucose and triose-phosphates, glucose 1-phosphate also plays a role and thereby increases the possible fluxes of sugar metabolites in planta. In this study, we report the characterization of two highly similar transporters, At1g34020 and At4g09810, in Arabidopsis thaliana, which allow the import of glucose 1-phosphate through the plasma membrane. Both transporters were expressed in yeast and were biochemically analyzed to reveal an antiport of glucose 1-phosphate/phosphate. Furthermore, we showed that the apoplast of Arabidopsis leaves contained glucose 1-phosphate and that the corresponding mutant of these transporters had higher glucose 1-phosphate amounts in the apoplast and alterations in starch and starch-related metabolism.
Für alle Organismen ist die Aufrechterhaltung ihres energetischen Gleichgewichts unter fluktuierenden Umweltbedingungen lebensnotwendig. In Eukaryoten steuern evolutionär konservierte Proteinkinasen, die in Pflanzen als SNF1-RELATED PROTEIN KINASE1 (SnRK1) bezeichnet werden, die Adaption an Stresssignale aus der Umwelt und an die Limitierung von Nährstoffen und zellulärer Energie. Die Aktivierung von SnRK1 bedingt eine umfangreiche transkriptionelle Umprogrammierung, die allgemein zu einer Repression energiekonsumierender Prozesse wie beispielsweise Zellteilung und Proteinbiosynthese und zu einer Induktion energieerzeugender, katabolischer Stoffwechselwege führt. Wie unterschiedliche Signale zu einer generellen sowie teilweise gewebe- und stressspezifischen SnRK1-vermittelten Antwort führen ist bisher noch nicht ausreichend geklärt, auch weil bislang nur wenige Komponenten der SnRK1-Signaltransduktion identifiziert wurden. In dieser Arbeit konnte ein Protein-Protein-Interaktionsnetzwerk um die SnRK1αUntereinheiten aus Arabidopsis AKIN10/AKIN11 etabliert werden. Dadurch wurden zunächst Mitglieder der pflanzenspezifischen DUF581-Proteinfamilie als Interaktionspartner der SnRK1α-Untereinheiten identifiziert. Diese Proteine sind über ihre konservierte DUF581Domäne, in der ein Zinkfinger-Motiv lokalisiert ist, fähig mit AKIN10/AKIN11 zu interagieren. In planta Ko-Expressionsanalysen zeigten, dass die DUF581-Proteine eine Verschiebung der nucleo-cytoplasmatischen Lokalisierung von AKIN10 hin zu einer nahezu ausschließlichen zellkernspezifischen Lokalisierung begünstigen sowie die Ko-Lokalisierung von AKIN10 und DUF581-Proteinen im Nucleus. In Bimolekularen Fluoreszenzkomplementations-Analysen konnte die zellkernspezifische Interaktion von DUF581-Proteinen mit SnRK1α-Untereinheiten in planta bestätigt werden. Außerhalb der DUF581-Domäne weisen die Proteine einander keine große Sequenzähnlichkeit auf. Aufgrund ihrer Fähigkeit mit SnRK1 zu interagieren, dem Fehlen von SnRK1Phosphorylierungsmotiven sowie ihrer untereinander sehr variabler gewebs-, entwicklungs- und stimulusspezifischer Expression wurde für DUF581-Proteine eine Funktion als Adaptoren postuliert, die unter bestimmten physiologischen Bedingungen spezifische Substratproteine in den SnRK1-Komplex rekrutieren. Auf diese Weise könnten DUF581Proteine die Interaktion von SnRK1 mit deren Zielproteinen modifizieren und eine Feinjustierung der SnRK1-Signalweiterleitung ermöglichen. Durch weiterführende Interaktionsstudien konnten DUF581-interagierende Proteine darunter Transkriptionsfaktoren, Proteinkinasen sowie regulatorische Proteine gefunden werden, die teilweise ebenfalls Wechselwirkungen mit SnRK1α-Untereinheiten aufzeigten. Im Rahmen dieser Arbeit wurde eines dieser Proteine für das eine Beteiligung an der SnRK1Signalweiterleitung als Transkriptionsregulator vermutet wurde näher charakterisiert. STKR1 (STOREKEEPER RELATED 1), ein spezifischer Interaktionspartner von DUF581-18, gehört zu einer pflanzenspezifischen Leucin-Zipper-Transkriptionsfaktorfamilie und interagiert in Hefe sowie in planta mit SnRK1. Die zellkernspezifische Interaktion von STKR1 und AKIN10 in Pflanzen unterstützt die Vermutung der kooperativen Regulation von Zielgenen. Weiterhin stabilisierte die Anwesenheit von AKIN10 die Proteingehalte von STKR1, das wahrscheinlich über das 26S Proteasom abgebaut wird. Da es sich bei STKR1 um ein Phosphoprotein mit SnRK1-Phosphorylierungsmotiv handelt, stellt es sehr wahrscheinlich ein SnRK1-Substrat dar. Allerdings konnte eine SnRK1-vermittelte Phosphorylierung von STKR1 in dieser Arbeit nicht gezeigt werden. Der Verlust von einer Phosphorylierungsstelle beeinflusste die Homo- und Heterodimerisierungsfähigkeit von STKR1 in Hefeinteraktionsstudien, wodurch eine erhöhte Spezifität der Zielgenregulation ermöglicht werden könnte. Außerdem wurden Arabidopsis-Pflanzen mit einer veränderten STKR1-Expression phänotypisch, physiologisch und molekularbiologisch charakterisiert. Während der Verlust der STKR1-Expression zu Pflanzen führte, die sich kaum von Wildtyp-Pflanzen unterschieden, bedingte die konstitutive Überexpression von STKR1 ein stark vermindertes Pflanzenwachstum sowie Entwicklungsverzögerungen hinsichtlich der Blühinduktion und Seneszenz ähnlich wie sie auch bei SnRK1α-Überexpression beschrieben wurden. Pflanzen dieser Linien waren nicht in der Lage Anthocyane zu akkumulieren und enthielten geringere Gehalte an Chlorophyll und Carotinoiden. Neben einem erhöhten nächtlichen Stärkeumsatz waren die Pflanzen durch geringere Saccharosegehalte im Vergleich zum Wildtyp gekennzeichnet. Eine Transkriptomanalyse ergab, dass in den STKR1-überexprimierenden Pflanzen unter Energiemangelbedingungen, hervorgerufen durch eine verlängerte Dunkelphase, eine größere Anzahl an Genen im Vergleich zum Wildtyp differentiell reguliert war als während der Lichtphase. Dies spricht für eine Beteiligung von STKR1 an Prozessen, die während der verlängerten Dunkelphase aktiv sind. Ein solcher ist beispielsweise die SnRK1-Signaltransduktion, die unter energetischem Stress aktiviert wird. Die STKR1Überexpression führte zudem zu einer verstärkten transkriptionellen Induktion von Abwehrassoziierten Genen sowie NAC- und WRKY-Transkriptionsfaktoren nach verlängerter Dunkelphase. Die Transkriptomdaten deuteten auf eine stimulusunabhängige Induktion von Abwehrprozessen hin und konnten eine Erklärung für die phänotypischen und physiologischen Auffälligkeiten der STKR1-Überexprimierer liefern.