Refine
Has Fulltext
- no (11)
Document Type
- Article (11)
Language
- English (11)
Is part of the Bibliography
- yes (11)
Keywords
- clustering (2)
- random matrix theory (2)
- transcription factor (2)
- Arabidopsis thaliana (1)
- FAIRE-qPCR (1)
- FAIRE-seq (1)
- Solanum lycopersicum (1)
- chromatin (1)
- cis-regulatory elements (1)
- development (1)
Institute
Birth weight variation is influenced by fetal and maternal genetic and non-genetic factors, and has been reproducibly associated with future cardio-metabolic health outcomes. In expanded genome-wide association analyses of own birth weight (n = 321,223) and offspring birth weight (n = 230,069 mothers), we identified 190 independent association signals (129 of which are novel). We used structural equation modeling to decompose the contributions of direct fetal and indirect maternal genetic effects, then applied Mendelian randomization to illuminate causal pathways. For example, both indirect maternal and direct fetal genetic effects drive the observational relationship between lower birth weight and higher later blood pressure: maternal blood pressure-raising alleles reduce offspring birth weight, but only direct fetal effects of these alleles, once inherited, increase later offspring blood pressure. Using maternal birth weight-lowering genotypes to proxy for an adverse intrauterine environment provided no evidence that it causally raises offspring blood pressure, indicating that the inverse birth weight-blood pressure association is attributable to genetic effects, and not to intrauterine programming.
Reproducibility is a defining feature of science, but the extent to which it characterizes current research is unknown. We conducted replications of 100 experimental and correlational studies published in three psychology journals using high-powered designs and original materials when available. Replication effects were half the magnitude of original effects, representing a substantial decline. Ninety-seven percent of original studies had statistically significant results. Thirty-six percent of replications had statistically significant results; 47% of original effect sizes were in the 95% confidence interval of the replication effect size; 39% of effects were subjectively rated to have replicated the original result; and if no bias in original results is assumed, combining original and replication results left 68% with statistically significant effects. Correlational tests suggest that replication success was better predicted by the strength of original evidence than by characteristics of the original and replication teams.
Clustering of gene expression data is often done with the latent aim of dimension reduction, by finding groups of genes that have a common response to potentially unknown stimuli. However, what is poorly understood to date is the behaviour of a low dimensional signal embedded in high dimensions. This paper introduces a multicollinear model which is based on random matrix theory results, and shows potential for the characterisation of a gene cluster's correlation matrix. This model projects a one dimensional signal into many dimensions and is based on the spiked covariance model, but rather characterises the behaviour of the corresponding correlation matrix. The eigenspectrum of the correlation matrix is empirically examined by simulation, under the addition of noise to the original signal. The simulation results are then used to propose a dimension estimation procedure of clusters from data. Moreover, the simulation results warn against considering pairwise correlations in isolation, as the model provides a mechanism whereby a pair of genes with 'low' correlation may simply be due to the interaction of high dimension and noise. Instead, collective information about all the variables is given by the eigenspectrum.
Random matrix theory (RMT) is well suited to describing the emergent properties of systems with complex interactions amongst their constituents through their eigenvalue spectrums. Some RMT results are applied to the problem of clustering high dimensional biological data with complex dependence structure amongst the variables. It will be shown that a gene relevance or correlation network can be constructed by choosing a correlation threshold in a principled way, such that it corresponds to a block diagonal structure in the correlation matrix, if such a structure exists. The structure is then found using community detection algorithms, but with parameter choice guided by RMT predictions. The resulting clustering is compared to a variety of hierarchical clustering outputs and is found to the most generalised result, in that it captures all the features found by the other considered methods.
Distinct roles of the last transmembrane domain in controlling Arabidopsis K+ channel activity
(2009)
The family of voltage-gated potassium channels in plants presumably evolved from a common ancestor and includes both inward-rectifying (K-in) channels that allow plant cells to accumulate K+ and outward-rectifying (K-out) channels that mediate K+ efflux. Despite their close structural similarities, the activity of Kin channels is largely independent of K+ and depends only on the transmembrane voltage, whereas that of K-out channels responds to the membrane voltage and the prevailing extracellular K+ concentration. Gating of potassium channels is achieved by structural rearrangements within the last transmembrane domain (S6). Here we investigated the functional equivalence of the S6 helices of the Kin channel KAT1 and the K-out channel SKOR by domain-swapping and site-directed mutagenesis. Channel mutants and chimeras were analyzed after expression in Xenopus oocytes. We identified two discrete regions that influence gating differently in both channels, demonstrating a lack of functional complementarity between KAT1 and SKOR. Our findings are supported by molecular models of KAT1 and SKOR in the open and closed states. The role of the S6 segment in gating evolved differently during specialization of the two channel subclasses, posing an obstacle for the transfer of the K+-sensor from K-out to K-in channels.
To gain a deeper understanding of the mechanisms behind biomass accumulation, it is important to study plant growth behavior. Manually phenotyping large sets of plants requires important human resources and expertise and is typically not feasible for detection of weak growth phenotypes. Here, we established an automated growth phenotyping pipeline for Arabidopsis thaliana to aid researchers in comparing growth behaviors of different genotypes.
The analysis pipeline includes automated image analysis of two-dimensional digital plant images and evaluation of manually annotated information of growth stages. It employs linear mixed-effects models to quantify genotype effects on total rosette area and relative leaf growth rate (RLGR) and ANOVAs to quantify effects on developmental times.
Using the system, a single researcher can phenotype up to 7000 plants d(-1). Technical variance is very low (typically < 2%). We show quantitative results for the growth-impaired starch-excessmutant sex4-3 and the growth-enhancedmutant grf9.
We show that recordings of environmental and developmental variables reduce noise levels in the phenotyping datasets significantly and that careful examination of predictor variables (such as d after sowing or germination) is crucial to avoid exaggerations of recorded phenotypes and thus biased conclusions.
The control of gene expression by transcriptional regulators and other types of functionally relevant DNA transactions such as chromatin remodeling and replication underlie a vast spectrum of biological processes in all organisms. DNA transactions require the controlled interaction of proteins with DNA sequence motifs which are often located in nucleosome-depleted regions (NDRs) of the chromatin. Formaldehyde-assisted isolation of regulatory elements (FAIRE) has been established as an easy-to-implement method for the isolation of NDRs from a number of eukaryotic organisms, and it has been successfully employed for the discovery of new regulatory segments in genomic DNA from, for example, yeast, Drosophila, and humans. Until today, however, FAIRE has only rarely been employed in plant research and currently no detailed FAIRE protocol for plants has been published. Here, we provide a step-by-step FAIRE protocol for NDR discovery in Arabidopsis thaliana. We demonstrate that NDRs isolated from plant chromatin are readily amenable to quantitative polymerase chain reaction and next-generation sequencing. Only minor modification of the FAIRE protocol will be needed to adapt it to other plants, thus facilitating the global inventory of regulatory regions across species.
The unicellular green alga Chlamydomonas reinhardtii is a long-established model organism for studies on photosynthesis and carbon metabolism-related physiology. Under conditions of air-level carbon dioxide concentration [CO2], a carbon concentrating mechanism (CCM) is induced to facilitate cellular carbon uptake. CCM increases the availability of carbon dioxide at the site of cellular carbon fixation. To improve our understanding of the transcriptional control of the CCM, we employed FAIRE-seq (formaldehyde-assisted Isolation of Regulatory Elements, followed by deep sequencing) to determine nucleosome-depleted chromatin regions of algal cells subjected to carbon deprivation. Our FAIRE data recapitulated the positions of known regulatory elements in the promoter of the periplasmic carbonic anhydrase (Cah1) gene, which is upregulated during CCM induction, and revealed new candidate regulatory elements at a genome-wide scale. In addition, time series expression patterns of 130 transcription factor (TF) and transcription regulator (TR) genes were obtained for cells cultured under photoautotrophic condition and subjected to a shift from high to low [CO2]. Groups of co-expressed genes were identified and a putative directed gene-regulatory network underlying the CCM was reconstructed from the gene expression data using the recently developed IOTA (inner composition alignment) method. Among the candidate regulatory genes, two members of the MYB-related TF family, Lcr1 (Low-CO2 response regulator 1) and Lcr2 (Low-CO2 response regulator 2), may play an important role in down-regulating the expression of a particular set of TF and TR genes in response to low [CO2]. The results obtained provide new insights into the transcriptional control of the CCM and revealed more than 60 new candidate regulatory genes. Deep sequencing of nucleosome-depleted genomic regions indicated the presence of new, previously unknown regulatory elements in the C. reinhardtii genome. Our work can serve as a basis for future functional studies of transcriptional regulator genes and genomic regulatory elements in Chlamydomonas.
Maturation of fleshy fruits such as tomato (Solanum lycopersicum) is subject to tight genetic control. Here we describe the development of a quantitative real-time PCR platform that allows accurate quantification of the expression level of approximately 1000 tomato transcription factors. In addition to utilizing this novel approach, we performed cDNA microarray analysis and metabolite profiling of primary and secondary metabolites using GC-MS and LC-MS, respectively. We applied these platforms to pericarp material harvested throughout fruit development, studying both wild-type Solanum lycopersicum cv. Ailsa Craig and the hp1 mutant. This mutant is functionally deficient in the tomato homologue of the negative regulator of the light signal transduction gene DDB1 from Arabidopsis, and is furthermore characterized by dramatically increased pigment and phenolic contents. We choose this particular mutant as it had previously been shown to have dramatic alterations in the content of several important fruit metabolites but relatively little impact on other ripening phenotypes. The combined dataset was mined in order to identify metabolites that were under the control of these transcription factors, and, where possible, the respective transcriptional regulation underlying this control. The results are discussed in terms of both programmed fruit ripening and development and the transcriptional and metabolic shifts that occur in parallel during these processes.
We investigated the systems response of metabolism and growth after an increase in irradiance in the nonsaturating range in the algal model Chlamydomonas reinhardtii. In a three-step process, photosynthesis and the levels of metabolites increased immediately, growth increased after 10 to 15 min, and transcript and protein abundance responded by 40 and 120 to 240 min, respectively. In the first phase, starch and metabolites provided a transient buffer for carbon until growth increased. This uncouples photosynthesis from growth in a fluctuating light environment. In the first and second phases, rising metabolite levels and increased polysome loading drove an increase in fluxes. Most Calvin-Benson cycle (CBC) enzymes were substrate-limited in vivo, and strikingly, many were present at higher concentrations than their substrates, explaining how rising metabolite levels stimulate CBC flux. Rubisco, fructose-1,6-biosphosphatase, and seduheptulose-1,7-bisphosphatase were close to substrate saturation in vivo, and flux was increased by posttranslational activation. In the third phase, changes in abundance of particular proteins, including increases in plastidial ATP synthase and some CBC enzymes, relieved potential bottlenecks and readjusted protein allocation between different processes. Despite reasonable overall agreement between changes in transcript and protein abundance (R-2 = 0.24), many proteins, including those in photosynthesis, changed independently of transcript abundance.