A flexible approach to assess fluorescence decay functions in complex energy transfer systems
(2015)
Background: Time-correlated Förster resonance energy transfer (FRET) probes molecular distances with greater accuracy than intensity-based calculation of FRET efficiency and provides a powerful tool to study biomolecular structure and dynamics. Moreover, time-correlated photon count measurements carry additional information on the variety of donor surroundings, allowing more detailed differentiation between distinct structural geometries, which are typically inaccessible to general fitting solutions.
Results: Here we develop a new approach based on Monte Carlo simulations of time-correlated FRET events to estimate the time-correlated single photon counting (TCSPC) histograms in complex systems. This simulation solution assesses the full statistics of time-correlated photon counts and distance distributions of fluorescently labeled biomolecules. The simulations are consistent with the theoretical predictions of the dye behavior in FRET systems with defined dye distances and with measurements of randomly distributed dye solutions. We validate the simulation results using a highly heterogeneous aggregation system and explore the conditions under which this tool can be used in complex systems.
Conclusion: This approach is powerful in distinguishing distance distributions in a wide variety of experimental setups, thus providing a versatile tool to accurately distinguish between different structural assemblies in highly complex systems.
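As an illustration of the general idea only (not the authors' implementation), a Monte Carlo estimate of a donor TCSPC histogram can be sketched as follows. The sketch assumes a donor with unit quantum yield, a Gaussian donor-acceptor distance distribution, and the standard Förster transfer rate k_T = (1/tau_D)(R0/r)^6; the function name and all parameter values are hypothetical.

```python
import random

def simulate_donor_decay(n_excitations, tau_d, r0, mean_r, sd_r,
                         bin_width, n_bins, seed=0):
    """Toy Monte Carlo of donor photon arrival times in a FRET pair.

    Assumptions (not from the source): unit donor quantum yield,
    Gaussian distance distribution, one acceptor per donor.
    Times are in ns, distances in nm.
    """
    rng = random.Random(seed)
    k_d = 1.0 / tau_d                      # donor decay rate without acceptor
    histogram = [0] * n_bins
    for _ in range(n_excitations):
        # Draw a donor-acceptor distance; redraw non-physical values.
        r = rng.gauss(mean_r, sd_r)
        while r <= 0:
            r = rng.gauss(mean_r, sd_r)
        k_t = k_d * (r0 / r) ** 6          # Förster transfer rate
        k_total = k_d + k_t
        # The excited state ends by donor emission with probability k_d / k_total;
        # otherwise the energy goes to the acceptor and no donor photon is counted.
        if rng.random() >= k_d / k_total:
            continue
        t = rng.expovariate(k_total)       # waiting time until the excited state decays
        b = int(t / bin_width)
        if b < n_bins:
            histogram[b] += 1
    return histogram

if __name__ == "__main__":
    hist = simulate_donor_decay(20000, tau_d=4.0, r0=5.0, mean_r=5.0,
                                sd_r=0.5, bin_width=0.1, n_bins=500)
    print(sum(hist), "donor photons detected; first bins:", hist[:5])
```

Broadening `sd_r` or mixing several distance populations changes the shape of the histogram, which is the kind of information a single-exponential fit would miss.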
Cells contain a finite set of resources that must be distributed across many processes to ensure survival. Among them, the largest proportion of cellular resources is dedicated to protein translation. Synthetic biology often exploits these resources in executing orthogonal genetic circuits, yet the burden this places on the cell is rarely considered. Here, we develop a minimal model of ribosome allocation dynamics capturing the demands on translation when expressing a synthetic construct together with endogenous genes required for the maintenance of cell physiology. Critically, it contains three key variables related to design parameters of the synthetic construct covering transcript abundance, translation initiation rate, and elongation time. We show that model-predicted changes in ribosome allocation closely match experimental shifts in synthetic protein expression rate and cellular growth. Intriguingly, the model is also able to accurately infer transcript levels and translation times after further exposure to additional ambient stress. Our results demonstrate that a simple model of resource allocation faithfully captures the redistribution of protein synthesis resources when faced with the burden of synthetic gene expression and environmental stress. The tractable nature of the model makes it a versatile tool for exploring the guiding principles of efficient heterologous expression and the indirect interactions that can arise between synthetic circuits and their host chassis because of competition for shared translational resources.
Synonymous single nucleotide polymorphisms (sSNPs) are considered neutral for protein function, as by definition they exchange only codons, not amino acids. We identified an sSNP that modifies the local translation speed of the cystic fibrosis transmembrane conductance regulator (CFTR), leading to detrimental changes to protein stability and function. This sSNP introduces a codon pairing to a low-abundance tRNA that is particularly rare in human bronchial epithelia, but not in other human tissues, suggesting tissue-specific effects of this sSNP. Up-regulation of the tRNA cognate to the mutated codon counteracts the effects of the sSNP and rescues protein conformation and function. Our results highlight the wide-ranging impact of sSNPs, which invert the programmed local speed of mRNA translation and provide direct evidence for the central role of cellular tRNA levels in mediating the actions of sSNPs in a tissue-specific manner.
Maintenance of triplet decoding is crucial for the expression of functional protein because deviations into either the -1 or +1 reading frame often yield non-functional products. We report here that expression of huntingtin (Htt) exon 1 with expanded CAG repeats, implicated in Huntington pathology, undergoes a sporadic +1 frameshift to generate from the CAG repeat a trans-frame AGC repeat-encoded product. This +1 recoding is exclusively detected in pathological Htt variants, i.e. those with expanded repeats with more than 35 consecutive CAG codons. An atypical +1 shift site, UUC C at the 5' end of the CAG repeats, which has some resemblance to the influenza A virus shift site, triggers the +1 frameshifting, which is enhanced by the increased propensity of the expanded CAG repeats to form a stem-loop structure. The +1 trans-frame-encoded product can directly influence the aggregation of the parental Htt exon 1.
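The effect of a +1 shift on a CAG repeat can be shown with a few lines of code. This is a minimal sketch: the codon table holds only the three codons that occur in a pure CAG repeat (standard genetic code: CAG = Gln, AGC = Ser, GCA = Ala), and the function name is hypothetical.

```python
# Only the codons that can arise from reading a pure CAG repeat in any frame.
CODON_TABLE = {"CAG": "Q", "AGC": "S", "GCA": "A"}

def translate(mrna, offset=0):
    """Translate complete codons of `mrna`, starting `offset` nucleotides in."""
    peptide = []
    for i in range(offset, len(mrna) - 2, 3):
        peptide.append(CODON_TABLE[mrna[i:i + 3]])
    return "".join(peptide)

repeat = "CAG" * 6
print(translate(repeat, 0))  # in-frame reading: polyglutamine
print(translate(repeat, 1))  # +1 frame reads AGC codons: polyserine
```

Skipping one nucleotide turns every subsequent CAG into AGC, so the repeat that encodes glutamine in frame encodes serine in the +1 frame.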
The disease risk and age of onset of Huntington disease (HD) and nine other repeat disorders strongly depend on the expansion of CAG repeats encoding consecutive polyglutamines (polyQ) in the corresponding disease protein. PolyQ length-dependent misfolding and aggregation are the hallmarks of CAG pathologies. Despite intense effort, the overall structure of these aggregates remains poorly understood. Here, we used sensitive time-dependent fluorescent decay measurements to assess the architecture of mature fibrils of huntingtin (Htt) exon 1 implicated in HD pathology. Varying the position of the fluorescent labels in the Htt monomer with expanded 51Q (Htt51Q) and using structural models of putative fibril structures, we generated distance distributions between donors and acceptors covering all possible distances between the monomers or monomer dimensions within the polyQ amyloid fibril. Using Monte Carlo simulations, we systematically scanned all possible monomer conformations that fit the experimentally measured decay times. Monomers with four-stranded 51Q stretches organized into five-layered beta-sheets with alternating N termini of the monomers perpendicular to the fibril axis gave the best fit to our data. Alternatively, the core structure of the polyQ fibrils might also be a zipper layer with antiparallel four-stranded stretches as this structure showed the next best fit. All other remaining arrangements are clearly excluded by the data. Furthermore, the assessed dimensions of the polyQ stretch of each monomer provide structural evidence for the observed polyQ length threshold in HD pathology. Our approach can be used to validate the effect of pharmacological substances that inhibit or alter amyloid growth and structure.
Assessing polyglutamine conformation in the nucleating event by molecular dynamics simulations
(2012)
Polyglutamine (polyQ) diseases comprise a group of dominantly inherited pathologies caused by an expansion of an unstable polyQ stretch which is presumed to form beta-sheets. Similar to other amyloid pathologies, polyQ amyloidogenesis occurs via a nucleated polymerization mechanism and proceeds through an energetically unfavorable nucleus whose existence and structure are difficult to detect. Here, we use atomistic molecular dynamics simulations in explicit solvent to assess the conformation of the polyQ stretch in the nucleus that initiates polyQ fibrillization. Comparison of the kinetic stability of various structures of a polyQ peptide with a Q-length in the pathological range (Q(40)) revealed that steric zipper or nanotube-like structures (beta-nanotube or beta-pseudohelix) are not kinetically stable enough to serve as a template to initiate polyQ fibrillization, as opposed to beta-hairpin-based (beta-sheet and beta-sheet stack) or alpha-helical conformations. The selection of different structures of the polyQ stretch in the aggregation-initiating event may provide an alternative explanation for polyQ aggregate polymorphism.
Environmental stress is detrimental to cell viability and requires an adequate reprogramming of cellular activities to maximize cell survival. We present a global analysis of the response of Escherichia coli to acute heat and osmotic stress. We combine deep sequencing of total mRNA and ribosome-protected fragments to provide a genome-wide map of the stress response at transcriptional and translational levels. For each type of stress, we observe a unique subset of genes that shape the stress-specific response. Upon temperature upshift, mRNAs with reduced folding stability up- and downstream of the start codon, and thus with more accessible initiation regions, are translationally favoured. Conversely, osmotic upshift causes a global reduction of highly translated transcripts with high copy numbers, allowing reallocation of translation resources to undegraded and newly synthesized mRNAs.
The enzyme penicillin G acylase (EC 3.5.1.11) catalyzes amide-bond cleavage in benzylpenicillin (penicillin G) to yield 6-aminopenicillanic acid, an intermediate chemical used in the production of semisynthetic penicillins. A thermostable penicillin G acylase from Alcaligenes faecalis (AfPGA) has been crystallized using the hanging-drop vapour-diffusion method in two different space groups: C222₁, with unit-cell parameters a = 72.9, b = 86.0, c = 260.2 Å, and P4₁2₁2, with unit-cell parameters a = b = 85.6, c = 298.8 Å. Data were collected at 293 K and the structure was determined using the molecular-replacement method. Like other penicillin acylases, AfPGA belongs to the N-terminal nucleophilic hydrolase superfamily, has undergone post-translational processing and has a serine as the N-terminal residue of the beta-chain. A disulfide bridge has been identified in the structure that was not found in the other two known penicillin G acylase structures. The presence of the disulfide bridge is perceived to be one factor that confers higher stability to this enzyme.
Huntington disease (HD), a dominantly inherited neurodegenerative disorder caused by the expansion of a CAG-encoded polyglutamine (polyQ) repeat in huntingtin (Htt), displays a highly heterogeneous etiopathology and disease onset. Here, we show that the translation of expanded CAG repeats in mutant Htt exon 1 leads to a depletion of charged glutaminyl-transfer RNA (tRNA) Gln-CUG that pairs exclusively to the CAG codon. This results in translational frameshifting and the generation of various transframe-encoded species that differently modulate the conformational switch to nucleate fibrillization of the parental polyQ protein. Intriguingly, the frameshifting frequency varies strongly among different cell lines and is higher in cells with intrinsically lower concentrations of tRNA Gln-CUG. The concentration of tRNA Gln-CUG also differs among different brain areas in the mouse. We propose that translational frameshifting may act as a significant disease modifier that contributes to the cell-selective neurotoxicity and disease course heterogeneity of HD on both cellular and individual levels.
In eukaryotes, the transcription of tRNA genes is initiated by the concerted action of transcription factors IIIC (TFIIIC) and IIIB (TFIIIB) which direct the recruitment of polymerase III. While TFIIIC recognizes highly conserved, intragenic promoter elements, TFIIIB binds to the non-coding 5'-upstream regions of the tRNA genes. Using a systematic bioinformatic analysis of 11 multicellular eukaryotic genomes we identified a highly conserved TATA motif followed by a CAA-motif in the tRNA upstream regions of all plant genomes. Strikingly, the 5'-flanking tRNA regions of the animal genomes are highly heterogeneous and lack a common conserved sequence signature. Interestingly, in the animal genomes the tRNA species that read the same codon share conserved motifs in their upstream regions. Deep-sequencing analysis of 16 human tissues revealed multiple splicing variants of two of the TFIIIB subunits, Bdp1 and Brf1, with tissue-specific expression patterns. These multiple forms most likely modulate the TFIIIB-DNA interactions and explain the lack of a uniform signature motif in the tRNA upstream regions of animal genomes. The anticodon-dependent 5'-flanking motifs provide a possible mechanism for independent regulation of the tRNA transcription in various human tissues.
Translation is a central cellular process and is optimized for speed and fidelity. The speed of translation of a single codon depends on the concentration of aminoacyl-tRNAs. Here, we used microarray-based approaches to analyze the charging levels of tRNAs in Escherichia coli growing at different growth rates. Strikingly, we observed a non-uniform aminoacylation of tRNAs in complex media. In contrast, in minimal medium, the level of aminoacyl-tRNAs is more uniform and rises to approximately 60%. In particular, the charging levels of tRNA(Ser), tRNA(Cys), tRNA(Thr) and tRNA(His) are below 50% in complex medium, and their aminoacylation levels mirror the degree to which the corresponding amino acids inhibit growth when individually added to minimal medium. Serine is among the most toxic amino acids for bacteria, and tRNAs(Ser) exhibit the lowest charging levels, below 10%, at high growth rate, although intracellular serine is plentiful. As a result, some serine codons are among the most slowly translated codons. A large fraction of the serine is most likely degraded by L-serine-deaminase, which competes with the seryl-tRNA-synthetase that charges the tRNAs(Ser). These results indicate that the level of aminoacylation in complex media might reflect a competition between charging for translation and degradation of amino acids that inhibit growth.
The genetic code is degenerate; thus, protein evolution does not uniquely determine the coding sequence. One of the puzzles in evolutionary genetics is therefore to uncover evolutionary driving forces that result in specific codon choice. In many bacteria, the first 5-10 codons of protein-coding genes are often codons that are less frequently used in the rest of the genome, an effect that has been argued to arise from selection for slowed early elongation to reduce ribosome traffic jams. However, genome analysis across many species has demonstrated that the region shows reduced mRNA folding consistent with pressure for efficient translation initiation. This raises the possibility that unusual codon usage is a side effect of selection for reduced mRNA structure. Here we discriminate between these two competing hypotheses, and show that in bacteria selection favours codons that reduce mRNA folding around the translation start, regardless of whether these codons are frequent or rare. Experiments confirm that primarily mRNA structure, and not codon usage, at the beginning of genes determines the translation rate.
tRNAs, nexus molecules between mRNAs and proteins, have a central role in translation. Recent discoveries have revealed unprecedented complexity of tRNA biosynthesis, modification patterns, regulation and function. In this Review, we present emerging concepts regarding how tRNA abundance is dynamically regulated and how tRNAs (and their nucleolytic fragments) are centrally involved in stress signalling and adaptive translation, operating across a wide range of timescales. Mutations in tRNAs or in genes affecting tRNA biogenesis are also linked to complex human diseases with surprising heterogeneity in tissue vulnerability, and we highlight cell-specific aspects that modulate the disease penetrance of tRNA-based pathologies.
The most crucial step in data processing from high-throughput sequencing applications is the accurate and sensitive alignment of the sequencing reads to reference genomes or transcriptomes. The accurate detection of insertions and deletions (indels) and errors introduced by the sequencing platform or by misreading of modified nucleotides is essential for the quantitative processing of the RNA-based sequencing (RNA-Seq) datasets and for the identification of genetic variations and modification patterns. We developed a new, fast and accurate algorithm for nucleic acid sequence analysis, FANSe, with adjustable mismatch allowance settings and the ability to handle indels, to accurately and quantitatively map millions of reads to small or large reference genomes. It is a seed-based algorithm which uses the whole read information for mapping; high sensitivity and low ambiguity are achieved by using short and non-overlapping reads. Furthermore, FANSe uses a hotspot score to prioritize the processing of highly probable matches and implements a modified Smith-Waterman refinement with a reduced scoring matrix to accelerate the calculation without compromising its sensitivity. The FANSe algorithm stably processes datasets from various sequencing platforms, masked or unmasked and small or large genomes. It shows a remarkable coverage of low-abundance mRNAs, which is important for quantitative processing of RNA-Seq datasets.
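For orientation, the refinement step named in the abstract builds on Smith-Waterman local alignment. The sketch below is the textbook version with a linear gap penalty, not FANSe's modified variant or its reduced scoring matrix; the function name and the scoring values (match +2, mismatch -1, gap -2) are assumptions chosen for illustration.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between sequences a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)   # previous row of the dynamic-programming matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best

print(smith_waterman_score("ACGTACGT", "ACGTTCGT"))  # read with one mismatch
```

Only two rows of the matrix are kept in memory, which is enough when the score alone is needed; recovering the alignment itself would require storing the full matrix or traceback pointers.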
Folding at the birth of the nascent chain: coordinating translation with co-translational folding
(2011)
In living cells, the folding of many proteins is largely believed to begin co-translationally, during their biosynthesis at the ribosomes. In the ribosomal tunnel, the nascent peptide may establish local interactions and stabilize alpha-helical structures. Long-range contacts are more likely outside the ribosomes after release of larger segments of the nascent chain. Examples suggest that domains can attain native-like structure on the ribosome with and without population of folding intermediates. Co-translational folding is limited by the speed of the gradual extrusion of the nascent peptide, which imposes conformational restraints on its folding landscape. Recent experimental and in silico modeling studies indicate that translation kinetics fine-tunes co-translational folding by providing a time delay for sequential folding of distinct portions of the nascent chain.
Synonymous codon usage and variations in the level of isoaccepting tRNAs exert a powerful selective force on translation fidelity. We have developed an algorithm to evaluate the relative rate of translation which allows large-scale comparisons of the effect of the non-uniform translation rate on protein biogenesis. Using the complete genomes of Escherichia coli and Bacillus subtilis we show that stretches of codons pairing to minor tRNAs form putative sites to locally attenuate translation; these sites tend to cluster in close proximity, whereas long contiguous stretches of slow-translating triplets are avoided. The presence of slow-translating segments positively correlates with the protein length irrespective of the protein abundance. The slow-translating clusters are predominantly located downstream of the domain boundaries, presumably to fine-tune translational accuracy with the folding fidelity of multidomain proteins. Translation attenuation patterns at highly structurally and functionally conserved domains are preserved across the species, suggesting a concerted selective pressure on the codon selection and species-specific tRNA abundance in these regions.
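The abstract does not spell out the algorithm, so the following is only a toy sketch of one way such a profile could be computed: each codon is given a relative rate proportional to the level of its cognate tRNA, the rates are averaged over a sliding window, and windows below a threshold are flagged as putative attenuation sites. The function names, the window size, the threshold and all tRNA levels are hypothetical.

```python
def window_rates(codons, trna_level, window=3):
    """Mean relative translation rate over each sliding window of codons."""
    rates = [trna_level[c] for c in codons]
    return [sum(rates[i:i + window]) / window
            for i in range(len(rates) - window + 1)]

def slow_windows(codons, trna_level, window=3, threshold=0.3):
    """Start indices of windows whose mean rate falls below the threshold."""
    return [i for i, r in enumerate(window_rates(codons, trna_level, window))
            if r < threshold]

# Hypothetical relative tRNA levels (1.0 = most abundant), for illustration only.
levels = {"GAA": 1.0, "AAA": 0.9, "CUA": 0.1, "AGG": 0.05}
gene = ["GAA", "AAA", "GAA", "CUA", "AGG", "CUA", "AAA", "GAA"]
print(slow_windows(gene, levels))  # the cluster of low-abundance codons is flagged
```

Averaging over a window rather than scoring single codons reflects the abstract's point that clusters of slow codons, not isolated ones, form the attenuation sites.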
The translation of genetic information according to the sequence of the mRNA template occurs with high accuracy and fidelity. Critical events in each single step of translation are selection of transfer RNA (tRNA), codon reading and tRNA-regeneration for a new cycle. We developed a model that accurately describes the dynamics of single elongation steps, thus providing a systematic insight into the sensitivity of the mRNA translation rate to dynamic environmental conditions. Alterations in the concentration of the aminoacylated tRNA can transiently stall the ribosomes during translation which results, as suggested by the model, in two outcomes: either stress-induced change in the tRNA availability triggers the premature termination of the translation and ribosomal dissociation, or extensive demand for one tRNA species results in a competition between frameshift to an aberrant open-reading frame and ribosomal drop-off. Using the bacterial Escherichia coli system, we experimentally draw parallels between these two possible mechanisms.