183 research outputs found
Population genetics of translational robustness
Recent work has shown that expression level is the main predictor of a
gene’s evolutionary rate, and that more highly expressed genes evolve
slower. A possible explanation for this observation is selection for proteins
which fold properly despite mistranslation, in short selection for
translational robustness. Translational robustness leads to the somewhat
paradoxical prediction that highly expressed genes are extremely tolerant to
missense substitutions but nevertheless evolve very slowly. Here, we study a
simple theoretical model of translational robustness that allows us to gain
analytic insight into how this paradoxical behavior arises.Comment: 32 pages, 4 figures, Genetics in pres
Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution
SummaryStrikingly consistent correlations between rates of coding-sequence evolution and gene expression levels are apparent across taxa, but the biological causes behind the selective pressures on coding-sequence evolution remain controversial. Here, we demonstrate conserved patterns of simple covariation between sequence evolution, codon usage, and mRNA level in E. coli, yeast, worm, fly, mouse, and human that suggest that all observed trends stem largely from a unified underlying selective pressure. In metazoans, these trends are strongest in tissues composed of neurons, whose structure and lifetime confer extreme sensitivity to protein misfolding. We propose, and demonstrate using a molecular-level evolutionary simulation, that selection against toxicity of misfolded proteins generated by ribosome errors suffices to create all of the observed covariation. The mechanistic model of molecular evolution that emerges yields testable biochemical predictions, calls into question the use of nonsynonymous-to-synonymous substitution ratios (Ka/Ks) to detect functional selection, and suggests how mistranslation may contribute to neurodegenerative disease
Recommended from our members
Transient intracellular acidification regulates the core transcriptional heat shock response
Heat shock induces a conserved transcriptional program regulated by heat shock factor 1 (Hsf1) in eukaryotic cells. Activation of this heat shock response is triggered by heat-induced misfolding of newly synthesized polypeptides, and so has been thought to depend on ongoing protein synthesis. Here, using the budding yeast Saccharomyces cerevisiae, we report the discovery that Hsf1 can be robustly activated when protein synthesis is inhibited, so long as cells undergo cytosolic acidification. Heat shock has long been known to cause transient intracellular acidification which, for reasons which have remained unclear, is associated with increased stress resistance in eukaryotes. We demonstrate that acidification is required for heat shock response induction in translationally inhibited cells, and specifically affects Hsf1 activation. Physiological heat-triggered acidification also increases population fitness and promotes cell cycle reentry following heat shock. Our results uncover a previously unknown adaptive dimension of the well-studied eukaryotic heat shock response
Recommended from our members
Good Codons, Bad Transcript: Large Reductions in Gene Expression and Fitness Arising from Synonymous Mutations in a Key Enzyme
Biased codon usage in protein-coding genes is pervasive, whereby amino acids are largely encoded by a specific subset of possible codons. Within individual genes, codon bias is stronger at evolutionarily conserved residues, favoring codons recognized by abundant tRNAs. Although this observation suggests an overall pattern of selection for translation speed and/or accuracy, other work indicates that transcript structure or binding motifs drive codon usage. However, our understanding of codon bias evolution is constrained by limited experimental data on the fitness effects of altering codons in functional genes. To bridge this gap, we generated synonymous variants of a key enzyme-coding gene in Methylobacterium extorquens. We found that mutant gene expression, enzyme production, enzyme activity, and fitness were all significantly lower than wild-type. Surprisingly, encoding the gene using only rare codons decreased fitness by 40%, whereas an allele coded entirely by frequent codons decreased fitness by more than 90%. Increasing gene expression restored mutant fitness to varying degrees, demonstrating that the fitness disadvantage of synonymous mutants arose from a lack of beneficial protein rather than costs of protein production. Protein production was negatively correlated with the frequency of motifs with high affinity for the anti-Shine-Dalgarno sequence, suggesting ribosome pausing as the dominant cause of low mutant fitness. Together, our data support the idea that, although a particular set of codons are favored on average across a genome, in an individual gene selection can either act for or against codons depending on their local context.Organismic and Evolutionary Biolog
Pervasive, conserved secondary structure in highly charged protein regions
Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physicochemical and dynamical properties which determine function in the absence of stable structure. Here we study highly charged protein regions (>40% charged residues), which are routinely presumed to be disordered. Using recent advances in structure prediction and experimental structures, we show that roughly 40% of these regions form well-structured helices. Features often used to predict disorder—high charge density, low hydrophobicity, low sequence complexity, and evolutionarily varying length—are also compatible with solvated, variable-length helices. We show that a simple composition classifier predicts the existence of structure far better than well-established heuristics based on charge and hydropathy. We show that helical structure is more prevalent than previously appreciated in highly charged regions of diverse proteomes and characterize the conservation of highly charged regions. Our results underscore the importance of integrating, rather than choosing between, structure- and disorder-based approaches
Bulk and surface energetics of lithium hydride crystal: benchmarks from quantum Monte Carlo and quantum chemistry
We show how accurate benchmark values of the surface formation energy of
crystalline lithium hydride can be computed by the complementary techniques of
quantum Monte Carlo (QMC) and wavefunction-based molecular quantum chemistry.
To demonstrate the high accuracy of the QMC techniques, we present a detailed
study of the energetics of the bulk LiH crystal, using both pseudopotential and
all-electron approaches. We show that the equilibrium lattice parameter agrees
with experiment to within 0.03 %, which is around the experimental uncertainty,
and the cohesive energy agrees to within around 10 meV per formula unit. QMC in
periodic slab geometry is used to compute the formation energy of the LiH (001)
surface, and we show that the value can be accurately converged with respect to
slab thickness and other technical parameters. The quantum chemistry
calculations build on the recently developed hierarchical scheme for computing
the correlation energy of a crystal to high precision. We show that the
hierarchical scheme allows the accurate calculation of the surface formation
energy, and we present results that are well converged with respect to basis
set and with respect to the level of correlation treatment. The QMC and
hierarchical results for the surface formation energy agree to within about 1
%.Comment: 16 pages, 4 figure
Quantifying Condition-Dependent Intracellular Protein Levels Enables High-Precision Fitness Estimates
Countless studies monitor the growth rate of microbial populations as a measure of fitness. However, an enormous gap separates growth-rate differences measurable in the laboratory from those that natural selection can distinguish efficiently. Taking advantage of the recent discovery that transcript and protein levels in budding yeast closely track growth rate, we explore the possibility that growth rate can be more sensitively inferred by monitoring the proteomic response to growth, rather than growth itself. We find a set of proteins whose levels, in aggregate, enable prediction of growth rate to a higher precision than direct measurements. However, we find little overlap between these proteins and those that closely track growth rate in other studies. These results suggest that, in yeast, the pathways that set the pace of cell division can differ depending on the growth-altering stimulus. Still, with proper validation, protein measurements can provide high-precision growth estimates that allow extension of phenotypic growth-based assays closer to the limits of evolutionary selection
Why high-error-rate random mutagenesis libraries are enriched in functional and improved proteins
Recently, several groups have used error-prone polymerase chain reactions to
construct mutant libraries containing up to 27 nucleotide mutations per gene on
average, and reported a striking observation: although retention of protein
function initially declines exponentially with mutations as has previously been
observed, orders of magnitude more proteins remain viable at the highest
mutation rates than this trend would predict. Mutant proteins having improved
or novel activity were isolated disproportionately from these heavily mutated
libraries, leading to the suggestion that distant regions of sequence space are
enriched in useful cooperative mutations and that optimal mutagenesis should
target these regions. If true, these claims have profound implications for
laboratory evolution and for evolutionary theory. Here, we demonstrate that
properties of the polymerase chain reaction can explain these results and,
consequently, that average protein viability indeed decreases exponentially
with mutational distance at all error rates. We show that high-error-rate
mutagenesis may be useful in certain cases, though for very different reasons
than originally proposed, and that optimal mutation rates are inherently
protocol-dependent. Our results allow optimal mutation rates to be found given
mutagenesis conditions and a protein of known mutational robustness.Comment: Optimality results improved. 26 pages, 4 figures, 3 table
- …