183 research outputs found

    Population genetics of translational robustness

    Full text link
    Recent work has shown that expression level is the main predictor of a gene’s evolutionary rate, and that more highly expressed genes evolve slower. A possible explanation for this observation is selection for proteins which fold properly despite mistranslation, in short selection for translational robustness. Translational robustness leads to the somewhat paradoxical prediction that highly expressed genes are extremely tolerant to missense substitutions but nevertheless evolve very slowly. Here, we study a simple theoretical model of translational robustness that allows us to gain analytic insight into how this paradoxical behavior arises.Comment: 32 pages, 4 figures, Genetics in pres

    Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution

    Get PDF
    SummaryStrikingly consistent correlations between rates of coding-sequence evolution and gene expression levels are apparent across taxa, but the biological causes behind the selective pressures on coding-sequence evolution remain controversial. Here, we demonstrate conserved patterns of simple covariation between sequence evolution, codon usage, and mRNA level in E. coli, yeast, worm, fly, mouse, and human that suggest that all observed trends stem largely from a unified underlying selective pressure. In metazoans, these trends are strongest in tissues composed of neurons, whose structure and lifetime confer extreme sensitivity to protein misfolding. We propose, and demonstrate using a molecular-level evolutionary simulation, that selection against toxicity of misfolded proteins generated by ribosome errors suffices to create all of the observed covariation. The mechanistic model of molecular evolution that emerges yields testable biochemical predictions, calls into question the use of nonsynonymous-to-synonymous substitution ratios (Ka/Ks) to detect functional selection, and suggests how mistranslation may contribute to neurodegenerative disease

    Pervasive, conserved secondary structure in highly charged protein regions

    Get PDF
    Understanding how protein sequences confer function remains a defining challenge in molecular biology. Two approaches have yielded enormous insight yet are often pursued separately: structure-based, where sequence-encoded structures mediate function, and disorder-based, where sequences dictate physicochemical and dynamical properties which determine function in the absence of stable structure. Here we study highly charged protein regions (>40% charged residues), which are routinely presumed to be disordered. Using recent advances in structure prediction and experimental structures, we show that roughly 40% of these regions form well-structured helices. Features often used to predict disorder—high charge density, low hydrophobicity, low sequence complexity, and evolutionarily varying length—are also compatible with solvated, variable-length helices. We show that a simple composition classifier predicts the existence of structure far better than well-established heuristics based on charge and hydropathy. We show that helical structure is more prevalent than previously appreciated in highly charged regions of diverse proteomes and characterize the conservation of highly charged regions. Our results underscore the importance of integrating, rather than choosing between, structure- and disorder-based approaches

    Bulk and surface energetics of lithium hydride crystal: benchmarks from quantum Monte Carlo and quantum chemistry

    Get PDF
    We show how accurate benchmark values of the surface formation energy of crystalline lithium hydride can be computed by the complementary techniques of quantum Monte Carlo (QMC) and wavefunction-based molecular quantum chemistry. To demonstrate the high accuracy of the QMC techniques, we present a detailed study of the energetics of the bulk LiH crystal, using both pseudopotential and all-electron approaches. We show that the equilibrium lattice parameter agrees with experiment to within 0.03 %, which is around the experimental uncertainty, and the cohesive energy agrees to within around 10 meV per formula unit. QMC in periodic slab geometry is used to compute the formation energy of the LiH (001) surface, and we show that the value can be accurately converged with respect to slab thickness and other technical parameters. The quantum chemistry calculations build on the recently developed hierarchical scheme for computing the correlation energy of a crystal to high precision. We show that the hierarchical scheme allows the accurate calculation of the surface formation energy, and we present results that are well converged with respect to basis set and with respect to the level of correlation treatment. The QMC and hierarchical results for the surface formation energy agree to within about 1 %.Comment: 16 pages, 4 figure

    Quantifying Condition-Dependent Intracellular Protein Levels Enables High-Precision Fitness Estimates

    Get PDF
    Countless studies monitor the growth rate of microbial populations as a measure of fitness. However, an enormous gap separates growth-rate differences measurable in the laboratory from those that natural selection can distinguish efficiently. Taking advantage of the recent discovery that transcript and protein levels in budding yeast closely track growth rate, we explore the possibility that growth rate can be more sensitively inferred by monitoring the proteomic response to growth, rather than growth itself. We find a set of proteins whose levels, in aggregate, enable prediction of growth rate to a higher precision than direct measurements. However, we find little overlap between these proteins and those that closely track growth rate in other studies. These results suggest that, in yeast, the pathways that set the pace of cell division can differ depending on the growth-altering stimulus. Still, with proper validation, protein measurements can provide high-precision growth estimates that allow extension of phenotypic growth-based assays closer to the limits of evolutionary selection

    Why high-error-rate random mutagenesis libraries are enriched in functional and improved proteins

    Get PDF
    Recently, several groups have used error-prone polymerase chain reactions to construct mutant libraries containing up to 27 nucleotide mutations per gene on average, and reported a striking observation: although retention of protein function initially declines exponentially with mutations as has previously been observed, orders of magnitude more proteins remain viable at the highest mutation rates than this trend would predict. Mutant proteins having improved or novel activity were isolated disproportionately from these heavily mutated libraries, leading to the suggestion that distant regions of sequence space are enriched in useful cooperative mutations and that optimal mutagenesis should target these regions. If true, these claims have profound implications for laboratory evolution and for evolutionary theory. Here, we demonstrate that properties of the polymerase chain reaction can explain these results and, consequently, that average protein viability indeed decreases exponentially with mutational distance at all error rates. We show that high-error-rate mutagenesis may be useful in certain cases, though for very different reasons than originally proposed, and that optimal mutation rates are inherently protocol-dependent. Our results allow optimal mutation rates to be found given mutagenesis conditions and a protein of known mutational robustness.Comment: Optimality results improved. 26 pages, 4 figures, 3 table
    • …
    corecore