666 research outputs found
Assessing the effect of dynamics on the closed-loop protein-folding hypothesis
The closed-loop (loop-n-lock) hypothesis of protein folding suggests that loops of about 25 residues, closed through interactions between the loop ends (locks), play an important role in protein structure. Coarse-grain elastic network simulations, and examination of loop lengths in a diverse set of proteins, each supports a bias towards loops of close to 25 residues in length between residues of high stability. Previous studies have established a correlation between total contact distance (TCD), a metric of sequence distances between contacting residues (cf. contact order), and the log-folding rate of a protein. In a set of 43 proteins, we identify an improved correlation (
r
2
= 0.76), when the metric is restricted to residues contacting the locks, compared to the equivalent result when all residues are considered (
r
2
= 0.65). This provides qualified support for the hypothesis, albeit with an increased emphasis upon the importance of a much larger set of residues surrounding the locks. Evidence of a similar-sized protein core/extended nucleus (with significant overlap) was obtained from TCD calculations in which residues were successively eliminated according to their hydrophobicity and connectivity, and from molecular dynamics simulations. Our results suggest that while folding is determined by a subset of residues that can be predicted by application of the closed-loop hypothesis, the original hypothesis is too simplistic; efficient protein folding is dependent on a considerably larger subset of residues than those involved in lock formation.
</jats:p
Designing libraries of chimeric proteins using SCHEMA recombination and RASPP
SCHEMA is a method for designing libraries of novel proteins by recombination of homologous sequences. The goal is to maximize the number of folded proteins while simultaneously generating significant sequence diversity. Here, we use the RASPP algorithm to identify optimal SCHEMA designs for shuffling contiguous elements of sequence. To exemplify the method, SCHEMA is used to recombine five fungal cellobiohydrolases (CBH1s) to produce a library of more than 390,000 novel CBH1 sequences
Predictability of evolutionary trajectories in fitness landscapes
Experimental studies on enzyme evolution show that only a small fraction of
all possible mutation trajectories are accessible to evolution. However, these
experiments deal with individual enzymes and explore a tiny part of the fitness
landscape. We report an exhaustive analysis of fitness landscapes constructed
with an off-lattice model of protein folding where fitness is equated with
robustness to misfolding. This model mimics the essential features of the
interactions between amino acids, is consistent with the key paradigms of
protein folding and reproduces the universal distribution of evolutionary rates
among orthologous proteins. We introduce mean path divergence as a quantitative
measure of the degree to which the starting and ending points determine the
path of evolution in fitness landscapes. Global measures of landscape roughness
are good predictors of path divergence in all studied landscapes: the mean path
divergence is greater in smooth landscapes than in rough ones. The
model-derived and experimental landscapes are significantly smoother than
random landscapes and resemble additive landscapes perturbed with moderate
amounts of noise; thus, these landscapes are substantially robust to mutation.
The model landscapes show a deficit of suboptimal peaks even compared with
noisy additive landscapes with similar overall roughness. We suggest that
smoothness and the substantial deficit of peaks in the fitness landscapes of
protein evolution are fundamental consequences of the physics of protein
folding.Comment: 14 pages, 7 figure
Design principles for riboswitch function
Scientific and technological advances that enable the tuning of integrated regulatory components to match network and system requirements are critical to reliably control the function of biological systems. RNA provides a promising building block for the construction of tunable regulatory components based on its rich regulatory capacity and our current understanding of the sequence–function relationship. One prominent example of RNA-based regulatory components is riboswitches, genetic elements that mediate ligand control of gene expression through diverse regulatory mechanisms. While characterization of natural and synthetic riboswitches has revealed that riboswitch function can be modulated through sequence alteration, no quantitative frameworks exist to investigate or guide riboswitch tuning. Here, we combined mathematical modeling and experimental approaches to investigate the relationship between riboswitch function and performance. Model results demonstrated that the competition between reversible and irreversible rate constants dictates performance for different regulatory mechanisms. We also found that practical system restrictions, such as an upper limit on ligand concentration, can significantly alter the requirements for riboswitch performance, necessitating alternative tuning strategies. Previous experimental data for natural and synthetic riboswitches as well as experiments conducted in this work support model predictions. From our results, we developed a set of general design principles for synthetic riboswitches. Our results also provide a foundation from which to investigate how natural riboswitches are tuned to meet systems-level regulatory demands
Glucanocellulosic ethanol: The undiscovered biofuel potential in energy crops and marine biomass
Converting biomass to biofuels is a key strategy in substituting fossil fuels to mitigate climate change. Conventional strategies to convert lignocellulosic biomass to ethanol address the fermentation of cellulose-derived glucose. Here we used super-resolution fluorescence microscopy to uncover the nanoscale structure of cell walls in the energy crops maize and Miscanthus where the typical polymer cellulose forms an unconventional layered architecture with the atypical (1, 3)-β-glucan polymer callose. This raised the question about an unused potential of (1, 3)-β-glucan in the fermentation of lignocellulosic biomass. Engineering biomass conversion for optimized (1, 3)-β-glucan utilization, we increased the ethanol yield from both energy crops. The generation of transgenic Miscanthus lines with an elevated (1, 3)-β-glucan content further increased ethanol yield providing a new strategy in energy crop breeding. Applying the (1, 3)-β-glucan-optimized conversion method on marine biomass from brown macroalgae with a naturally high (1, 3)-β-glucan content, we not only substantially increased ethanol yield but also demonstrated an effective co-fermentation of plant and marine biomass. This opens new perspectives in combining different kinds of feedstock for sustainable and efficient biofuel production, especially in coastal regions
Predicting the Tolerated Sequences for Proteins and Protein Interfaces Using RosettaBackrub Flexible Backbone Design
Predicting the set of sequences that are tolerated by a protein or protein interface, while maintaining a desired function, is useful for characterizing protein interaction specificity and for computationally designing sequence libraries to engineer proteins with new functions. Here we provide a general method, a detailed set of protocols, and several benchmarks and analyses for estimating tolerated sequences using flexible backbone protein design implemented in the Rosetta molecular modeling software suite. The input to the method is at least one experimentally determined three-dimensional protein structure or high-quality model. The starting structure(s) are expanded or refined into a conformational ensemble using Monte Carlo simulations consisting of backrub backbone and side chain moves in Rosetta. The method then uses a combination of simulated annealing and genetic algorithm optimization methods to enrich for low-energy sequences for the individual members of the ensemble. To emphasize certain functional requirements (e.g. forming a binding interface), interactions between and within parts of the structure (e.g. domains) can be reweighted in the scoring function. Results from each backbone structure are merged together to create a single estimate for the tolerated sequence space. We provide an extensive description of the protocol and its parameters, all source code, example analysis scripts and three tests applying this method to finding sequences predicted to stabilize proteins or protein interfaces. The generality of this method makes many other applications possible, for example stabilizing interactions with small molecules, DNA, or RNA. Through the use of within-domain reweighting and/or multistate design, it may also be possible to use this method to find sequences that stabilize particular protein conformations or binding interactions over others
Organizing Effects of Sex Steroids on Brain Aromatase Activity in Quail
Preoptic/hypothalamic aromatase activity (AA) is sexually differentiated in birds and mammals but the mechanisms controlling this sex difference remain unclear. We determined here (1) brain sites where AA is sexually differentiated and (2) whether this sex difference results from organizing effects of estrogens during ontogeny or activating effects of testosterone in adulthood. In the first experiment we measured AA in brain regions micropunched in adult male and female Japanese quail utilizing the novel strategy of basing the microdissections on the distribution of aromatase-immunoreactive cells. The largest sex difference was found in the medial bed nucleus of the stria terminalis (mBST) followed by the medial preoptic nucleus (POM) and the tuberal hypothalamic region. A second experiment tested the effect of embryonic treatments known to sex-reverse male copulatory behavior (i.e., estradiol benzoate [EB] or the aromatase inhibitor, Vorozole) on brain AA in gonadectomized adult males and females chronically treated as adults with testosterone. Embryonic EB demasculinized male copulatory behavior, while vorozole blocked demasculinization of behavior in females as previously demonstrated in birds. Interestingly, these treatments did not affect a measure of appetitive sexual behavior. In parallel, embryonic vorozole increased, while EB decreased AA in pooled POM and mBST, but the same effect was observed in both sexes. Together, these data indicate that the early action of estrogens demasculinizes AA. However, this organizational action of estrogens on AA does not explain the behavioral sex difference in copulatory behavior since AA is similar in testosterone-treated males and females that were or were not exposed to embryonic treatments with estrogens
STAR: predicting recombination sites from amino acid sequence
BACKGROUND: Designing novel proteins with site-directed recombination has enormous prospects. By locating effective recombination sites for swapping sequence parts, the probability that hybrid sequences have the desired properties is increased dramatically. The prohibitive requirements for applying current tools led us to investigate machine learning to assist in finding useful recombination sites from amino acid sequence alone. RESULTS: We present STAR, Site Targeted Amino acid Recombination predictor, which produces a score indicating the structural disruption caused by recombination, for each position in an amino acid sequence. Example predictions contrasted with those of alternative tools, illustrate STAR'S utility to assist in determining useful recombination sites. Overall, the correlation coefficient between the output of the experimentally validated protein design algorithm SCHEMA and the prediction of STAR is very high (0.89). CONCLUSION: STAR allows the user to explore useful recombination sites in amino acid sequences with unknown structure and unknown evolutionary origin. The predictor service is available from
Ribozyme-based insulator parts buffer synthetic circuits from genetic context
Synthetic genetic programs are built from circuits that integrate sensors and implement temporal control of gene expression. Transcriptional circuits are layered by using promoters to carry the signal between circuits. In other words, the output promoter of one circuit serves as the input promoter to the next. Thus, connecting circuits requires physically connecting a promoter to the next circuit. We show that the sequence at the junction between the input promoter and circuit can affect the input-output response (transfer function) of the circuit. A library of putative sequences that might reduce (or buffer) such context effects, which we refer to as 'insulator parts', is screened in Escherichia coli. We find that ribozymes that cleave the 5′ untranslated region (5′-UTR) of the mRNA are effective insulators. They generate quantitatively identical transfer functions, irrespective of the identity of the input promoter. When these insulators are used to join synthetic gene circuits, the behavior of layered circuits can be predicted using a mathematical model. The inclusion of insulators will be critical in reliably permuting circuits to build different programs.Life Technologies, Inc.United States. Defense Advanced Research Projects Agency (DARPA CLIO N66001-12-C-4018)United States. Office of Naval Research (N00014-10-1-0245)National Science Foundation (U.S.) (CCF-0943385)National Institutes of Health (U.S.) (AI067699)National Science Foundation (U.S.). Synthetic Biology Engineering Research Center (SynBERC, SA5284-11210
- …