62 research outputs found
Computational design of catalytically active TIM barrel xylanases
TIM barrel enzymes perform five out of six Enzyme Commission (EC) reaction classes, and are therefore one of the most promising class of enzymes for computational design. Up until now, chimeric TIM barrels were made out of half barrels and catalytic activity was installed using laborious rounds of random mutagenesis, selection and rational design, whereas a recently described de novo designed TIM barrel lacked the elaborate loop conformations necessary to install catalytic activity, and indeed did not show activity.
In order to design catalytically active TIM barrel glycoside-hydrolase 10 family xylanases, we sampled diverse protein backbone conformations, in some cases generating completely new combinations of backbones at all eight beta-alpha blades, and optimized the sequence for both protein stability and catalytic activity. We then selected structurally diverse, low energy subset of the designs for further characterization. Designs had \u3c60% sequence identity to natural xylanases, incorporating many insertions and deletions in loop regions. All designs expressed well and 30% showed catalytic activity, the most active one having Kcat/Km = 106.
The design process samples extensively the sequence-structure space and the designs can serve as a library for altered enzyme selectivity. Moreover, our algorithm is general and robust, and can be applied to other TIM barrel enzyme families having a few dozen solved structures and to other modular protein folds
Recommended from our members
A combined computational-experimental approach to define the structural origin of antibody recognition of sialyl-Tn, a tumor-associated carbohydrate antigen.
Anti-carbohydrate monoclonal antibodies (mAbs) hold great promise as cancer therapeutics and diagnostics. However, their specificity can be mixed, and detailed characterization is problematic, because antibody-glycan complexes are challenging to crystallize. Here, we developed a generalizable approach employing high-throughput techniques for characterizing the structure and specificity of such mAbs, and applied it to the mAb TKH2 developed against the tumor-associated carbohydrate antigen sialyl-Tn (STn). The mAb specificity was defined by apparent KD values determined by quantitative glycan microarray screening. Key residues in the antibody combining site were identified by site-directed mutagenesis, and the glycan-antigen contact surface was defined using saturation transfer difference NMR (STD-NMR). These features were then employed as metrics for selecting the optimal 3D-model of the antibody-glycan complex, out of thousands plausible options generated by automated docking and molecular dynamics simulation. STn-specificity was further validated by computationally screening of the selected antibody 3D-model against the human sialyl-Tn-glycome. This computational-experimental approach would allow rational design of potent antibodies targeting carbohydrates
RosettaScripts: A Scripting Language Interface to the Rosetta Macromolecular Modeling Suite
Macromolecular modeling and design are increasingly useful in basic research, biotechnology, and teaching. However, the absence of a user-friendly modeling framework that provides access to a wide range of modeling capabilities is hampering the wider adoption of computational methods by non-experts. RosettaScripts is an XML-like language for specifying modeling tasks in the Rosetta framework. RosettaScripts provides access to protocol-level functionalities, such as rigid-body docking and sequence redesign, and allows fast testing and deployment of complex protocols without need for modifying or recompiling the underlying C++ code. We illustrate these capabilities with RosettaScripts protocols for the stabilization of proteins, the generation of computationally constrained libraries for experimental selection of higher-affinity binding proteins, loop remodeling, small-molecule ligand docking, design of ligand-binding proteins, and specificity redesign in DNA-binding proteins
Preclinical development of a stabilized RH5 virus-like particle vaccine that induces improved antimalarial antibodies
Plasmodium falciparum reticulocyte-binding protein homolog 5 (RH5) is a leading blood-stage malaria vaccine antigen target, currently in a phase 2b clinical trial as a full-length soluble protein/adjuvant vaccine candidate called RH5.1/Matrix-M. We identify that disordered regions of the full-length RH5 molecule induce non-growth inhibitory antibodies in human vaccinees and that a re-engineered and stabilized immunogen (including just the alpha-helical core of RH5) induces a qualitatively superior growth inhibitory antibody response in rats vaccinated with this protein formulated in Matrix-M adjuvant. In parallel, bioconjugation of this immunogen, termed "RH5.2," to hepatitis B surface antigen virus-like particles (VLPs) using the "plug-and-display" SpyTag-SpyCatcher platform technology also enables superior quantitative antibody immunogenicity over soluble protein/adjuvant in vaccinated mice and rats. These studies identify a blood-stage malaria vaccine candidate that may improve upon the current leading soluble protein vaccine candidate RH5.1/Matrix-M. The RH5.2-VLP/Matrix-M vaccine candidate is now under evaluation in phase 1a/b clinical trials
Community-Wide Assessment of Protein-Interface Modeling Suggests Improvements to Design Methodology
The CAPRI and CASP prediction experiments have demonstrated the power of community wide tests of methodology in assessing the current state of the art and spurring progress in the very challenging areas of protein docking and structure prediction. We sought to bring the power of community wide experiments to bear on a very challenging protein design problem that provides a complementary but equally fundamental test of current understanding of protein-binding thermodynamics. We have generated a number of designed protein-protein interfaces with very favorable computed binding energies but which do not appear to be formed in experiments, suggesting there may be important physical chemistry missing in the energy calculations. 28 research groups took up the challenge of determining what is missing: we provided structures of 87 designed complexes and 120 naturally occurring complexes and asked participants to identify energetic contributions and/or structural features that distinguish between the two sets. The community found that electrostatics and solvation terms partially distinguish the designs from the natural complexes, largely due to the non-polar character of the designed interactions. Beyond this polarity difference, the community found that the designed binding surfaces were on average structurally less embedded in the designed monomers, suggesting that backbone conformational rigidity at the designed surface is important for realization of the designed function. These results can be used to improve computational design strategies, but there is still much to be learned; for example, one designed complex, which does form in experiments, was classified by all metrics as a non-binder
What Have We Learned from Design of Function in Large Proteins?
The overarching goal of computational protein design is to gain complete control over protein structure and function. The majority of sophisticated binders and enzymes, however, are large and exhibit diverse and complex folds that defy atomistic design calculations. Encouragingly, recent strategies that combine evolutionary constraints from natural homologs with atomistic calculations have significantly improved design accuracy. In these approaches, evolutionary constraints mitigate the risk from misfolding and aggregation, focusing atomistic design calculations on a small but highly enriched sequence subspace. Such methods have dramatically optimized diverse proteins, including vaccine immunogens, enzymes for sustainable chemistry, and proteins with therapeutic potential. The new generation of deep learning-based ab initio structure predictors can be combined with these methods to extend the scope of protein design, in principle, to any natural protein of known sequence. We envision that protein engineering will come to rely on completely computational methods to efficiently discover and optimize biomolecular activities
- …