58 research outputs found
Engineering an AB<sub>5</sub>Â protein carrier
The promise of biologic therapeutics is hindered by the challenge to deliver their activity to biochemically relevant sites within diseased cells. The favourable application of the natural protein carriers of the AB5 toxin family to this challenge has been restricted owing to still unresolved requirements for assembling non-native cargo into carrier complexes. Here, we clarify the properties of fusion peptides which allow co-assembly of a selected fluorescent protein cargo with the non-toxic B subunit of a heat-labile enterotoxin. We establish the influence of sequence length, sequence identity and secondary structure of these linking domains on the assembly and disassembly of the complexes. Through our engineering framework we identify several non-native, reduced length fusion sequences that robustly assemble with the native carriers, maintain their ability to deliver protein cargo to cells, and demonstrate substantially refined in vitro properties. Constructs based upon these sequences should prove directly applicable to a variety of protein delivery challenges, and the described design framework should find immediate application to other members of the AB5 protein carrier family
ProtGPT2 is a deep unsupervised language model for protein design
Protein design aims to build novel proteins customized for specific purposes, thereby holding the potential to tackle many environmental and biomedical problems. Recent progress in Transformer-based architectures has enabled the implementation of language models capable of generating text with human-like capabilities. Here, motivated by this success, we describe ProtGPT2, a language model trained on the protein space that generates de novo protein sequences following the principles of natural ones. The generated proteins display natural amino acid propensities, while disorder predictions indicate that 88% of ProtGPT2-generated proteins are globular, in line with natural sequences. Sensitive sequence searches in protein databases show that ProtGPT2 sequences are distantly related to natural ones, and similarity networks further demonstrate that ProtGPT2 is sampling unexplored regions of protein space. AlphaFold prediction of ProtGPT2-sequences yields well-folded non-idealized structures with embodiments and large loops and reveals topologies not captured in current structure databases. ProtGPT2 generates sequences in a matter of seconds and is freely available
Retracing the evolution of a modern periplasmic binding protein
Investigating the evolution of structural features in modern multidomain proteins helps to understand their immense diversity and functional versatility. The class of periplasmic binding proteins PBPs offers an opportunity to interrogate one of the main processes driving diversification the duplication and fusion of protein sequences to generate new architectures. The symmetry of their two lobed topology, their mechanism of binding, and the organization of their operon structure led to the hypothesis that PBPs arose through a duplication and fusion event of a single common ancestor. To investigate this claim, we set out to reverse the evolutionary process and recreate the structural equivalent of a single lobed progenitor using ribose binding protein RBP as our model. We found that this modern PBP can be deconstructed into its lobes, producing two proteins that represent possible progenitor halves. The isolated halves of RBP are well folded and monomeric proteins, albeit with a lower thermostability, and do not retain the original binding function. However, the two entities readily form a heterodimer in vitro and in cell. The x ray structure of the heterodimer closely resembles the parental protein. Moreover, the binding function is fully regained upon formation of the heterodimer with a ligand affinity similar to that observed in the modern RBP. This highlights how a duplication event could have given rise to a stable and functional PBP like fold and provides insights into how more complex functional structures can evolve from simpler molecular component
Extension of a de novo TIM barrel with a rationally designed secondary structure element
The ability to construct novel enzymes is a major aim in de novo protein design. A popular enzyme fold for design attempts is the TIM barrel. This fold is a common topology for enzymes and can harbor many diverse reactions. The recent de novo design of a four-fold symmetric TIM barrel provides a well understood minimal scaffold for potential enzyme designs. Here we explore opportunities to extend and diversify this scaffold by adding a short de novo helix on top of the barrel. Due to the size of the protein, we developed a design pipeline based on computational ab initio folding that solves a less complex sub-problem focused around the helix and its vicinity and adapt it to the entire protein. We provide biochemical characterization and a high-resolution X-ray structure for one variant and compare it to our design model. The successful extension of this robust TIM-barrel scaffold opens opportunities to diversify it towards more pocket like arrangements and as such can be considered a building block for future design of binding or catalytic sites
Fine-tuning spermidine binding modes in the putrescine binding protein PotF
A profound understanding of the molecular interactions between receptors and ligands is important throughout diverse research, such as protein design, drug discovery, or neuroscience. What determines specificity and how do proteins discriminate against similar ligands? In this study, we analyzed factors that determine binding in two homologs belonging to the well-known superfamily of periplasmic binding proteins, PotF and PotD. Building on a previously designed construct, modes of polyamine binding were swapped. This change of specificity was approached by analyzing local differences in the binding pocket as well as overall conformational changes in the protein. Throughout the study, protein variants were generated and characterized structurally and thermodynamically, leading to a specificity swap and improvement in affinity. This dataset not only enriches our knowledge applicable to rational protein design but also our results can further lay groundwork for engineering of specific biosensors as well as help to explain the adaptability of pathogenic bacteria
A newly introduced salt bridge cluster improves structural and biophysical properties of de novo TIM barrels
Protein stability can be fineâtuned by modifying different structural features such as hydrogenâbond networks, salt bridges, hydrophobic cores, or disulfide bridges. Among these, stabilization by salt bridges is a major challenge in protein design and engineering since their stabilizing effects show a high dependence on the structural environment in the protein, and therefore are difficult to predict and model. In this work, we explore the effects on structure and stability of an introduced salt bridge cluster in the context of three different de novo TIM barrels. The salt bridge variants exhibit similar thermostability in comparison with their parental designs but important differences in the conformational stability at 25°C can be observed such as a highly stabilizing effect for two of the proteins but a destabilizing effect to the third. Analysis of the formed geometries of the salt bridge cluster in the crystal structures show either highly ordered salt bridge clusters or only single salt bridges. Rosetta modeling of the salt bridge clusters results in a good prediction of the tendency on stability changes but not the geometries observed in the threeâdimensional structures. The results show that despite the similarities in protein fold, the salt bridge clusters differently influence the structural and stability properties of the de novo TIM barrel variants depending on the structural background where they are introduced
PocketOptimizer 2.0 : A modular framework for computerâaided ligandâbinding design
The ability to design customized proteins to perform specific tasks is of great interest. We are particularly interested in the design of sensitive and specific small molecule ligandâbinding proteins for biotechnological or biomedical applications. Computational methods can narrow down the immense combinatorial space to find the best solution and thus provide starting points for experimental procedures. However, success rates strongly depend on accurate modeling and energetic evaluation. Not only intraâ but also intermolecular interactions have to be considered. To address this problem, we developed PocketOptimizer, a modular computational protein design pipeline, that predicts mutations in the binding pockets of proteins to increase affinity for a specific ligand. Its modularity enables users to compare different combinations of force fields, rotamer libraries, and scoring functions. Here, we present a muchâimproved versionââPocketOptimizer 2.0. We implemented a cleaner user interface, an extended architecture with more supported tools, such as force fields and scoring functions, a backboneâdependent rotamer library, as well as different improvements in the underlying algorithms. Version 2.0 was tested against a benchmark of design cases and assessed in comparison to the first version. Our results show how newly implemented features such as the new rotamer library can lead to improved prediction accuracy. Therefore, we believe that PocketOptimizer 2.0, with its many new and improved functionalities, provides a robust and versatile environment for the design of small moleculeâbinding pockets in proteins. It is widely applicable and extendible due to its modular framework. PocketOptimizer 2.0 can be downloaded at https://github.com/Hoecker-Lab/pocketoptimizer
- âŠ