19 research outputs found

    Successful examples of the application of novel iterative trainable algorithms to guide rational mutation strategies for enzyme engineering: From prediction to lab testing to algorithm retraining

    Get PDF
    Both natural mutations occurring in a homologous enzyme family and mutations engineered in a given protein can have a tremendous impact in the activity and binding behavior of the enzyme towards substrates or other molecules. Binding and catalytic properties can be modified by rationally mutating selected amino acids in a protein. For instance, new specificity properties can be engineered into existing enzymes, which can be applied to the rational design of mutations to alter its catalysis. Although this approach has been largely used, the modifications introduced in the target protein have not been exempt of deleterious effects on protein function, binding or physicochemical properties. Much finer tuned modifications should be designed in order to alter the desired catalytic or binding properties of a protein and simultaneously not affecting other protein properties or functions. These engineered mutations usually require a thorough knowledge of the relevant structure-function relationships in the protein molecule. If no precise structure-function information is available for a protein, the amount of possible amino acid mutations to be tested precludes a direct search. Furthermore, in many cases a directed evolution strategy cannot be successfully used to achieve the desired results due to the unavailability of suitable screening tests. In the last years, we have developed new and powerful in silico methodologies to automatically propose, test and redesign mutagenesis strategies for a target protein, based only on evolutionarily conserved physicochemical properties of amino acids in a protein family where the target protein belongs, and on structural properties, including calculation of vibrational entropies, if available, with no need of explicit structure-function relationships. This methodology identifies amino acid positions that are putatively responsible for function, specificity, stability or binding interactions in a family of proteins and calculates amino acid propensity and distributions at each position. Not only conserved amino acid positions in a protein family can be labelled as functionally relevant, but also non-conserved amino acid positions can be identified to have a meaningful functional effect, and even amino acid substitutions that are unobserved in nature. These results can be used to predict if a given mutation can have a functional implication and which mutation is most likely to be functionally silent for a protein. Through several rounds of mutation suggestions, laboratory testing of the mutants and feedback of results to retrain the algorithms, our methodology can be used to rapidly and automatically discard any irrelevant mutation and guide the research focus toward functionally significant mutations. In this work, we will show how we have successfully used our publicly available methods to guide mutant design in enzyme engineering applied to xylanases (producing an improved octuple mutant in a single mutagenesis round), proteases, glucanases, ubiquitin ligases and other enzymes, to alter protein function, stability or thermodynamic properties independently of their catalytic properties in vitro and in vivo. We will also show how the predictions of these methods have been employed to shift chromatographic elution profiles of xylanases and ferritin nanocages for better purification without affecting their activity and to obtain ferritin variants with better properties to be used in nanotechnological applications, including modifications to the external and internal surface of the protein to change its interaction properties, improve its recombinant production, alter the characteristics of nanoparticles within or change its organic molecule carrier capacity. Finally, we will show how a similar approach has been integrated in an artificial intelligence classification scheme to identify somatic mutations in the human VHL gene that are related to renal clear-cell cancer and to predict the clinical outcome and prognosis of pVHL mutation and malfunction in humans, based on specific disruption of interactions with VHL binding partners. Clearly, our techniques show promising performance as a valuable and powerful bioinformatics tool to aid in the computer-aided design of engineered enzyme variants and in the understanding of function-structure, binding and affinity relationships in enzymes and other proteins

    Mutagenesis Objective Search and Selection Tool (MOSST): an algorithm to predict structure-function related mutations in proteins

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Functionally relevant artificial or natural mutations are difficult to assess or predict if no structure-function information is available for a protein. This is especially important to correctly identify functionally significant non-synonymous single nucleotide polymorphisms (nsSNPs) or to design a site-directed mutagenesis strategy for a target protein. A new and powerful methodology is proposed to guide these two decision strategies, based only on conservation rules of physicochemical properties of amino acids extracted from a multiple alignment of a protein family where the target protein belongs, with no need of explicit structure-function relationships.</p> <p>Results</p> <p>A statistical analysis is performed over each amino acid position in the multiple protein alignment, based on different amino acid physical or chemical characteristics, including hydrophobicity, side-chain volume, charge and protein conformational parameters. The variances of each of these properties at each position are combined to obtain a global statistical indicator of the conservation degree of each property. Different types of physicochemical conservation are defined to characterize relevant and irrelevant positions. The differences between statistical variances are taken together as the basis of hypothesis tests at each position to search for functionally significant mutable sites and to identify specific mutagenesis targets. The outcome is used to statistically predict physicochemical consensus sequences based on different properties and to calculate the amino acid propensities at each position in a given protein. Hence, amino acid positions are identified that are putatively responsible for function, specificity, stability or binding interactions in a family of proteins. Once these key functional positions are identified, position-specific statistical distributions are applied to divide the 20 common protein amino acids in each position of the protein's primary sequence into a group of functionally non-disruptive amino acids and a second group of functionally deleterious amino acids.</p> <p>Conclusions</p> <p>With this approach, not only conserved amino acid positions in a protein family can be labeled as functionally relevant, but also non-conserved amino acid positions can be identified to have a physicochemically meaningful functional effect. These results become a discriminative tool in the selection and elaboration of rational mutagenesis strategies for the protein. They can also be used to predict if a given nsSNP, identified, for instance, in a genomic-scale analysis, can have a functional implication for a particular protein and which nsSNPs are most likely to be functionally silent for a protein. This analytical tool could be used to rapidly and automatically discard any irrelevant nsSNP and guide the research focus toward functionally significant mutations. Based on preliminary results and applications, this technique shows promising performance as a valuable bioinformatics tool to aid in the development of new protein variants and in the understanding of function-structure relationships in proteins.</p

    Mathematical modeling of the dynamic storage of iron in ferritin

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Iron is essential for the maintenance of basic cellular processes. In the regulation of its cellular levels, ferritin acts as the main intracellular iron storage protein. In this work we present a mathematical model for the dynamics of iron storage in ferritin during the process of intestinal iron absorption. A set of differential equations were established considering kinetic expressions for the main reactions and mass balances for ferritin, iron and a discrete population of ferritin species defined by their respective iron content.</p> <p>Results</p> <p>Simulation results showing the evolution of ferritin iron content following a pulse of iron were compared with experimental data for ferritin iron distribution obtained with purified ferritin incubated <it>in vitro </it>with different iron levels. Distinctive features observed experimentally were successfully captured by the model, namely the distribution pattern of iron into ferritin protein nanocages with different iron content and the role of ferritin as a controller of the cytosolic labile iron pool (cLIP). Ferritin stabilizes the cLIP for a wide range of total intracellular iron concentrations, but the model predicts an exponential increment of the cLIP at an iron content > 2,500 Fe/ferritin protein cage, when the storage capacity of ferritin is exceeded.</p> <p>Conclusions</p> <p>The results presented support the role of ferritin as an iron buffer in a cellular system. Moreover, the model predicts desirable characteristics for a buffer protein such as effective removal of excess iron, which keeps intracellular cLIP levels approximately constant even when large perturbations are introduced, and a freely available source of iron under iron starvation. In addition, the simulated dynamics of the iron removal process are extremely fast, with ferritin acting as a first defense against dangerous iron fluctuations and providing the time required by the cell to activate slower transcriptional regulation mechanisms and adapt to iron stress conditions. In summary, the model captures the complexity of the iron-ferritin equilibrium, and can be used for further theoretical exploration of the role of ferritin in the regulation of intracellular labile iron levels and, in particular, as a relevant regulator of transepithelial iron transport during the process of intestinal iron absorption.</p

    Cold Adaptation, Ca<sup>2+</sup> Dependency and Autolytic Stability Are Related Features in a Highly Active Cold-Adapted Trypsin Resistant to Autoproteolysis Engineered for Biotechnological Applications

    Get PDF
    <div><p>Pig trypsin is routinely used as a biotechnological tool, due to its high specificity and ability to be stored as an inactive stable zymogen. However, it is not an optimum enzyme for conditions found in wound debriding for medical uses and trypsinization processes for protein analysis and animal cell culturing, where low Ca<sup>2+</sup> dependency, high activity in mild conditions and easy inactivation are crucial. We isolated and thermodynamically characterized a highly active cold-adapted trypsin for medical and laboratory use that is four times more active than pig trypsin at 10<sup>°</sup> C and at least 50% more active than pig trypsin up to 50<sup>°</sup> C. Contrary to pig trypsin, this enzyme has a broad optimum pH between 7 and 10 and is very insensitive to Ca<sup>2+</sup> concentration. The enzyme is only distantly related to previously described cryophilic trypsins. We built and studied molecular structure models of this trypsin and performed molecular dynamic calculations. Key residues and structures associated with calcium dependency and cryophilicity were identified. Experiments indicated that the protein is unstable and susceptible to autoproteolysis. Correlating experimental results and structural predictions, we designed mutations to improve the resistance to autoproteolysis and conserve activity for longer periods after activation. One single mutation provided around 25 times more proteolytic stability. Due to its cryophilic nature, this trypsin is easily inactivated by mild denaturation conditions, which is ideal for controlled proteolysis processes without requiring inhibitors or dilution. We clearly show that cold adaptation, Ca<sup>2+</sup> dependency and autolytic stability in trypsins are related phenomena that are linked to shared structural features and evolve in a concerted fashion. Hence, both structurally and evolutionarily they cannot be interpreted and studied separately as previously done.</p> </div

    Wall-eye stereogram representation of the main features of the KT1 molecular model.

    No full text
    <p>The 3D molecular model of KT1 is represented in a wall-eye stereo view. β-sheets are represented in purple, α-helixes in cyan and loops in pink. The solvent accessible molecular surface is shown as a transparent gray envelope. Particular features of KT1 are highlighted. The TAL is showed in green and the CBL in yellow. The chelated Ca<sup>2+</sup> atom is displayed as an orange sphere. Active site extending loops are shown in violet in the right-lower corner. The three active residues are represented at the middle-right of the figure. At the top, three of the putative cleavage sites amino acid residues are shown: K163, K197 and R230. The primary K96 cleavage site is shown at the bottom of the figure.</p

    Unrooted phylogenetic tree of 28 trypsins inferred from their amino acid sequence alignment.

    No full text
    <p>Five phylogenetically-related clusters were inferred from this analysis: cluster A of shrimp and prawn trypsins (both cryophilic and mesophilic), cluster B of crayfish trypsins, cluster C of other non-crustacean Arthropoda, cluster D of cryophilic vertebrate trypsins, and cluster E of trypsins from mesophilic vertebrates. Cluster A trypsin sequences are shown to be very closely related to krill trypsins KT1 and KT4.</p

    pH dependency of the catalytic activity of the purified krill trypsin.

    No full text
    <p>Comparative plots of enzymatic activity versus pH for the isolated cryophilic krill trypsin and the mesophilic pig trypsin.</p

    Wall-eye stereograms of the structural comparison between calcium binding loops of KT1, PT and CFT.

    No full text
    <p>Wall-stereo view of the molecular superposition of the calcium binding loops (yellow), trailing activation loops (green) and surrounding residues (violet) in the cryophilic krill trypsin 1 (KT1), the mesophilic pig trypsin (PT), and the homologous crayfish trypsin (CFT), showing the structural differences discussed in the text. CBLs and TALs are viewed from a point where the line of view is approximately perpendicular to the molecular surface of the protein. In this way, the closest residues to the viewer are the surface residues, and receding side chains are buried inside the protein structure. Amino acid side chains mentioned in the text are labeled. H80 hydrogen bonds in KT1 are shown as cyan dotted lines.</p

    Temperature dependency of the catalytic activity of the purified krill trypsin.

    No full text
    <p>Comparative plots of enzymatic activity versus temperature in the range from 4 to 80<sup>°</sup>C for the isolated cryophilic krill trypsin and the mesophilic pig trypsin on two substrates, casein (A) and BAPNA (B). The corresponding Arrhenius plots are also shown, including the corresponding linear regressions at each side of the maximum.</p

    SDS-PAGE and Western blot analysis of the recombinant purified proteins and their post-activation products.

    No full text
    <p>(A) SDS-PAGE of the products of the <i>E</i>. <i>coli</i> BL21(DE3)/pET22b-KT1 expression system. Lane 1: purified inactive proteins from the soluble cytoplasmic cell fraction. Lane 2: end products after pig trypsin activation of the sample in lane 1. (B) Western blot of polyhistidine-tagged proteins present in the corresponding samples in (A).</p
    corecore