55 research outputs found

    Význam slavistických studií dnes

    Get PDF

    Machine Learning in Enzyme Engineering

    Get PDF
    Enzyme engineering plays a central role in developing efficient biocatalysts for biotechnology, biomedicine, and life sciences. Apart from classical rational design and directed evolution approaches, machine learning methods have been increasingly applied to find patterns in data that help predict protein structures, improve enzyme stability, solubility, and function, predict substrate specificity, and guide rational protein design. In this Perspective, we analyze the state of the art in databases and methods used for training and validating predictors in enzyme engineering. We discuss current limitations and challenges which the community is facing and recent advancements in experimental and theoretical methods that have the potential to address those challenges. We also present our view on possible future directions for developing the applications to the design of efficient biocatalysts

    Computational Design of Stable and Soluble Biocatalysts

    Get PDF
    Natural enzymes are delicate biomolecules possessing only marginal thermodynamic stability. Poorly stable, misfolded, and aggregated proteins lead to huge economic losses in the biotechnology and biopharmaceutical industries. Consequently, there is a need to design optimized protein sequences that maximize stability, solubility, and activity over a wide range of temperatures and pH values in buffers of different composition and in the presence of organic cosolvents. This has created great interest in using computational methods to enhance biocatalysts' robustness and solubility. Suitable methods include (i) energy calculations, (ii) machine learning, (iii) phylogenetic analyses, and (iv) combinations of these approaches. We have witnessed impressive progress in the design of stable enzymes over the last two decades, but predictions of protein solubility and expressibility are scarce. Stabilizing mutations can be predicted accurately using available force fields, and the number of sequences available for phylogenetic analyses is growing. In addition, complex computational workflows are being implemented in intuitive web tools, enhancing the quality of protein stability predictions. Conversely, solubility predictors are limited by the lack of robust and balanced experimental data, an inadequate understanding of fundamental principles of protein aggregation, and a dearth of structural information on folding intermediates. Here we summarize recent progress in the development of computational tools for predicting protein stability and solubility, critically assess their strengths and weaknesses, and identify apparent gaps in data and knowledge. We also present perspectives on the computational design of stable and soluble biocatalysts

    Transhalogenation Catalysed by Haloalkane Dehalogenases Engineered to Stop Natural Pathway at Intermediate

    Get PDF
    Haloalkane dehalogenases (HLDs) are alpha/beta-hydrolases that convert halogenated compounds to their corresponding alcohols. The overall kinetic mechanism proceeds via four steps: (i) binding of halogenated substrate, (ii) bimolecular nucleophilic substitution (S(N)2) leading to the cleavage of a carbon-halogen bond and the formation of an alkyl-enzyme intermediate, (iii) nucleophilic addition of a water molecule resulting in the hydrolysis of the intermediate to the corresponding alcohol and (iv) release of the reaction products - an alcohol, a halide ion and a proton. Although, the overall reaction has been reported as irreversible, several kinetic evidences from previous studies suggest the reversibility of the first S(N)2 chemical step. To study this phenomenon, we have engineered HLDs to stop the catalytic cycle at the stage of the alkyl-enzyme intermediate. The ability of the intermediate to exchange halides was confirmed by a stopped-flow fluorescence binding analysis. Finally, the transhalogenation reaction was confirmed with several HLDs and 2,3-dichloropropene in the presence of a high concentration of iodide. The formation of the transhalogenation product 3-iodo-2-chloropropene catalysed by five mutant HLDs was identified by gas chromatography coupled with mass spectrometry. Hereby we demonstrated the reversibility of the cleavage of the carbon-halogen bond by HLDs resulting in a transhalogenation. After optimization, the transhalogenation reaction can possibly find its use in biocatalytic applications. Enabling this reaction by strategically engineering the enzyme to stop at an intermediate in the catalytic cycle that is synthetically more useful than the product of the natural pathway is a novel concept

    SoluProt: prediction of soluble protein expression in Escherichia coli

    Get PDF
    Motivation: Poor protein solubility hinders the production of many therapeutic and industrially useful proteins. Experimental efforts to increase solubility are plagued by low success rates and often reduce biological activity. Computational prediction of protein expressibility and solubility in Escherichia coli using only sequence information could reduce the cost of experimental studies by enabling prioritization of highly soluble proteins. Results: A new tool for sequence-based prediction of soluble protein expression in E.coli, SoluProt, was created using the gradient boosting machine technique with the TargetTrack database as a training set. When evaluated against a balanced independent test set derived from the NESG database, SoluProt's accuracy of 58.5% and AUC of 0.62 exceeded those of a suite of alternative solubility prediction tools. There is also evidence that it could significantly increase the success rate of experimental protein studies

    EnzymeMiner: automated mining of soluble enzymes with diverse structures, catalytic properties and stabilities

    Get PDF
    Millions of protein sequences are being discovered at an incredible pace, representing an inexhaustible source of biocatalysts. Despite genomic databases growing exponentially, classical biochemical characterization techniques are time-demanding, cost-ineffective and low-throughput. Therefore, computational methods are being developed to explore the unmapped sequence space efficiently. Selection of putative enzymes for biochemical characterization based on rational and robust analysis of all available sequences remains an unsolved problem. To address this challenge, we have developed EnzymeMiner-a web server for automated screening and annotation of diverse family members that enables selection of hits for wet-lab experiments. EnzymeMiner prioritizes sequences that are more likely to preserve the catalytic activity and are heterologously expressible in a soluble form in Escherichia coli. The solubility prediction employs the in-house SoluProt predictor developed using machine learning. EnzymeMiner reduces the time devoted to data gathering, multi-step analysis, sequence prioritization and selection from days to hours. The successful use case for the haloalkane dehalogenase family is described in a comprehensive tutorial available on the EnzymeMiner web page

    Engineering enzyme access tunnels

    Get PDF
    Enzymes are efficient and specific catalysts for many essential reactions in biotechnological and pharmaceutical industries. Many times, the natural enzymes do not display the catalytic efficiency, stability or specificity required for these industrial processes. The current enzyme engineering methods offer solutions to this problem, but they mainly target the buried active site where the chemical reaction takes place. Despite being many times ignored, the tunnels and channels connecting the environment with the active site are equally important for the catalytic properties of enzymes. Changes in the enzymatic tunnels and channels affect enzyme activity, specificity, promiscuity, enantioselectivity and stability. This review provides an overview of the emerging field of enzyme access tunnel engineering with case studies describing design of all the aforementioned properties. The software tools for the analysis of geometry and function of the enzymatic tunnels and channels and for the rational design of tunnel modifications will also be discussed. The combination of new software tools and enzyme engineering strategies will provide enzymes with access tunnels and channels specifically tailored for individual industrial processes

    FireProt(DB): database of manually curated protein stability data

    Get PDF
    The majority of naturally occurring proteins have evolved to function under mild conditions inside the living organisms. One of the critical obstacles for the use of proteins in biotechnological applications is their insufficient stability at elevated temperatures or in the presence of salts. Since experimental screening for stabilizing mutations is typically laborious and expensive, in silico predictors are often used for narrowing down the mutational landscape. The recent advances in machine learning and artificial intelligence further facilitate the development of such computational tools. However, the accuracy of these predictors strongly depends on the quality and amount of data used for training and testing, which have often been reported as the current bottleneck of the approach. To address this problem, we present a novel database of experimental thermostability data for single-point mutants FireProt(DB). The database combines the published datasets, data extracted manually from the recent literature, and the data collected in our laboratory. Its user interface is designed to facilitate both types of the expected use: (i) the interactive explorations of individual entries on the level of a protein or mutation and (ii) the construction of highly customized and machine learning-friendly datasets using advanced searching and filtering. The database is freely available at https://loschmidt.chemi.muni.cz/fireprotdb

    Decoding the intricate network of molecular interactions of a hyperstable engineered biocatalyst

    Get PDF
    Computational design of protein catalysts with enhanced stabilities for use in research and enzyme technologies is a challenging task. Using force-field calculations and phylogenetic analysis, we previously designed the haloalkane dehalogenase DhaA115 which contains 11 mutations that confer upon it outstanding thermostability (T-m = 73.5 degrees C; Delta T-m > 23 degrees C). An understanding of the structural basis of this hyperstabilization is required in order to develop computer algorithms and predictive tools. Here, we report X-ray structures of DhaA115 at 1.55 angstrom and 1.6 angstrom resolutions and their molecular dynamics trajectories, which unravel the intricate network of interactions that reinforce the aba-sandwich architecture. Unexpectedly, mutations toward bulky aromatic amino acids at the protein surface triggered long-distance (similar to 27 angstrom) backbone changes due to cooperative effects. These cooperative interactions produced an unprecedented double-lock system that: (i) induced backbone changes, (ii) closed the molecular gates to the active site, (iii) reduced the volumes of the main and slot access tunnels, and (iv) occluded the active site. Despite these spatial restrictions, experimental tracing of the access tunnels using krypton derivative crystals demonstrates that transport of ligands is still effective. Our findings highlight key thermostabilization effects and provide a structural basis for designing new thermostable protein catalysts
    corecore