11 research outputs found

    Computational structure‐based drug design: Predicting target flexibility

    Get PDF
    The role of molecular modeling in drug design has experienced a significant revamp in the last decade. The increase in computational resources and molecular models, along with software developments, is finally introducing a competitive advantage in early phases of drug discovery. Medium and small companies with strong focus on computational chemistry are being created, some of them having introduced important leads in drug design pipelines. An important source for this success is the extraordinary development of faster and more efficient techniques for describing flexibility in three‐dimensional structural molecular modeling. At different levels, from docking techniques to atomistic molecular dynamics, conformational sampling between receptor and drug results in improved predictions, such as screening enrichment, discovery of transient cavities, etc. In this review article we perform an extensive analysis of these modeling techniques, dividing them into high and low throughput, and emphasizing in their application to drug design studies. We finalize the review with a section describing our Monte Carlo method, PELE, recently highlighted as an outstanding advance in an international blind competition and industrial benchmarks.We acknowledge the BSC-CRG-IRB Joint Research Program in Computational Biology. This work was supported by a grant from the Spanish Government CTQ2016-79138-R.J.I. acknowledges support from SVP-2014-068797, awarded by the Spanish Government.Peer ReviewedPostprint (author's final draft

    Selection of protein conformations for structure-based polypharmacology studies

    Get PDF
    Several drugs exert their therapeutic effect through the modulation of multiple targets. Structure-based approaches hold great promise for identifying compounds with the desired polypharmacological profiles. These methods use knowledge of the protein binding sites to identify stereoelectronically complementary ligands. The selection of the most suitable protein conformations to be used in the design process is vital, especially for multitarget drug design in which the same ligand has to be accommodated in multiple binding pockets. Herein, we focus on currently available techniques for the selection of the most suitable protein conformations for multitarget drug design, compare the potential advantages and limitations of each method, and comment on how their combination could help in polypharmacology drug design

    Molecular modeling studies on HIV-1 Reverse Transcriptase (RT) and Heat shock protein (Hsp) 90 as a potential anti-HIV-1 target.

    Get PDF
    Masters Degree. University of KwaZulu-Natal, Durban.Human immunodeficiency virus (HIV) infection is the leading cause of death globally. This dissertation addresses two HIV-1 target proteins namely, HIV-1 Reverse Transcriptase (RT) and Heat shock protein (Hsp) 90. More specifically for HIV-1 RT, a case study for the identification of potential inhibitors as anti-HIV agents was carried out. A more refined virtual screening (VS) approach was implemented, which was an improvement on work previously published by our group- “target-bound pharmacophore modeling approach”. This study generated a pharmacophore library based only on highly contributing amino acid residues (HCAAR), instead of arbitrary pharmacophores, most commonly used in the conventional approaches in literature. HCAAR were distinguished based on free binding energy (FBE) contributions, obtained using calculation from molecular dynamics (MD) simulations. Previous approaches have relied on the docking score (DS) to generate energy-based pharmacophore models. However, DS are reportedly unreliable. Thus we present a model for a per-residue energy decomposition (PRED), constructed from MD simulation ensembles generating a more trustworthy pharmacophore model which can be applied in drug discovery workflow. This approach was employed in screening for potential HIV-1 RT inhibitors using the pharmacophoric features of the compound GSK952. The complex was subjected to docking and thereafter MD simulations confirmed the stability of the system. Experimentally determined inhibitors with known HIV-RT inhibitory activity were used to validate the proposed protocol. Two potential hits ZINC46849657 and ZINC54359621 showed a significant potential with regards to FBE. Reported results obtained from this work confirm that this new approach is favourable to the future of drug design process. Hsp90 was recently discovered to play a vital role in HIV-1 replication. Thus has emerged, as a promising target for anti-HIV-1 drugs. The molecular mechanism of Hsp90 is poorly understood, thus the second study was aimed to address this issue and provide a clear insight to the inhibition mechanism of Hsp90. Reasonable continuous MD simulations were employed for both unbound and bound Hsp90 conformations, to understand the dimerization and inhibition mechanisms. Results demonstrated that coumermycin A1 (C-A1), a newly discovered Hsp90 inhibitor, binds at the CTD dimer of Hsp90 and lead to a significant separation between orthogonally opposed residues, such as Arg591.B, Lys594.A, Ser663.A, Thr653.B, Ala665.A, Thr649.B, Leu646.B and Asn669A. A Large difference in magnitudes was observed in the radius of gyration (Rg), per-residue fluctuation, root-mean-square deviation (RMSD) and root-mean-square fluctuation (RMSF) confirming a completely more flexible state for the unbound conformation associated with dimerization. Whereas, a less globally correlated motion in the case of the bound conformer of Hsp90 approved a reduction of the dimeric process. This undoubtedly underlines the inhibition process due to ligand binding. The detailed dynamic analyses of Hsp90 presented herein are believed to give a greater insight and understanding to the function and mechanisms of inhibition of Hsp90. The report on the inhibitor-binding mode would also be of great assistance in the design of prospective inhibitors against Hsp90 as potential HIV target

    Massively-Parallel Computational Identification of Novel Broad Spectrum Antivirals to Combat Coronavirus Infection

    Get PDF
    Philosophiae Doctor - PhDGiven the significant disease burden caused by human coronaviruses, the discovery of an effective antiviral strategy is paramount, however there is still no effective therapy to combat infection. This thesis details the in silica exploration of ligand libraries to identify candidate lead compounds that, based on multiple criteria, have a high probability of inhibiting the 3 chymotrypsin-like protease (3CUro) of human coronaviruses. Atomistic models of the 3CUro were obtained from the Protein Data Bank or theoretical models were successfully generated by homology modelling. These structures served the basis of both structure- and ligand-based drug design studies. Consensus molecular docking and pharmacophore modelling protocols were adapted to explore the ZINC Drugs-Now dataset in a high throughput virtual screening strategy to identify ligands which computationally bound to the active site of the 3CUro . Molecular dynamics was further utilized to confirm the binding mode and interactions observed in the static structure- and ligand-based techniques were correct via analysis of various parameters in a IOns simulation. Molecular docking and pharmacophore models identified a total of 19 ligands which displayed the potential to computationally bind to all 3CUro included in the study. Strategies employed to identify these lead compounds also indicated that a known inhibitor of the SARS-Co V 3CUro also has potential as a broad spectrum lead compound. Further analysis by molecular dynamic simulations largely confirmed the binding mode and ligand orientations identified by the former techniques. The comprehensive approach used in this study improves the probability of identifying experimental actives and represents a cost effective pipeline for the often expensive and time consuming process of lead discovery. These identified lead compounds represent an ideal starting point for assays to confirm in vitro activity, where experimentally confirmed actives will be proceeded to subsequent studies on lead optimization

    COMPUTATIONAL TECHNIQUES TO EVALUATE AT ATOMIC LEVEL THE MECHANISM OF MOLECULAR BINDING

    Get PDF
    Integrins are an important class of transmembrane receptors that relay signals bidirectionally across the plasma membrane, regulating several cell functions and playing a key role in diverse pathological processes. Specifically, integrin subtype \u3b1IIb\u3b23 is involved in thrombosis and stroke, while subtypes \u3b1v\u3b23 and \u3b15\u3b21 play an important role in angiogenesis and tumor progression. They therefore emerged as attractive pharmacological targets. In the past decades several peptides and peptidomimetics targeting these proteins and based on the integrin recognition motif RGD (Arg-Gly-Asp) have been developed, whereby their affinity and selectivity for a specific integrin subtype have been fine-tuned by modulation of RGD flanking residues, by cyclization or by introduction of chemical modifications. Thus far, the design and development of RGD-based cyclopeptides have been mainly based on empirical approaches, requiring expensive and time-consuming synthesis campaigns. In this field, the employment of computational tools, that could be valuable to accelerate the drug design and optimization process, has been limited by the inherent difficulties to predict in silico the three-dimensional structure and the inhibitory activity of cyclopeptides. However, recent improvements in both computational resources and in docking and modeling techniques are expected to open new perspectives in the development of cyclopeptides as modulators of protein-protein interactions and, particularly, as integrin inhibitors. Within this PhD project, I have investigated the applicability of computational techniques in predicting and rationalizing how the environment of the recognition-motif in cyclopeptides (i.e. flanking residues and introduction of chemical modification) could influence their integrin affinity and selectivity. These features can regulate integrin affinity both by favoring direct interactions with the receptor and/or by modulating the three-dimensional conformation properties of the recognition motif. To take into account both these aspects, I have proposed and optimized a multi-stage computational protocol in which an exhaustive conformational sampling of the investigated cyclopeptides is followed by docking calculations and re-scoring techniques. Specifically: i) the exhaustive sampling could be achieved by using Metadynamics in its Bias Exchange variant (BE-META), an enhanced sampling technique which represents a valuable methodology for the acceleration of rare events, allowing to cross the high free energy barriers characteristic of cyclopeptides and providing reliable estimations of the populations of the accessible conformers. ii) The docking calculations, complemented with the re-scoring technique MM-GB/SA (Molecular Mechanics Generalized Born Surface Area) and the cluster analysis of the decoy poses, allow to evaluate the ability of each peptide to engage interactions with the receptors and to rank the docking poses according to their binding ability; iii) a joint analysis of the previous outcomes results in a reliable ranking of cyclopeptides based on their binding affinity and in the rationalization of their structure-activity relationship. This computational protocol has been exploited in two different applications, illustrated within the thesis. In the first application the protocol has been applied to rationalize how the introduction of chemical modifications, specifically backbone N-methylation, impacts on the equilibrium conformation and consequently on the integrin affinity of five RGD containing cyclic hexapeptides, which were previously generated by the group of professor Kessler to modulate their selectivity for \u3b1IIb\u3b23 integrin. The study revealed that backbone N-methylation affects the preferences of the \u3c6 dihedral angle of the methylated residue, specifically favoring the adoption of additional conformations, characterized by a 180\ub0 twist of the peptide bond plane preceding the methylated residue. These twists of dihedral angles were found to have relevant consequences on the cyclopeptides conformation, influencing the formation of intra-molecular hydrogen bonds as well as some structural features which are known to be fundamental in integrin binding. Both structural analysis and docking calculations allowed to identify the \u201cbioactive\u201d conformation (i.e. an extended RGD conformation able to recapitulate the canonical electrostatic and the additional stabilizing hydrophobic interactions). Of note, the cyclopeptides that are pre-organized, already in their free state, in this bioactive conformation are the ones displaying the best \u3b1IIb\u3b23 binding affinity in terms of IC50 values, confirming that pre-organization of cyclopeptides in solution can strongly affect their binding strength to the receptor and demonstrating that the knowledge of their conformational equilibrium is fundamental to provide reliable affinity predictions. In the second application, I have focused my attention on cyclopeptides harboring a recently discovered integrin recognition motif: isoDGR (isoAsp-Gly-Arg), deriving from the spontaneous deamidation of NGR (Asp-Gly-Arg) sequence present in integrin natural ligands. As a preliminary step, I have systematically tested the accuracy of eight Molecular Mechanics force fields in reproducing the equilibrium properties of isoDGR-based cyclopeptides, for which NMR experiments have been acquired. The comparison between simulated and NMR-derived data (i.e. chemical shifts and J scalar couplings) revealed that, while most of the investigated force fields can properly reproduce the equilibrium conformational properties of cyclic peptides, only two of them (i.e. the AMBER force fields ff99sb-ildn and ff99sb*-ildn) are able to recover the NMR observables characteristics of the non-standard residue isoAspartate with an accuracy close to the systematic uncertainty. Overall, these results suggest that the transferability of force field parameters to non standard amino acids is not straightforward. However, two force fields allowed to obtain a satisfactory accuracy and have been therefore employed for the subsequent investigation. I thus applied the computational protocol to rationalize the diverse selectivity and affinity profiles for integrins \u3b1v\u3b23 and \u3b15\u3b21, both related to cancer, displayed by three isoDGR-based cyclic hexapeptides. These molecules differ in the residues flanking the isoDGR motif and show appealing tumor-homing properties; specifically it has been shown that one of these, c(CGisoDGRG), can be coupled with human serum albumin through a chemical linker to be used as a drug delivery agent for functionalized gold nanoparticles. Herein, I investigated the role of the chemical linker in improving affinity and selectivity of c(CGisoDGRG) for \u3b1v\u3b23. The application of the multi-stage protocol allowed to propose an explanation for the different selectivity profiles displayed by these molecules, where the direct interactions engaged by the flanking residues and/or their steric hindrance seem to be largely responsible for the observed different affinities. As a last result, through the combination of MD and NMR techniques, I demonstrated that the chemical linker improved the \u3b1v\u3b23 affinity of c(CGisoDGRG) by engaging direct interactions with the receptor and I proposed two possible complex models, which well-reproduce data from Saturation Transfer Difference experiments. Overall, in this PhD work I have shown that the combination of different computational techniques, BE-META, docking and MM-GB/SA re-scoring, could be a reliable approach to perform structure-activity relationship studies in cyclopeptides. Specifically, the proposed protocol is able to predict the influence of the recognition motif environment (i.e. chemical modification and flanking residues) on integrin affinities. These two features regulate integrin affinity differently: the first one by conformational modulation of the recognition motif, the second by engaging direct interactions with the receptor. Of note, the approach can deal with both these mechanisms of affinity modulation. We expect that the protocol herein described could be used in future to screen novel peptides library or to complement biochemical experiments during the drug optimization stages, assisting organic chemists in the design of more effective integrin-targeting peptides

    Enumeration, conformation sampling and population of libraries of peptide macrocycles for the search of chemotherapeutic cardioprotection agents

    Get PDF
    Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is non‐trivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an in‐silico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardio‐protective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilin‐D perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardio‐protective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilin‐D showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilin‐D target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used

    Towards Personalized Medicine: Computational Approaches to Support Drug Design and Clinical Decision Making

    Get PDF
    The future looks bright for a clinical practice that tailors the therapy with the best efficacy and highest safety to a patient. Substantial amounts of funding have resulted in technological advances regarding patient-centered data acquisition --- particularly genetic data. Yet, the challenge of translating this data into clinical practice remains open. To support drug target characterization, we developed a global maximum entropy-based method that predicts protein-protein complexes including the three-dimensional structure of their interface from sequence data. To further speed up the drug development process, we present methods to reposition drugs with established safety profiles to new indications leveraging paths in cellular interaction networks. We validated both methods on known data, demonstrating their ability to recapitulate known protein complexes and drug-indication pairs, respectively. After studying the extent and characteristics of genetic variation with a predicted impact on protein function across 60,607 individuals, we showed that most patients carry variants in drug-related genes. However, for the majority of variants, their impact on drug efficacy remains unknown. To inform personalized treatment decisions, it is thus crucial to first collate knowledge from open data sources about known variant effects and to then close the knowledge gaps for variants whose effect on drug binding is still not characterized. Here, we built an automated annotation pipeline for patient-specific variants whose value we illustrate for a set of patients with hepatocellular carcinoma. We further developed a molecular modeling protocol to predict changes in binding affinity in proteins with genetic variants which we evaluated for several clinically relevant protein kinases. Overall, we expect that each presented method has the potential to advance personalized medicine by closing knowledge gaps about protein interactions and genetic variation in drug-related genes. To reach clinical applicability, challenges with data availability need to be overcome and prediction performance should be validated experimentally.Therapien mit der besten Wirksamkeit und höchsten Sicherheit werden in Zukunft auf den Patienten zugeschnitten werden. Hier haben erhebliche finanzielle Mittel zu technologischen Fortschritten bei der patientenzentrierten Datenerfassung gefĂŒhrt, aber diese Daten in die klinische Praxis zu ĂŒbertragen, bleibt aktuell noch eine Herausforderung. Um die Wirkstoffforschung in der Charakterisierung therapeutischer Zielproteine zu unterstĂŒtzen, haben wir eine Maximum-Entropie-Methode entwickelt, die Protein-Interaktionen und ihre dreidimensionalen Struktur aus Sequenzdaten vorhersagt. DarĂŒber hinaus, stellen wir Methoden zur Repositionierung von etablierten Arzneimitteln auf neue Indikationen vor, die Pfade in zellulĂ€ren Interaktionsnetze nutzen. Diese Methoden haben wir anhand bekannter Daten validiert und ihre FĂ€higkeit demonstriert, bekannte Proteinkomplexe bzw. Wirkstoff-Indikations-Paare zu rekapitulieren. Unsere Analyse genetischer Variation mit einem Einfluss auf die Proteinfunktion in 60,607 Individuen konnte zeigen, dass nahezu jeder Patient funktionsverĂ€ndernde Varianten in Medikamenten-assoziierten Genen trĂ€gt. Der direkte Einfluss der meisten beobachteten Varianten auf die Medikamenten-Wirksamkeit ist jedoch noch unbekannt. Um dennoch personalisierte Behandlungsentscheidungen treffen zu können, prĂ€sentieren wir eine Annotationspipeline fĂŒr genetische Varianten, deren Wert wir fĂŒr Patienten mit hepatozellulĂ€rem Karzinom illustrieren konnten. DarĂŒber hinaus haben wir ein molekulares Modellierungsprotokoll entwickelt, um die VerĂ€nderungen in der BindungsaffinitĂ€t von Proteinen mit genetischen Varianten voraussagen. Insgesamt sind wir davon ĂŒberzeugt, dass jede der vorgestellten Methoden das Potential hat, WissenslĂŒcken ĂŒber Proteininteraktionen und genetische Variationen in medikamentenbezogenen Genen zu schlie{\ss}en und somit das Feld der personalisierten Medizin voranzubringen. Um klinische Anwendbarkeit zu erreichen, gilt es in der Zukunft, verbleibende Herausforderungen bei der DatenverfĂŒgbarkeit zu bewĂ€ltigen und unsere Vorhersagen experimentell zu validieren

    Improving docking results via reranking of ensembles of ligand poses in multiple X-ray protein conformations with MM-GBSA

    No full text
    There is a tendency in the literature to be critical of scoring functions when docking programs perform poorly. The assumption is that existing scoring functions need to be enhanced or new ones developed in order to improve the performance of docking programs for tasks such as pose prediction and virtual screening. However, failures can result from either sampling or scoring (or a combination of the two), although less emphasis tends to be given to the former. In this work, we use the programs GOLD and Glide on a high-quality data set to explore whether failures in pose prediction and binding affinity estimation can be attributable more to sampling or scoring. We show that identification of the correct pose (docking power) can be improved by incorporating ligand strain into the scoring function or rescoring an ensemble of diverse docking poses with MM-GBSA in a postprocessing step. We explore the use of nondefault docking settings and find that enhancing ligand sampling also improves docking power, again suggesting that sampling is more limiting than scoring for the docking programs investigated in this work. In cross-docking calculations (docking a ligand to a noncognate receptor structure) we observe a significant reduction in the accuracy of pose ranking, as expected and has been reported by others; however, we demonstrate that these alternate poses may in fact be more complementary between the ligand and the rigid receptor conformation, emphasizing that treating the receptor rigidly is an artificial constraint on the docking problem. We simulate protein flexibility by the use of multiple crystallographic conformations of a protein and demonstrate that docking results can be improved with this level of protein sampling. This work indicates the need for better sampling in docking programs, especially for the receptor. This study also highlights the variable descriptive value of RMSD as the sole arbiter of pose replication quality. It is shown that ligand poses within 2 Å of the crystallographic one can show dramatic differences in calculated relative protein-ligand energies. MM-GBSA rescoring of distinct poses overcomes some of the sensitivities of pose ranking experienced by the docking scoring functions due to protein preparation and binding site definition

    Improving Docking Results via Reranking of Ensembles of Ligand Poses in Multiple X‑ray Protein Conformations with MM-GBSA

    No full text
    There is a tendency in the literature to be critical of scoring functions when docking programs perform poorly. The assumption is that existing scoring functions need to be enhanced or new ones developed in order to improve the performance of docking programs for tasks such as pose prediction and virtual screening. However, failures can result from either sampling or scoring (or a combination of the two), although less emphasis tends to be given to the former. In this work, we use the programs GOLD and Glide on a high-quality data set to explore whether failures in pose prediction and binding affinity estimation can be attributable more to sampling or scoring. We show that identification of the correct pose (docking power) can be improved by incorporating ligand strain into the scoring function or rescoring an ensemble of diverse docking poses with MM-GBSA in a postprocessing step. We explore the use of nondefault docking settings and find that enhancing ligand sampling also improves docking power, again suggesting that sampling is more limiting than scoring for the docking programs investigated in this work. In cross-docking calculations (docking a ligand to a noncognate receptor structure) we observe a significant reduction in the accuracy of pose ranking, as expected and has been reported by others; however, we demonstrate that these alternate poses may in fact be more complementary between the ligand and the rigid receptor conformation, emphasizing that treating the receptor rigidly is an artificial constraint on the docking problem. We simulate protein flexibility by the use of multiple crystallographic conformations of a protein and demonstrate that docking results can be improved with this level of protein sampling. This work indicates the need for better sampling in docking programs, especially for the receptor. This study also highlights the variable descriptive value of RMSD as the sole arbiter of pose replication quality. It is shown that ligand poses within 2 Å of the crystallographic one can show dramatic differences in calculated relative protein–ligand energies. MM-GBSA rescoring of distinct poses overcomes some of the sensitivities of pose ranking experienced by the docking scoring functions due to protein preparation and binding site definition
    corecore