11 research outputs found
Computational structureâbased drug design: Predicting target flexibility
The role of molecular modeling in drug design has experienced a significant revamp in the last decade. The increase in computational resources and molecular models, along with software developments, is finally introducing a competitive advantage in early phases of drug discovery. Medium and small companies with strong focus on computational chemistry are being created, some of them having introduced important leads in drug design pipelines. An important source for this success is the extraordinary development of faster and more efficient techniques for describing flexibility in threeâdimensional structural molecular modeling. At different levels, from docking techniques to atomistic molecular dynamics, conformational sampling between receptor and drug results in improved predictions, such as screening enrichment, discovery of transient cavities, etc. In this review article we perform an extensive analysis of these modeling techniques, dividing them into high and low throughput, and emphasizing in their application to drug design studies. We finalize the review with a section describing our Monte Carlo method, PELE, recently highlighted as an outstanding advance in an international blind competition and industrial benchmarks.We acknowledge the BSC-CRG-IRB Joint Research Program in Computational Biology. This work was supported by a grant
from the Spanish Government CTQ2016-79138-R.J.I. acknowledges support from SVP-2014-068797, awarded by the Spanish Government.Peer ReviewedPostprint (author's final draft
Selection of protein conformations for structure-based polypharmacology studies
Several drugs exert their therapeutic effect through the modulation of multiple targets. Structure-based approaches hold great promise for identifying compounds with the desired polypharmacological profiles. These methods use knowledge of the protein binding sites to identify stereoelectronically complementary ligands. The selection of the most suitable protein conformations to be used in the design process is vital, especially for multitarget drug design in which the same ligand has to be accommodated in multiple binding pockets. Herein, we focus on currently available techniques for the selection of the most suitable protein conformations for multitarget drug design, compare the potential advantages and limitations of each method, and comment on how their combination could help in polypharmacology drug design
Molecular modeling studies on HIV-1 Reverse Transcriptase (RT) and Heat shock protein (Hsp) 90 as a potential anti-HIV-1 target.
Masters Degree. University of KwaZulu-Natal, Durban.Human immunodeficiency virus (HIV) infection is the leading cause of death globally. This dissertation addresses two HIV-1 target proteins namely, HIV-1 Reverse Transcriptase (RT) and Heat shock protein (Hsp) 90. More specifically for HIV-1 RT, a case study for the identification of potential inhibitors as anti-HIV agents was carried out. A more refined virtual screening (VS) approach was implemented, which was an improvement on work previously published by our group- âtarget-bound pharmacophore modeling approachâ. This study generated a pharmacophore library based only on highly contributing amino acid residues (HCAAR), instead of arbitrary pharmacophores, most commonly used in the conventional approaches in literature. HCAAR were distinguished based on free binding energy (FBE) contributions, obtained using calculation from molecular dynamics (MD) simulations. Previous approaches have relied on the docking score (DS) to generate energy-based pharmacophore models. However, DS are reportedly unreliable. Thus we present a model for a per-residue energy decomposition (PRED), constructed from MD simulation ensembles generating a more trustworthy pharmacophore model which can be applied in drug discovery workflow. This approach was employed in screening for potential HIV-1 RT inhibitors using the pharmacophoric features of the compound GSK952. The complex was subjected to docking and thereafter MD simulations confirmed the stability of the system. Experimentally determined inhibitors with known HIV-RT inhibitory activity were used to validate the proposed protocol. Two potential hits ZINC46849657 and ZINC54359621 showed a significant potential with regards to FBE. Reported results obtained from this work confirm that this new approach is favourable to the future of drug design process.
Hsp90 was recently discovered to play a vital role in HIV-1 replication. Thus has emerged, as a promising target for anti-HIV-1 drugs. The molecular mechanism of Hsp90 is poorly understood, thus the second study was aimed to address this issue and provide a clear insight to the inhibition mechanism of Hsp90. Reasonable continuous MD simulations were employed for both unbound and bound Hsp90 conformations, to understand the dimerization and inhibition mechanisms. Results demonstrated that coumermycin A1 (C-A1), a newly discovered Hsp90 inhibitor, binds at the CTD dimer of Hsp90 and lead
to a significant separation between orthogonally opposed residues, such as Arg591.B, Lys594.A, Ser663.A, Thr653.B, Ala665.A, Thr649.B, Leu646.B and Asn669A. A Large difference in magnitudes was observed in the radius of gyration (Rg), per-residue fluctuation, root-mean-square deviation (RMSD) and root-mean-square fluctuation (RMSF) confirming a completely more flexible state for the unbound conformation associated with dimerization. Whereas, a less globally correlated motion in the case of the bound conformer of Hsp90 approved a reduction of the dimeric process. This undoubtedly underlines the inhibition process due to ligand binding. The detailed dynamic analyses of Hsp90 presented herein are believed to give a greater insight and understanding to the function and mechanisms of inhibition of Hsp90. The report on the inhibitor-binding mode would also be of great assistance in the design of prospective inhibitors against Hsp90 as potential HIV target
Massively-Parallel Computational Identification of Novel Broad Spectrum Antivirals to Combat Coronavirus Infection
Philosophiae Doctor - PhDGiven the significant disease burden caused by human coronaviruses, the discovery of an effective antiviral strategy is paramount, however there is still no effective therapy to combat infection. This thesis details the in silica exploration of ligand libraries to identify candidate
lead compounds that, based on multiple criteria, have a high probability of inhibiting the 3 chymotrypsin-like protease (3CUro) of human coronaviruses. Atomistic models of the 3CUro were obtained from the Protein Data Bank or theoretical models were successfully generated by homology modelling. These structures served the basis of both structure- and ligand-based drug design studies. Consensus molecular docking and pharmacophore modelling protocols were adapted to explore the ZINC Drugs-Now dataset in a high throughput virtual screening strategy to identify ligands which computationally bound to the active site of the 3CUro . Molecular dynamics was further utilized to confirm the binding mode and interactions observed in the static structure- and ligand-based techniques were correct via analysis of various parameters in a IOns simulation. Molecular docking and pharmacophore models identified a total of 19 ligands which displayed
the potential to computationally bind to all 3CUro included in the study. Strategies employed to identify these lead compounds also indicated that a known inhibitor of the SARS-Co V 3CUro also has potential as a broad spectrum lead compound. Further analysis by molecular dynamic simulations largely confirmed the binding mode and ligand orientations identified by the former techniques. The comprehensive approach used in this study improves the probability of identifying experimental actives and represents a cost effective pipeline for the often expensive and time consuming process of lead discovery. These identified lead compounds represent an ideal
starting point for assays to confirm in vitro activity, where experimentally confirmed actives will be proceeded to subsequent studies on lead optimization
COMPUTATIONAL TECHNIQUES TO EVALUATE AT ATOMIC LEVEL THE MECHANISM OF MOLECULAR BINDING
Integrins are an important class of transmembrane receptors that relay signals bidirectionally across the plasma membrane, regulating several cell functions and playing a key role in diverse pathological processes. Specifically, integrin subtype \u3b1IIb\u3b23 is involved in thrombosis and stroke, while subtypes \u3b1v\u3b23 and \u3b15\u3b21 play an important role in angiogenesis and tumor progression. They therefore emerged as attractive pharmacological targets. In the past decades several peptides and peptidomimetics targeting these proteins and based on the integrin recognition motif RGD (Arg-Gly-Asp) have been developed, whereby their affinity and selectivity for a specific integrin subtype have been fine-tuned by modulation of RGD flanking residues, by cyclization or by introduction of chemical modifications. Thus far, the design and development of RGD-based cyclopeptides have been mainly based on empirical approaches, requiring expensive and time-consuming synthesis campaigns. In this field, the employment of computational tools, that could be valuable to accelerate the drug design and optimization process, has been limited by the inherent difficulties to predict in silico the three-dimensional structure and the inhibitory activity of cyclopeptides. However, recent improvements in both computational resources and in docking and modeling techniques are expected to open new perspectives in the development of cyclopeptides as modulators of protein-protein interactions and, particularly, as integrin inhibitors.
Within this PhD project, I have investigated the applicability of computational techniques in predicting and rationalizing how the environment of the recognition-motif in cyclopeptides (i.e. flanking residues and introduction of chemical modification) could influence their integrin affinity and selectivity. These features can regulate integrin affinity both by favoring direct interactions with the receptor and/or by modulating the three-dimensional conformation properties of the recognition motif. To take into account both these aspects, I have proposed and optimized a multi-stage computational protocol in which an exhaustive conformational sampling of the investigated cyclopeptides is followed by docking calculations and re-scoring techniques. Specifically: i) the exhaustive sampling could be achieved by using Metadynamics in its Bias Exchange variant (BE-META), an enhanced sampling technique which represents a valuable methodology for the acceleration of rare events, allowing to cross the high free energy barriers characteristic of cyclopeptides and providing reliable estimations of the populations of the accessible conformers. ii) The docking calculations, complemented with the re-scoring technique MM-GB/SA (Molecular Mechanics Generalized Born Surface Area) and the cluster analysis of the decoy poses, allow to evaluate the ability of each peptide to engage interactions with the receptors and to rank the docking poses according to their binding ability; iii) a joint analysis of the previous outcomes results in a reliable ranking of cyclopeptides based on their binding affinity and in the rationalization of their structure-activity relationship. This computational protocol has been exploited in two different applications, illustrated within the thesis.
In the first application the protocol has been applied to rationalize how the introduction of chemical modifications, specifically backbone N-methylation, impacts on the equilibrium conformation and consequently on the integrin affinity of five RGD containing cyclic hexapeptides, which were previously generated by the group of professor Kessler to modulate their selectivity for \u3b1IIb\u3b23 integrin. The study revealed that backbone N-methylation affects the preferences of the \u3c6 dihedral angle of the methylated residue, specifically favoring the adoption of additional conformations, characterized by a 180\ub0 twist of the peptide bond plane preceding the methylated residue. These twists of dihedral angles were found to have relevant consequences on the cyclopeptides conformation, influencing the formation of intra-molecular hydrogen bonds as well as some structural features which are known to be fundamental in integrin binding. Both structural analysis and docking calculations allowed to identify the \u201cbioactive\u201d conformation (i.e. an extended RGD conformation able to recapitulate the canonical electrostatic and the additional stabilizing hydrophobic interactions). Of note, the cyclopeptides that are pre-organized, already in their free state, in this bioactive conformation are the ones displaying the best \u3b1IIb\u3b23 binding affinity in terms of IC50 values, confirming that pre-organization of cyclopeptides in solution can strongly affect their binding strength to the receptor and demonstrating that the knowledge of their conformational equilibrium is fundamental to provide reliable affinity predictions.
In the second application, I have focused my attention on cyclopeptides harboring a recently discovered integrin recognition motif: isoDGR (isoAsp-Gly-Arg), deriving from the spontaneous deamidation of NGR (Asp-Gly-Arg) sequence present in integrin natural ligands. As a preliminary step, I have systematically tested the accuracy of eight Molecular Mechanics force fields in reproducing the equilibrium properties of isoDGR-based cyclopeptides, for which NMR experiments have been acquired. The comparison between simulated and NMR-derived data (i.e. chemical shifts and J scalar couplings) revealed that, while most of the investigated force fields can properly reproduce the equilibrium conformational properties of cyclic peptides, only two of them (i.e. the AMBER force fields ff99sb-ildn and ff99sb*-ildn) are able to recover the NMR observables characteristics of the non-standard residue isoAspartate with an accuracy close to the systematic uncertainty. Overall, these results suggest that the transferability of force field parameters to non standard amino acids is not straightforward. However, two force fields allowed to obtain a satisfactory accuracy and have been therefore employed for the subsequent investigation. I thus applied the computational protocol to rationalize the diverse selectivity and affinity profiles for integrins \u3b1v\u3b23 and \u3b15\u3b21, both related to cancer, displayed by three isoDGR-based cyclic hexapeptides. These molecules differ in the residues flanking the isoDGR motif and show appealing tumor-homing properties; specifically it has been shown that one of these, c(CGisoDGRG), can be coupled with human serum albumin through a chemical linker to be used as a drug delivery agent for functionalized gold nanoparticles. Herein, I investigated the role of the chemical linker in improving affinity and selectivity of c(CGisoDGRG) for \u3b1v\u3b23. The application of the multi-stage protocol allowed to propose an explanation for the different selectivity profiles displayed by these molecules, where the direct interactions engaged by the flanking residues and/or their steric hindrance seem to be largely responsible for the observed different affinities. As a last result, through the combination of MD and NMR techniques, I demonstrated that the chemical linker improved the \u3b1v\u3b23 affinity of c(CGisoDGRG) by engaging direct interactions with the receptor and I proposed two possible complex models, which well-reproduce data from Saturation Transfer Difference experiments.
Overall, in this PhD work I have shown that the combination of different computational techniques, BE-META, docking and MM-GB/SA re-scoring, could be a reliable approach to perform structure-activity relationship studies in cyclopeptides. Specifically, the proposed protocol is able to predict the influence of the recognition motif environment (i.e. chemical modification and flanking residues) on integrin affinities. These two features regulate integrin affinity differently: the first one by conformational modulation of the recognition motif, the second by engaging direct interactions with the receptor. Of note, the approach can deal with both these mechanisms of affinity modulation. We expect that the protocol herein described could be used in future to screen novel peptides library or to complement biochemical experiments during the drug optimization stages, assisting organic chemists in the design of more effective integrin-targeting peptides
Enumeration, conformation sampling and population of libraries of peptide macrocycles for the search of chemotherapeutic cardioprotection agents
Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is nonâtrivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an inâsilico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardioâprotective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilinâD perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardioâprotective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilinâD showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilinâD target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used
Towards Personalized Medicine: Computational Approaches to Support Drug Design and Clinical Decision Making
The future looks bright for a clinical practice that tailors the
therapy with the best efficacy and highest safety to a patient. Substantial
amounts of funding have resulted in technological advances regarding
patient-centered data acquisition --- particularly genetic data. Yet, the
challenge of translating this data into clinical practice remains open.
To support drug target characterization, we developed a global maximum
entropy-based method that predicts protein-protein complexes including the
three-dimensional structure of their interface from sequence data. To further
speed up the drug development process, we present methods to reposition drugs
with established safety profiles to new indications leveraging paths in
cellular interaction networks. We validated both methods on known data,
demonstrating their ability to recapitulate known protein complexes and
drug-indication pairs, respectively.
After studying the extent and characteristics of genetic variation with a
predicted impact on protein function across 60,607 individuals, we showed that
most patients carry variants in drug-related genes. However, for the majority
of variants, their impact on drug efficacy remains unknown. To inform
personalized treatment decisions, it is thus crucial to first collate knowledge
from open data sources about known variant effects and to then close the
knowledge gaps for variants whose effect on drug binding is still not
characterized. Here, we built an automated annotation pipeline for
patient-specific variants whose value we illustrate for a set of patients with
hepatocellular carcinoma. We further developed a molecular modeling protocol to
predict changes in binding affinity in proteins with genetic variants which we
evaluated for several clinically relevant protein kinases.
Overall, we expect that each presented method has the potential to advance
personalized medicine by closing knowledge gaps about protein interactions and
genetic variation in drug-related genes. To reach clinical applicability,
challenges with data availability need to be overcome and prediction
performance should be validated experimentally.Therapien mit der besten Wirksamkeit und höchsten
Sicherheit werden in Zukunft auf den Patienten zugeschnitten werden. Hier haben
erhebliche finanzielle Mittel zu technologischen Fortschritten bei der
patientenzentrierten Datenerfassung gefĂŒhrt, aber diese Daten in die
klinische Praxis zu ĂŒbertragen, bleibt aktuell noch eine Herausforderung.
Um die Wirkstoffforschung in der Charakterisierung therapeutischer Zielproteine
zu unterstĂŒtzen, haben wir eine Maximum-Entropie-Methode entwickelt,
die Protein-Interaktionen und ihre dreidimensionalen Struktur
aus Sequenzdaten vorhersagt. DarĂŒber hinaus, stellen wir Methoden
zur Repositionierung von etablierten Arzneimitteln auf
neue Indikationen vor, die Pfade in zellulÀren Interaktionsnetze nutzen.
Diese Methoden haben wir anhand bekannter Daten validiert und ihre FĂ€higkeit
demonstriert, bekannte Proteinkomplexe bzw. Wirkstoff-Indikations-Paare zu
rekapitulieren.
Unsere Analyse genetischer Variation mit einem Einfluss auf die
Proteinfunktion in 60,607 Individuen konnte zeigen, dass nahezu jeder Patient
funktionsverÀndernde Varianten in Medikamenten-assoziierten Genen
trÀgt. Der direkte Einfluss der meisten beobachteten Varianten auf die
Medikamenten-Wirksamkeit ist jedoch noch unbekannt. Um dennoch personalisierte
Behandlungsentscheidungen treffen zu können, prĂ€sentieren wir eine Annotationspipeline fĂŒr genetische
Varianten, deren Wert wir fĂŒr Patienten mit hepatozellulĂ€rem
Karzinom illustrieren konnten. DarĂŒber hinaus haben wir ein molekulares
Modellierungsprotokoll entwickelt, um die VerÀnderungen in der
BindungsaffinitÀt von Proteinen mit genetischen Varianten voraussagen.
Insgesamt sind wir davon ĂŒberzeugt, dass jede der vorgestellten Methoden das
Potential hat, WissenslĂŒcken ĂŒber Proteininteraktionen und
genetische Variationen in medikamentenbezogenen Genen zu schlie{\ss}en und
somit das Feld der personalisierten Medizin voranzubringen. Um klinische
Anwendbarkeit zu erreichen, gilt es in der Zukunft, verbleibende
Herausforderungen bei der DatenverfĂŒgbarkeit zu bewĂ€ltigen und unsere
Vorhersagen experimentell zu validieren
Improving docking results via reranking of ensembles of ligand poses in multiple X-ray protein conformations with MM-GBSA
There is a tendency in the literature to be critical of scoring functions when docking programs perform poorly. The assumption is that existing scoring functions need to be enhanced or new ones developed in order to improve the performance of docking programs for tasks such as pose prediction and virtual screening. However, failures can result from either sampling or scoring (or a combination of the two), although less emphasis tends to be given to the former. In this work, we use the programs GOLD and Glide on a high-quality data set to explore whether failures in pose prediction and binding affinity estimation can be attributable more to sampling or scoring. We show that identification of the correct pose (docking power) can be improved by incorporating ligand strain into the scoring function or rescoring an ensemble of diverse docking poses with MM-GBSA in a postprocessing step. We explore the use of nondefault docking settings and find that enhancing ligand sampling also improves docking power, again suggesting that sampling is more limiting than scoring for the docking programs investigated in this work. In cross-docking calculations (docking a ligand to a noncognate receptor structure) we observe a significant reduction in the accuracy of pose ranking, as expected and has been reported by others; however, we demonstrate that these alternate poses may in fact be more complementary between the ligand and the rigid receptor conformation, emphasizing that treating the receptor rigidly is an artificial constraint on the docking problem. We simulate protein flexibility by the use of multiple crystallographic conformations of a protein and demonstrate that docking results can be improved with this level of protein sampling. This work indicates the need for better sampling in docking programs, especially for the receptor. This study also highlights the variable descriptive value of RMSD as the sole arbiter of pose replication quality. It is shown that ligand poses within 2 Ă
of the crystallographic one can show dramatic differences in calculated relative protein-ligand energies. MM-GBSA rescoring of distinct poses overcomes some of the sensitivities of pose ranking experienced by the docking scoring functions due to protein preparation and binding site definition
Improving Docking Results via Reranking of Ensembles of Ligand Poses in Multiple Xâray Protein Conformations with MM-GBSA
There is a tendency in the literature
to be critical of scoring
functions when docking programs perform poorly. The assumption is
that existing scoring functions need to be enhanced or new ones developed
in order to improve the performance of docking programs for tasks
such as pose prediction and virtual screening. However, failures can
result from either sampling or scoring (or a combination of the two),
although less emphasis tends to be given to the former. In this work,
we use the programs GOLD and Glide on a high-quality data set to explore
whether failures in pose prediction and binding affinity estimation
can be attributable more to sampling or scoring. We show that identification
of the correct pose (docking power) can be improved by incorporating
ligand strain into the scoring function or rescoring an ensemble of
diverse docking poses with MM-GBSA in a postprocessing step. We explore
the use of nondefault docking settings and find that enhancing ligand
sampling also improves docking power, again suggesting that sampling
is more limiting than scoring for the docking programs investigated
in this work. In cross-docking calculations (docking a ligand to a
noncognate receptor structure) we observe a significant reduction
in the accuracy of pose ranking, as expected and has been reported
by others; however, we demonstrate that these alternate poses may
in fact be more complementary between the ligand and the rigid receptor
conformation, emphasizing that treating the receptor rigidly is an
artificial constraint on the docking problem. We simulate protein
flexibility by the use of multiple crystallographic conformations
of a protein and demonstrate that docking results can be improved
with this level of protein sampling. This work indicates the need
for better sampling in docking programs, especially for the receptor.
This study also highlights the variable descriptive value of RMSD
as the sole arbiter of pose replication quality. It is shown that
ligand poses within 2 Ă
of the crystallographic one can show
dramatic differences in calculated relative proteinâligand
energies. MM-GBSA rescoring of distinct poses overcomes some of the
sensitivities of pose ranking experienced by the docking scoring functions
due to protein preparation and binding site definition