15 research outputs found

    On evaluating moleculardocking methods for pose prediction and enrichment factors.

    Get PDF
    Four of the most well-known, commercially available docking programs, FlexX, GOLD, GLIDE, and ICM, have been examined for their ligand-docking and virtual-screening capabilities. The relative performance of the programs in reproducing the native ligand conformation from starting SMILES strings for 164 highresolution protein-ligand complexes is presented and compared. Applying only the native scoring functions, the latest versions of these four docking programs were also used to conduct virtual screening for 12 protein targets of therapeutic interest, involving both publicly available structures and AstraZeneca in-house structures. The capability of the four programs to correctly rank-order target-specific active compounds over alternative binders and nonbinders (decoys plus randomly selected compounds) and thereby enrich a small subset of a screening library is compared. Enrichments from the virtual-screening experiments are contrasted with those obtained with alternative 3D shape-matching and 2D similarity database-search methods

    Computational strategies to include protein flexibility in Ligand Docking and Virtual Screening

    Get PDF
    The dynamic character of proteins strongly influences biomolecular recognition mechanisms. With the development of the main models of ligand recognition (lock-and-key, induced fit, conformational selection theories), the role of protein plasticity has become increasingly relevant. In particular, major structural changes concerning large deviations of protein backbones, and slight movements such as side chain rotations are now carefully considered in drug discovery and development. It is of great interest to identify multiple protein conformations as preliminary step in a screening campaign. Protein flexibility has been widely investigated, in terms of both local and global motions, in two diverse biological systems. On one side, Replica Exchange Molecular Dynamics has been exploited as enhanced sampling method to collect multiple conformations of Lactate Dehydrogenase A (LDHA), an emerging anticancer target. The aim of this project was the development of an Ensemble-based Virtual Screening protocol, in order to find novel potent inhibitors. On the other side, a preliminary study concerning the local flexibility of Opioid Receptors has been carried out through ALiBERO approach, an iterative method based on Elastic Network-Normal Mode Analysis and Monte Carlo sampling. Comparison of the Virtual Screening performances by using single or multiple conformations confirmed that the inclusion of protein flexibility in screening protocols has a positive effect on the probability to early recognize novel or known active compounds

    COMPUTATIONAL MODELING OF RNA-SMALL MOLECULE AND RNA-PROTEIN INTERACTIONS

    Get PDF
    The past decade has witnessed an era of RNA biology; despite the considerable discoveries nowadays, challenges still remain when one aims to screen RNA-interacting small molecule or RNA-interacting protein. These challenges imply an immediate need for cost-efficient while predictive computational tools capable of generating insightful hypotheses to discover novel RNA-interacting small molecule or RNA-interacting protein. Thus, we implemented novel computational models in this dissertation to predict RNA-ligand interactions (Chapter 1) and RNA-protein interactions (Chapter 2). Targeting RNA has not garnered comparable interest as protein, and is restricted by lack of computational tools for structure-based drug design. To test the potential of translating molecular docking tools designed for protein to RNA-ligand docking and virtual screening, we benchmarked 5 docking software and 11 scoring functions to assess their performances in pose reproduction, pose ranking, score-RMSD correlation and virtual screening. From this benchmark, we proposed a three-step docking pipelines optimized for virtual screening against RNAs with different flexibility properties. Using this pipeline, we have successfully identified a selective compound binding to GA:UU motif. Both NMR and the subsequent MD simulation proved its selective binding to GA:UU motif flanked by two tandem flexible base pairs next to GA. Consistent to the 3D model, SAR analysis revealed that any R-group substitution would abolish the binding. Current computational methods for RNA-protein interaction prediction (sequence-based or structure-based) are either short of interpretability or robustness. Aware of these pitfalls, we implemented RNA-Protein interaction prediction through Interface Threading (RPIT), which identifies and references a known RNA-protein interface as the template to infer the region where the interaction occurs and predict the interacting propensity based on the interface profiles. To estimate the propensity more accurately, we implemented five statistical scoring functions based our unique collection of non-redundant protein-RNA interaction database. Our benchmark using leave-protein-out cross validation and two external validation sets resulted in overall 70%-80% accuracy of RPIT. Compared with other methods, RPIT offers an inexpensive but robust method for in silico prediction of RNA-protein interaction networks, and for prioritizing putative RNA-protein pairs using virtual screening

    Computational approaches to virtual screening in human central nervous system therapeutic targets

    Get PDF
    In the past several years of drug design, advanced high-throughput synthetic and analytical chemical technologies are continuously producing a large number of compounds. These large collections of chemical structures have resulted in many public and commercial molecular databases. Thus, the availability of larger data sets provided the opportunity for developing new knowledge mining or virtual screening (VS) methods. Therefore, this research work is motivated by the fact that one of the main interests in the modern drug discovery process is the development of new methods to predict compounds with large therapeutic profiles (multi-targeting activity), which is essential for the discovery of novel drug candidates against complex multifactorial diseases like central nervous system (CNS) disorders. This work aims to advance VS approaches by providing a deeper understanding of the relationship between chemical structure and pharmacological properties and design new fast and robust tools for drug designing against different targets/pathways. To accomplish the defined goals, the first challenge is dealing with big data set of diverse molecular structures to derive a correlation between structures and activity. However, an extendable and a customizable fully automated in-silico Quantitative-Structure Activity Relationship (QSAR) modeling framework was developed in the first phase of this work. QSAR models are computationally fast and powerful tool to screen huge databases of compounds to determine the biological properties of chemical molecules based on their chemical structure. The generated framework reliably implemented a full QSAR modeling pipeline from data preparation to model building and validation. The main distinctive features of the designed framework include a)efficient data curation b) prior estimation of data modelability and, c)an-optimized variable selection methodology that was able to identify the most biologically relevant features responsible for compound activity. Since the underlying principle in QSAR modeling is the assumption that the structures of molecules are mainly responsible for their pharmacological activity, the accuracy of different structural representation approaches to decode molecular structural information largely influence model predictability. However, to find the best approach in QSAR modeling, a comparative analysis of two main categories of molecular representations that included descriptor-based (vector space) and distance-based (metric space) methods was carried out. Results obtained from five QSAR data sets showed that distance-based method was superior to capture the more relevant structural elements for the accurate characterization of molecular properties in highly diverse data sets (remote chemical space regions). This finding further assisted to the development of a novel tool for molecular space visualization to increase the understanding of structure-activity relationships (SAR) in drug discovery projects by exploring the diversity of large heterogeneous chemical data. In the proposed visual approach, four nonlinear DR methods were tested to represent molecules lower dimensionality (2D projected space) on which a non-parametric 2D kernel density estimation (KDE) was applied to map the most likely activity regions (activity surfaces). The analysis of the produced probabilistic surface of molecular activities (PSMAs) from the four datasets showed that these maps have both descriptive and predictive power, thus can be used as a spatial classification model, a tool to perform VS using only structural similarity of molecules. The above QSAR modeling approach was complemented with molecular docking, an approach that predicts the best mode of drug-target interaction. Both approaches were integrated to develop a rational and re-usable polypharmacology-based VS pipeline with improved hits identification rate. For the validation of the developed pipeline, a dual-targeting drug designing model against Parkinson’s disease (PD) was derived to identify novel inhibitors for improving the motor functions of PD patients by enhancing the bioavailability of dopamine and avoiding neurotoxicity. The proposed approach can easily be extended to more complex multi-targeting disease models containing several targets and anti/offtargets to achieve increased efficacy and reduced toxicity in multifactorial diseases like CNS disorders and cancer. This thesis addresses several issues of cheminformatics methods (e.g., molecular structures representation, machine learning, and molecular similarity analysis) to improve and design new computational approaches used in chemical data mining. Moreover, an integrative drug-designing pipeline is designed to improve polypharmacology-based VS approach. This presented methodology can identify the most promising multi-targeting candidates for experimental validation of drug-targets network at the systems biology level in the drug discovery process

    Application of computer-aided drug design for identification of P. falciparum inhibitors

    Get PDF
    Malaria is a millennia-old disease with the first recorded cases dating back to 2700 BC found in Chinese medical records, and later in other civilizations. It has claimed human lives to such an extent that there are a notable associated socio-economic consequences. Currently, according to the World Health Organization (WHO), Africa holds the highest disease burden with 94% of deaths and 82% of cases with P. falciparum having ~100% prevalence. Chemotherapy, such as artemisinin combination therapy, has been and continues to be the work horse in the fight against the disease, together with seasonal malaria chemoprevention and the use of insecticides. Natural products such as quinine and artemisinin are particularly important in terms of their antimalarial activity. The emphasis in current chemotherapy research is the need for time and cost-effective workflows focussed on new mechanisms of action (MoAs) covering the target candidate profiles (TCPs). Despite a decline in cases over the past decades with, countries increasingly becoming certified malaria free, a stalling trend has been observed in the past five years resulting in missing the 2020 Global Technical Strategy (GTS) milestones. With no effective vaccine, a reduction in funding, slower drug approval than resistance emergence from resistant and invasive vectors, and threats in diagnosis with the pfhrp2/3 gene deletion, malaria remains a major health concern. Motivated by these reasons, the primary aim of this work was a contribution to the antimalarial pipeline through in silico approaches focusing on P. falciparum. We first intended an exploration of malarial targets through a proteome scale screening on 36 targets using multiple metrics to account for the multi-objective nature of drug discovery. The continuous growth of structural data offers the ideal scenario for mining new MoAs covering antimalarials TCPs. This was combined with a repurposing strategy using a set of orally available FDA approved drugs. Further, use was made of time- and cost-effective strategies combining QVina-W efficiency metrics that integrate molecular properties, GRIM rescoring for molecular interactions and a hydrogen mass repartitioning (HMR) molecular dynamics (MD) scheme for accelerated development of antimalarials in the context of resistance. This pipeline further integrates a complex ranking for better drug-target selectivity, and normalization strategies to overcome docking scoring function bias. The different metrics, ranking, normalization strategies and their combinations were first assessed using their mean ranking error (MRE). A version combining all metrics was used to select 36 unique protein-ligand complexes, assessed in MD, with the final retention of 25. From the 16 in vitro tested hits of the 25, fingolimod, abiraterone, prazosin, and terazosin showed antiplasmodial activity with IC50 2.21, 3.37, 16.67 and 34.72 μM respectively and of these, only fingolimod was found to be not safe with respect to human cell viability. These compounds were predicted active on different molecular targets, abiraterone was predicted to interact with a putative liver-stage essential target, hence promising as a transmission-blocking agent. The pipeline had a promising 25% hit rate considering the proteome-scale and use of cost-effective approaches. Secondly, we focused on Plasmodium falciparum 1-deoxy-D-xylulose-5-phosphate reductoisomerase (PfDXR) using a more extensive screening pipeline to overcome some of the current in silico screening limitations. Starting from the ZINC lead-like library of ~3M, hierarchical ligand-based virtual screening (LBVS) and structure-based virtual screening (SBVS) approaches with molecular docking and re-scoring using eleven scoring functions (SFs) were used. Later ranking with an exponential consensus strategy was included. Selected hits were further assessed through Molecular Mechanics Poisson-Boltzmann Surface Area (MM-PBSA), advanced MD sampling in a ligand pulling simulations and (Weighted Histogram Analysis Method) WHAM analysis for umbrella sampling (US) to derive binding free energies. Four leads had better predicted affinities in US than LC5, a 280 nM potent PfDXR inhibitor with ZINC000050633276 showing a promising binding of -20.43 kcal/mol. As shown with fosmidomycin, DXR inhibition offers fast acting compounds fulfilling antimalarials TCP1. Yet, fosmidomycin has a high polarity causing its short half-life and hampering its clinical use. These leads scaffolds are different from fosmidomycin and hence may offer better pharmacokinetic and pharmacodynamic properties and may also be promising for lead optimization. A combined analysis of residues’ contributions to the free energy of binding in MM-PBSA and to steered molecular dynamics (SMD) Fmax indicated GLU233, CYS268, SER270, TRP296, and HIS341 as exploitable for compound optimization. Finally, we updated the SANCDB library with new NPs and their commercially available analogs as a solution to NP availability. The library is extended to 1005 compounds from its initial 600 compounds and the database is integrated to Mcule and Molport APIs for analogs automatic update. The new set may contribute to virtual screening and to antimalarials as the most effective ones have NP origin.Thesis (PhD) -- Faculty of Science, Biochemistry and Microbiology, 202
    corecore