103 research outputs found

    Evaluation of machine-learning methods for ligand-based virtual screening

    Get PDF
    Machine-learning methods can be used for virtual screening by analysing the structural characteristics of molecules of known (in)activity, and we here discuss the use of kernel discrimination and naive Bayesian classifier (NBC) methods for this purpose. We report a kernel method that allows the processing of molecules represented by binary, integer and real-valued descriptors, and show that it is little different in screening performance from a previously described kernel that had been developed specifically for the analysis of binary fingerprint representations of molecular structure. We then evaluate the performance of an NBC when the training-set contains only a very few active molecules. In such cases, a simpler approach based on group fusion would appear to provide superior screening performance, especially when structurally heterogeneous datasets are to be processed

    Physiochemical property space distribution among human metabolites, drugs and toxins

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The current approach to screen for drug-like molecules is to sieve for molecules with biochemical properties suitable for desirable pharmacokinetics and reduced toxicity, using predominantly biophysical properties of chemical compounds, based on empirical rules such as Lipinski's "rule of five" (Ro5). For over a decade, Ro5 has been applied to combinatorial compounds, drugs and ligands, in the search for suitable lead compounds. Unfortunately, till date, a clear distinction between drugs and non-drugs has not been achieved. The current trend is to seek out drugs which show metabolite-likeness. In identifying similar physicochemical characteristics, compounds have usually been clustered based on some characteristic, to reduce the search space presented by large molecular datasets. This paper examines the similarity of current drug molecules with human metabolites and toxins, using a range of computed molecular descriptors as well as the effect of comparison to clustered data compared to searches against complete datasets.</p> <p>Results</p> <p>We have carried out statistical and substructure functional group analyses of three datasets, namely human metabolites, drugs and toxin molecules. The distributions of various molecular descriptors were investigated. Our analyses show that, although the three groups are distinct, present-day drugs are closer to toxin molecules than to metabolites. Furthermore, these distributions are quite similar for both clustered data as well as complete or unclustered datasets.</p> <p>Conclusion</p> <p>The property space occupied by metabolites is dissimilar to that of drugs or toxin molecules, with current drugs showing greater similarity to toxins than to metabolites. Additionally, empirical rules like Ro5 can be refined to identify drugs or drug-like molecules that are clearly distinct from toxic compounds and more metabolite-like. The inclusion of human metabolites in this study provides a deeper insight into metabolite/drug/toxin-like properties and will also prove to be valuable in the prediction or optimization of small molecules as ligands for therapeutic applications.</p

    ANN multiscale model of anti-HIV Drugs activity vs AIDS prevalence in the US at county level based on information indices of molecular graphs and social networks

    Get PDF
    [Abstract] This work is aimed at describing the workflow for a methodology that combines chemoinformatics and pharmacoepidemiology methods and at reporting the first predictive model developed with this methodology. The new model is able to predict complex networks of AIDS prevalence in the US counties, taking into consideration the social determinants and activity/structure of anti-HIV drugs in preclinical assays. We trained different Artificial Neural Networks (ANNs) using as input information indices of social networks and molecular graphs. We used a Shannon information index based on the Gini coefficient to quantify the effect of income inequality in the social network. We obtained the data on AIDS prevalence and the Gini coefficient from the AIDSVu database of Emory University. We also used the Balaban information indices to quantify changes in the chemical structure of anti-HIV drugs. We obtained the data on anti-HIV drug activity and structure (SMILE codes) from the ChEMBL database. Last, we used Box-Jenkins moving average operators to quantify information about the deviations of drugs with respect to data subsets of reference (targets, organisms, experimental parameters, protocols). The best model found was a Linear Neural Network (LNN) with values of Accuracy, Specificity, and Sensitivity above 0.76 and AUROC > 0.80 in training and external validation series. This model generates a complex network of AIDS prevalence in the US at county level with respect to the preclinical activity of anti-HIV drugs in preclinical assays. To train/validate the model and predict the complex network we needed to analyze 43,249 data points including values of AIDS prevalence in 2,310 counties in the US vs ChEMBL results for 21,582 unique drugs, 9 viral or human protein targets, 4,856 protocols, and 10 possible experimental measures.Ministerio de Educación, Cultura y Deportes; AGL2011-30563-C03-0

    Novel curcumin- and emodin-related compounds identified by in silico 2D/3D conformer screening induce apoptosis in tumor cells

    Get PDF
    BACKGROUND: Inhibition of the COP9 signalosome (CSN) associated kinases CK2 and PKD by curcumin causes stabilization of the tumor suppressor p53. It has been shown that curcumin induces tumor cell death and apoptosis. Curcumin and emodin block the CSN-directed c-Jun signaling pathway, which results in diminished c-Jun steady state levels in HeLa cells. The aim of this work was to search for new CSN kinase inhibitors analogue to curcumin and emodin by means of an in silico screening method. METHODS: Here we present a novel method to identify efficient inhibitors of CSN-associated kinases. Using curcumin and emodin as lead structures an in silico screening with our in-house database containing more than 10(6 )structures was carried out. Thirty-five compounds were identified and further evaluated by the Lipinski's rule-of-five. Two groups of compounds can be clearly discriminated according to their structures: the curcumin-group and the emodin-group. The compounds were evaluated in in vitro kinase assays and in cell culture experiments. RESULTS: The data revealed 3 compounds of the curcumin-group (e.g. piceatannol) and 4 of the emodin-group (e.g. anthrachinone) as potent inhibitors of CSN-associated kinases. Identified agents increased p53 levels and induced apoptosis in tumor cells as determined by annexin V-FITC binding, DNA fragmentation and caspase activity assays. CONCLUSION: Our data demonstrate that the new in silico screening method is highly efficient for identifying potential anti-tumor drugs

    IVSPlat 1.0: an integrated virtual screening platform with a molecular graphical interface

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The virtual screening (VS) of lead compounds using molecular docking and pharmacophore detection is now an important tool in drug discovery. VS tasks typically require a combination of several software tools and a molecular graphics system. Thus, the integration of all the requisite tools in a single operating environment could reduce the complexity of running VS experiments. However, only a few freely available integrated software platforms have been developed.</p> <p>Results</p> <p>A free open-source platform, IVSPlat 1.0, was developed in this study for the management and automation of VS tasks. We integrated several VS-related programs into a molecular graphics system to provide a comprehensive platform for the solution of VS tasks based on molecular docking, pharmacophore detection, and a combination of both methods. This tool can be used to visualize intermediate and final results of the VS execution, while also providing a clustering tool for the analysis of VS results. A case study was conducted to demonstrate the applicability of this platform.</p> <p>Conclusions</p> <p>IVSPlat 1.0 provides a plug-in-based solution for the management, automation, and visualization of VS tasks. IVSPlat 1.0 is an open framework that allows the integration of extra software to extend its functionality and modified versions can be freely distributed. The open source code and documentation are available at <url>http://kyc.nenu.edu.cn/IVSPlat/.</url></p

    The use of biodiversity as source of new chemical entities against defined molecular targets for treatment of malaria, tuberculosis, and T-cell mediated diseases: a review

    Full text link
    corecore