16 research outputs found

    Accurate and efficient target prediction using a potency-sensitive influence-relevance voter

    Get PDF
    BACKGROUND: A number of algorithms have been proposed to predict the biological targets of diverse molecules. Some are structure-based, but the most common are ligand-based and use chemical fingerprints and the notion of chemical similarity. These methods tend to be computationally faster than others, making them particularly attractive tools as the amount of available data grows. RESULTS: Using a ChEMBL-derived database covering 490,760 molecule-protein interactions and 3236 protein targets, we conduct a large-scale assessment of the performance of several target-prediction algorithms at predicting drug-target activity. We assess algorithm performance using three validation procedures: standard tenfold cross-validation, tenfold cross-validation in a simulated screen that includes random inactive molecules, and validation on an external test set composed of molecules not present in our database. CONCLUSIONS: We present two improvements over current practice. First, using a modified version of the influence-relevance voter (IRV), we show that using molecule potency data can improve target prediction. Second, we demonstrate that random inactive molecules added during training can boost the accuracy of several algorithms in realistic target-prediction experiments. Our potency-sensitive version of the IRV (PS-IRV) obtains the best results on large test sets in most of the experiments. Models and software are publicly accessible through the chemoinformatics portal at http://chemdb.ics.uci.edu/ ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0110-6) contains supplementary material, which is available to authorized users

    Topological Data Analysis Reveals a Subgroup of Luminal B Breast Cancer

    No full text
    Objective: High-throughput biological data, with its vast complexity and higher dimensions, continues to require innovative analytic methodologies for meaningful exploration. Most methods for reducing data dimensions overlook the shape and topology of data, even though these are vital components of the data structure and complexity. This study leverages topological data analysis (TDA) and shows, using breast cancer (BC) gene expression data as an illustrative example, the power of including the shape of data. Results: In addition to delineating the known subtypes of BC, TDA identifies a new subtype within luminal B cancer along with the features that define the subtype. The final outcome is shown via three-dimensional (3D) scatter plots which demonstrate how the underlying patterns that we identified through TDA map to 3D space. Conclusions: The new subtype, obtained unsupervised and validated by prior knowledge, demonstrates the power of embedding the topology and shape of data in the analyses

    COBRA: A Computational Brewing Application for Predicting the Molecular Composition of Organic Aerosols

    No full text
    Atmospheric organic aerosols (OA) represent a significant fraction of airborne particulate matter and can impact climate, visibility, and human health. These mixtures are difficult to characterize experimentally due to their complex and dynamic chemical composition. We introduce a novel Computational Brewing Application (COBRA) and apply it to modeling oligomerization chemistry stemming from condensation and addition reactions in OA formed by photooxidation of isoprene. COBRA uses two lists as input: a list of chemical structures comprising the molecular starting pool and a list of rules defining potential reactions between molecules. Reactions are performed iteratively, with products of all previous iterations serving as reactants for the next. The simulation generated thousands of structures in the mass range of 120–500 Da and correctly predicted ∼70% of the individual OA constituents observed by high-resolution mass spectrometry. Select predicted structures were confirmed with tandem mass spectrometry. Esterification was shown to play the most significant role in oligomer formation, with hemiacetal formation less important, and aldol condensation insignificant. COBRA is not limited to atmospheric aerosol chemistry; it should be applicable to the prediction of reaction products in other complex mixtures for which reasonable reaction mechanisms and seed molecules can be supplied by experimental or theoretical methods

    Atmospheric Oxidation of Squalene: Molecular Study Using COBRA Modeling and High-Resolution Mass Spectrometry

    No full text
    Squalene is a major component of skin and plant surface lipids and is known to be present at high concentrations in indoor dust. Its high reactivity toward ozone makes it an important ozone sink and a natural protectant against atmospheric oxidizing agents. While the volatile products of squalene ozonolysis are known, the condensed-phase products have not been characterized. We present an analysis of condensed-phase products resulting from an extensive oxidation of squalene by ozone probed by electrospray ionization (ESI) high-resolution mass spectrometry (HR–MS). A complex distribution of nearly 1300 peaks assignable to molecular formulas is observed in direct infusion positive ion mode ESI mass spectra. The distribution of peaks in the mass spectra suggests that there are extensive cross-coupling reactions between hydroxy-carbonyl products of squalene ozonolysis. To get additional insights into the mechanism, we apply a Computational Brewing Application (COBRA) to simulate the oxidation of squalene in the presence of ozone, and compare predicted results with those observed by the HR–MS experiments. The system predicts over one billion molecular structures between 0 and 1450 Da, which correspond to about 27 000 distinct elemental formulas. Over 83% of the squalene oxidation products inferred from the mass spectrometry data are matched by the simulation. The simulation indicates a prevalence of peroxy groups, with hydroxyl and ether groups being the second-most important O-containing functional groups formed during squalene oxidation. These highly oxidized products of squalene ozonolysis may accumulate on indoor dust and surfaces and contribute to their redox capacity

    Targeted molecular therapy of head and neck squamous cell carcinoma with the tyrosine kinase inhibitor vandetanib in a mouse model

    Full text link
    BACKGROUND: We investigated the effects of vandetanib, an inhibitor of vascular endothelial growth factor receptor 2 (VEGFR-2) and epidermal growth factor receptor (EGFR), alone and in combination with paclitaxel in an orthotopic mouse model of human head and neck squamous cell carcinoma (HNSCC). METHODS: The in vitro effects of vandetanib (ZACTIMA(™)) were assessed in two HNSCC cell lines on cell growth, apoptosis, and receptor and downstream signaling morecule expression and phosphorylation levels. We assessed in vivo effects of vandetanib and/or paclitaxel by measuring tumor cell apoptosis, endothelial cell apoptosis, microvessel density, tumor size, and animal survival. RESULTS: In vitro, vandetanib inhibited the phosphorylation of EGFR and its downstream targets in HNSCC cells and inhibited proliferation and induced apoptosis of HNSCC cells and extended survival and inhibited tumor growth in nude mice orthotopically injected with human HNSCC. CONCLUSION: Vandetanib has the potential to be a novel molecular targeted therapy for HNSCC
    corecore