600 research outputs found
Improved large-scale prediction of growth inhibition patterns using the NCI60 cancer cell line panel
International audienceMotivation: Recent large-scale omics initiatives have catalogued the somatic alterations of cancer cell line panels along with their pharmacological response to hundreds of compounds. In this study, we have explored these data to advance computational approaches that enable more effective and targeted use of current and future anticancer therapeutics.Results: We modelled the 50% growth inhibition bioassay end-point (GI50) of 17 142 compounds screened against 59 cancer cell lines from the NCI60 panel (941 831 data-points, matrix 93.08% complete) by integrating the chemical and biological (cell line) information. We determine that the protein, gene transcript and miRNA abundance provide the highest predictive signal when modelling the GI50 endpoint, which significantly outperformed the DNA copy-number variation or exome sequencing data (Tukey’s Honestly Significant Difference, P <0.05). We demonstrate that, within the limits of the data, our approach exhibits the ability to both interpolate and extrapolate compound bioactivities to new cell lines and tissues and, although to a lesser extent, to dissimilar compounds. Moreover, our approach outperforms previous models generated on the GDSC dataset. Finally, we determine that in the cases investigated in more detail, the predicted drug-pathway associations and growth inhibition patterns are mostly consistent with the experimental data, which also suggests the possibility of identifying genomic markers of drug sensitivity for novel compounds on novel cell lines
Recommended from our members
Complex regulation of the regulator of synaptic plasticity histone deacetylase 2 in the rodent dorsal horn after peripheral injury
Histone deacetylases (HDACs), HDAC2 in particular, have been shown to regulate various forms of learning and memory. Since cognitive processes share mechanisms with spinal nociceptive signalling, we decided to investigate the HDAC2 expression in the dorsal horn after peripheral injury. Using immunohistochemistry, we found that spinal HDAC2 was mainly seen in neurons and astrocytes, with neuronal expression in naïve tissue 2.6 times greater than that in astrocytes. Cysteine (S)-nitrosylation of HDAC2 releases HDAC2 gene silencing and is controlled by nitric oxide (NO). A duration of 48 h after intraplantar injection of complete Freund's adjuvant, there was an ipsilateral increase in the most important NO-producing enzyme in pain states, nitric oxide synthase (nNOS), accompanied by an increase in HDAC2 S-nitrosylation. Moreover, a subset of nNOS-positive neurons expressed cFos, a known target of HDAC2, suggesting that derepression of cFos expression following HDAC2 S-nitrosylation might occur after noxious stimulation. We saw no change in global HDAC2 expression in both short- and long-term pain states. However, HDAC2 was increased in astrocytes 7 days after neuropathic injury suggesting that HDAC2 might inhibit astrocytic gene expression in neuropathic pain states. All together, our results indicate that the epigenetic regulation of transcriptional programmes in the dorsal horn after injury is cell specific. Moreover, the prominent role of NO in persistent pain states suggests that HDAC2 S-nitrosylation could play a crucial role in the regulation of gene expression leading to hypersensitivity. Our manuscript describes for the first time the regulation of the memory regulator histone deacetylase 2 (HDAC2) in the superficial dorsal horn of adult rats following peripheral injury. Our cell-specific approach has revealed a complex pattern of expression of spinal HDAC2 that depends on the injury and the cell type, suggesting a sophisticated regulation of gene expression by HDAC2
Target identification of hits using a concerted chemogenomic, biophysical and structural approach
Mycobacterium phenotypic hits are a good reservoir for new chemotypes for the treatment of tuberculosis. However, the absence
of defined molecular targets and modes of action could lead to failure in drug development. Therefore, a combination of
ligand-based and structure-based chemogenomic approaches followed by biophysical and biochemical validation have been used to
identify targets for Mycobacterium tuberculosis phenotypic hits. Our approach identified EthR and InhA as targets for several hits,
with some showing dual activity against these proteins. From the 35 predicted EthR inhibitors, eight exhibited an IC50 below 50
μM against M. tuberculosis EthR and three were confirmed to be also simultaneously active against InhA. Further hit validation was
performed using X-ray crystallography yielding eight new crystal structures of EthR inhibitors. Although the EthR inhibitors attain
their activity against M. tuberculosis by hitting yet undefined targets, these results provide new lead compounds that could be
further developed to be used to potentiate the effect of EthA activated pro-drugs, such as ethionamide, thus enhancing their
bactericidal effect.GM is grateful to the European Molecular Biology Laboratory and Marie Sklodowska-Curie Actions for funding this work. VM and MB
acknowledge Bill & Melinda Gates Foundation [subcontract by the Foundation for the National Institutes of Health (NIH)]
(OPP1024021). VM and MS acknowledge the European Community’s Seventh Framework Programme [grant number 260872]. GP
would like to acknowledge the Wellcome Trust and the European Molecular Biology Laboratory for funding. JPO was funded by the
member nation states of the European Molecular Biology Laboratory. TLB acknowledges The Wellcome Trust for funding and
support (grant number 200814/Z/16/Z)
Flipping the odds of drug development success through human genomics
Drug development depends on accurately identifying molecular targets that both play a causal role in a disease and are amenable to pharmacological action by small molecule drugs or bio-therapeutics, such as monoclonal antibodies. Errors in drug target specification contribute to the extremely high rates of drug development failure. Integrating knowledge of genes that encode druggable targets with those that influence susceptibility to common disease has the potential to radically improve the probability of drug development success
Improving the odds of drug development success through human genomics: modelling study.
Lack of efficacy in the intended disease indication is the major cause of clinical phase drug development failure. Explanations could include the poor external validity of pre-clinical (cell, tissue, and animal) models of human disease and the high false discovery rate (FDR) in preclinical science. FDR is related to the proportion of true relationships available for discovery (γ), and the type 1 (false-positive) and type 2 (false negative) error rates of the experiments designed to uncover them. We estimated the FDR in preclinical science, its effect on drug development success rates, and improvements expected from use of human genomics rather than preclinical studies as the primary source of evidence for drug target identification. Calculations were based on a sample space defined by all human diseases - the 'disease-ome' - represented as columns; and all protein coding genes - 'the protein-coding genome'- represented as rows, producing a matrix of unique gene- (or protein-) disease pairings. We parameterised the space based on 10,000 diseases, 20,000 protein-coding genes, 100 causal genes per disease and 4000 genes encoding druggable targets, examining the effect of varying the parameters and a range of underlying assumptions, on the inferences drawn. We estimated γ, defined mathematical relationships between preclinical FDR and drug development success rates, and estimated improvements in success rates based on human genomics (rather than orthodox preclinical studies). Around one in every 200 protein-disease pairings was estimated to be causal (γ = 0.005) giving an FDR in preclinical research of 92.6%, which likely makes a major contribution to the reported drug development failure rate of 96%. Observed success rate was only slightly greater than expected for a random pick from the sample space. Values for γ back-calculated from reported preclinical and clinical drug development success rates were also close to the a priori estimates. Substituting genome wide (or druggable genome wide) association studies for preclinical studies as the major information source for drug target identification was estimated to reverse the probability of late stage failure because of the more stringent type 1 error rate employed and the ability to interrogate every potential druggable target in the same experiment. Genetic studies conducted at much larger scale, with greater resolution of disease end-points, e.g. by connecting genomics and electronic health record data within healthcare systems has the potential to produce radical improvement in drug development success rate
IUPHAR-DB: An Open-Access, Expert-Curated Resource for Receptor and Ion Channel Research
[Image: see text] This contribution highlights efforts by the International Union of Basic and Clinical Pharmacology (IUPHAR) Nomenclature Committee (NC-IUPHAR) to classify human receptors and ion channels, to document their properties, and to recommend ligands that are useful for characterization. This effort has inspired the creation of an online database (IUPHAR-DB), which is intended to provide free information to all scientists, summarized from primary literature by experts
ChEMBL: a large-scale bioactivity database for drug discovery
ChEMBL is an Open Data database containing binding, functional and ADMET information for a large number of drug-like bioactive compounds. These data are manually abstracted from the primary published literature on a regular basis, then further curated and standardized to maximize their quality and utility across a wide range of chemical biology and drug-discovery research problems. Currently, the database contains 5.4 million bioactivity measurements for more than 1 million compounds and 5200 protein targets. Access is available through a web-based interface, data downloads and web services at: https://www.ebi.ac.uk/chembldb
Global Analysis of Small Molecule Binding to Related Protein Targets
We report on the integration of pharmacological data and homology information for a large scale analysis of small molecule binding to related targets. Differences in small molecule binding have been assessed for curated pairs of human to rat orthologs and also for recently diverged human paralogs. Our analysis shows that in general, small molecule binding is conserved for pairs of human to rat orthologs. Using statistical tests, we identified a small number of cases where small molecule binding is different between human and rat, some of which had previously been reported in the literature. Knowledge of species specific pharmacology can be advantageous for drug discovery, where rats are frequently used as a model system. For human paralogs, we demonstrate a global correlation between sequence identity and the binding of small molecules with equivalent affinity. Our findings provide an initial general model relating small molecule binding and sequence divergence, containing the foundations for a general model to anticipate and predict within-target-family selectivity
Structural diversity of biologically interesting datasets: a scaffold analysis approach
ABSTRACT:The recent public availability of the human metabolome and natural product datasets has revitalized "metabolite-likeness" and "natural product-likeness" as a drug design concept to design lead libraries targeting specific pathways. Many reports have analyzed the physicochemical property space of biologically important datasets, with only a few comprehensively characterizing the scaffold diversity in public datasets of biological interest. With large collections of high quality public data currently available, we carried out a comparative analysis of current day leads with other biologically relevant datasets.In this study, we note a two-fold enrichment of metabolite scaffolds in drug dataset (42%) as compared to currently used lead libraries (23%). We also note that only a small percentage (5%) of natural product scaffolds space is shared by the lead dataset. We have identified specific scaffolds that are present in metabolites and natural products, with close counterparts in the drugs, but are missing in the lead dataset. To determine the distribution of compounds in physicochemical property space we analyzed the molecular polar surface area, the molecular solubility, the number of rings and the number of rotatable bonds in addition to four well-known Lipinski properties. Here, we note that, with only few exceptions, most of the drugs follow Lipinski's rule. The average values of the molecular polar surface area and the molecular solubility in metabolites is the highest while the number of rings is the lowest. In addition, we note that natural products contain the maximum number of rings and the rotatable bonds than any other dataset under consideration.Currently used lead libraries make little use of the metabolites and natural products scaffold space. We believe that metabolites and natural products are recognized by at least one protein in the biosphere therefore, sampling the fragment and scaffold space of these compounds, along with the knowledge of distribution in physicochemical property space, can result in better lead libraries. Hence, we recommend the greater use of metabolites and natural products while designing lead libraries. Nevertheless, metabolites have a limited distribution in chemical space that limits the usage of metabolites in library design.14 page(s
Self-organizing ontology of biochemically relevant small molecules
<p>Abstract</p> <p>Background</p> <p>The advent of high-throughput experimentation in biochemistry has led to the generation of vast amounts of chemical data, necessitating the development of novel analysis, characterization, and cataloguing techniques and tools. Recently, a movement to publically release such data has advanced biochemical structure-activity relationship research, while providing new challenges, the biggest being the curation, annotation, and classification of this information to facilitate useful biochemical pattern analysis. Unfortunately, the human resources currently employed by the organizations supporting these efforts (e.g. ChEBI) are expanding linearly, while new useful scientific information is being released in a seemingly exponential fashion. Compounding this, currently existing chemical classification and annotation systems are not amenable to automated classification, formal and transparent chemical class definition axiomatization, facile class redefinition, or novel class integration, thus further limiting chemical ontology growth by necessitating human involvement in curation. Clearly, there is a need for the automation of this process, especially for novel chemical entities of biological interest.</p> <p>Results</p> <p>To address this, we present a formal framework based on Semantic Web technologies for the automatic design of chemical ontology which can be used for automated classification of novel entities. We demonstrate the automatic self-assembly of a structure-based chemical ontology based on 60 MeSH and 40 ChEBI chemical classes. This ontology is then used to classify 200 compounds with an accuracy of 92.7%. We extend these structure-based classes with molecular feature information and demonstrate the utility of our framework for classification of functionally relevant chemicals. Finally, we discuss an iterative approach that we envision for future biochemical ontology development.</p> <p>Conclusions</p> <p>We conclude that the proposed methodology can ease the burden of chemical data annotators and dramatically increase their productivity. We anticipate that the use of formal logic in our proposed framework will make chemical classification criteria more transparent to humans and machines alike and will thus facilitate predictive and integrative bioactivity model development.</p
- …