589 research outputs found
Improving the prediction of organism-level toxicity through integration of chemical, protein target and cytotoxicity qHTS data.
Prediction of compound toxicity is essential because covering the vast chemical space requiring safety assessment using traditional experimentally-based, resource-intensive techniques is impossible. However, such prediction is nontrivial due to the complex causal relationship between compound structure and in vivo harm. Protein target annotations and in vitro experimental outcomes encode relevant bioactivity information complementary to chemicals' structures. This work tests the hypothesis that utilizing three complementary types of data will afford predictive models that outperform traditional models built using fewer data types. A tripartite, heterogeneous descriptor set for 367 compounds was comprised of (a) chemical descriptors, (b) protein target descriptors generated using an algorithm trained on 190 000 ligand-protein interactions from ChEMBL, and (c) descriptors derived from in vitro cell cytotoxicity dose-response data from a panel of human cell lines. 100 random forests classification models for predicting rat LD50 were built using every combination of descriptors. Successive integration of data types improved predictive performance; models built using the full dataset had an average external correct classification rate of 0.82, compared to 0.73-0.80 for models built using two data types and 0.67-0.78 for models built using one. Pairwise comparisons of models trained on the same data showed that including a third data domain on top of chemistry improved average correct classification rate by 1.4-2.4 points, with p-values <0.01. Additionally, the approach enhanced the models' applicability domains and proved useful for generating novel mechanism hypotheses. The use of tripartite heterogeneous bioactivity datasets is a useful technique for improving toxicity prediction. Both protein target descriptors - which have the practical value of being derived in silico - and cytotoxicity descriptors derived from experiment are suitable contributors to such datasets.We thank Alexander Sedykh, Ivan Rusyn and Alexander Tropsha (University of North Carolina – Chapel Hill) for providing the chemical and qHTS data used in this study. We also thank the European Chemical Industry Council Long-range Research Initiative (CEFIC-LRI) for funding (via the LRI Innovative Science Award 2012 to AB). ICC thanks the Pasteur-Paris International PhD Programme for funding. ICC and TM thank Institut Pasteur for funding. AB and DSM thank Unilever and the European Research Commission (Starting Grant ERC-2013-StG 336159 MIXTURE) for funding.This is the final version of the article. It first appeared from Wiley via http://dx.doi.org/10.1039/C5TX00406
Recommended from our members
Leveraging heterogeneous data from GHS toxicity annotations, molecular and protein target descriptors and Tox21 assay readouts to predict and rationalise acute toxicity
Despite the increasing knowledge in both the chemical and biological domains the assimilation and exploration of heterogeneous datasets, encoding information about the chemical, bioactivity and phenotypic properties of compounds, remains a challenge due to requirement for overlap between chemicals assayed across the spaces. Here, we have constructed a novel dataset, larger than we have used in prior work, comprising 579 acute oral toxic compounds and 1,427 non-toxic compounds derived from regulatory GHS information, along with their corresponding molecular and protein target descriptors and qHTS in vitro assay readouts from the Tox21 project. We found no clear association between the results of a FAFDrugs4 toxicophore screen and the acute oral toxicity classifications for our compound set; and a screen using a subset of the ToxAlerts toxicophores was also of limited utility, with only slight enrichment toward the toxic set (Odds Ratio of 1.48). We then investigated to what degree toxic and non-toxic compounds could be separated in each of the spaces, to compare their potential contribution to further analyses. Using an LDA projection, we found the largest degree of separation using chemical descriptors (Cohen’s d of 1.95) and the lowest degree of separation between toxicity classes using qHTS descriptors (Cohen’s d of 0.67). To compare the predictivity of the feature spaces for the toxicity endpoint, we next trained Random Forest (RF) acute oral toxicity classifiers on either molecular, protein target and qHTS descriptors. RFs trained on molecular and protein target descriptors were most predictive, with ROC AUC values of 0.80-0.92 and 0.70-0.85, respectively, across three test sets. RFs trained on both chemical and protein target descriptors combined exhibited similar predictive performance to the single-domain models (ROC AUC of 0.80-0.91). Model interpretability was improved by the inclusion of protein target descriptors, which allow the identification of specific targets (e.g. Retinal dehydrogenase) with literature links to toxic modes of action (e.g. oxidative stress). The dataset compiled in this study has been made available for future application.CEFIC Long-range Research Initiative (CEFIC LRI Award 2012 to Andreas Bender
Hybrid crystalline-ITO/metal nanowire mesh transparent electrodes and their application for highly flexible perovskite solar cells
Here, we propose crystalline indium tin oxide/metal nanowire composite electrode (c-ITO/metal NW-GFRHybrimer) films as a robust platform for flexible optoelectronic devices. A very thin c-ITO overcoating layer was introduced to the surface-embedded metal nanowire (NW) network. The c-ITO/metal NW-GFRHybrimer films exhibited outstanding mechanical flexibility, excellent optoelectrical properties and thermal/chemical robustness. Highly flexible and efficient metal halide perovskite solar cells were fabricated on the films. The devices on the c-ITO/AgNW- and c-ITO/CuNW-GFRHybrimer films exhibited power conversion efficiency values of 14.15% and 12.95%, respectively. A synergetic combination of the thin c-ITO layer and the metal NW mesh transparent conducting electrode will be beneficial for use in flexible optoelectronic applications
A review of elliptical and disc galaxy structure, and modern scaling laws
A century ago, in 1911 and 1913, Plummer and then Reynolds introduced their
models to describe the radial distribution of stars in `nebulae'. This article
reviews the progress since then, providing both an historical perspective and a
contemporary review of the stellar structure of bulges, discs and elliptical
galaxies. The quantification of galaxy nuclei, such as central mass deficits
and excess nuclear light, plus the structure of dark matter halos and cD galaxy
envelopes, are discussed. Issues pertaining to spiral galaxies including dust,
bulge-to-disc ratios, bulgeless galaxies, bars and the identification of
pseudobulges are also reviewed. An array of modern scaling relations involving
sizes, luminosities, surface brightnesses and stellar concentrations are
presented, many of which are shown to be curved. These 'redshift zero'
relations not only quantify the behavior and nature of galaxies in the Universe
today, but are the modern benchmark for evolutionary studies of galaxies,
whether based on observations, N-body-simulations or semi-analytical modelling.
For example, it is shown that some of the recently discovered compact
elliptical galaxies at 1.5 < z < 2.5 may be the bulges of modern disc galaxies.Comment: Condensed version (due to Contract) of an invited review article to
appear in "Planets, Stars and Stellar
Systems"(www.springer.com/astronomy/book/978-90-481-8818-5). 500+ references
incl. many somewhat forgotten, pioneer papers. Original submission to
Springer: 07-June-201
- …