521 research outputs found

    The PubChem chemical structure sketcher

    Get PDF
    PubChem is an important public, Web-based information source for chemical and bioactivity information. In order to provide convenient structure search methods on compounds stored in this database, one mandatory component is a Web-based drawing tool for interactive sketching of chemical query structures. Web-enabled chemical structure sketchers are not new, being in existence for years; however, solutions available rely on complex technology like Java applets or platform-dependent plug-ins. Due to general policy and support incident rate considerations, Java-based or platform-specific sketchers cannot be deployed as a part of public NCBI Web services. Our solution: a chemical structure sketching tool based exclusively on CGI server processing, client-side JavaScript functions, and image sequence streaming. The PubChem structure editor does not require the presence of any specific runtime support libraries or browser configurations on the client. It is completely platform-independent and verified to work on all major Web browsers, including older ones without support for Web2.0 JavaScript objects

    PubChem3D: Similar conformers

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>PubChem is a free and open public resource for the biological activities of small molecules. With many tens of millions of both chemical structures and biological test results, PubChem is a sizeable system with an uneven degree of available information. Some chemical structures in PubChem include a great deal of biological annotation, while others have little to none. To help users, PubChem pre-computes "neighboring" relationships to relate similar chemical structures, which may have similar biological function. In this work, we introduce a "Similar Conformers" neighboring relationship to identify compounds with similar 3-D shape and similar 3-D orientation of functional groups typically used to define pharmacophore features.</p> <p>Results</p> <p>The first two diverse 3-D conformers of 26.1 million PubChem Compound records were compared to each other, using a shape Tanimoto (ST) of 0.8 or greater and a color Tanimoto (CT) of 0.5 or greater, yielding 8.16 billion conformer neighbor pairs and 6.62 billion compound neighbor pairs, with an average of 253 "Similar Conformers" compound neighbors per compound. Comparing the 3-D neighboring relationship to the corresponding 2-D neighboring relationship ("Similar Compounds") for molecules such as caffeine, aspirin, and morphine, one finds unique sets of related chemical structures, providing additional significant biological annotation. The PubChem 3-D neighboring relationship is also shown to be able to group a set of non-steroidal anti-inflammatory drugs (NSAIDs), despite limited PubChem 2-D similarity.</p> <p>In a study of 4,218 chemical structures of biomedical interest, consisting of many known drugs, using more diverse conformers per compound results in more 3-D compound neighbors per compound; however, the overlap of the compound neighbor lists per conformer also increasingly resemble each other, being 38% identical at three conformers and 68% at ten conformers. Perhaps surprising is that the average count of conformer neighbors per conformer increases rather slowly as a function of diverse conformers considered, with only a 70% increase for a ten times growth in conformers per compound (a 68-fold increase in the conformer pairs considered).</p> <p>Neighboring 3-D conformers on the scale performed, if implemented naively, is an intractable problem using a modest sized compute cluster. Methodology developed in this work relies on a series of filters to prevent performing 3-D superposition optimization, when it can be determined that two conformers cannot possibly be a neighbor. Most filters are based on Tanimoto equation volume constraints, avoiding incompatible conformers; however, others consider preliminary superposition between conformers using reference shapes.</p> <p>Conclusion</p> <p>The "Similar Conformers" 3-D neighboring relationship locates similar small molecules of biological interest that may go unnoticed when using traditional 2-D chemical structure graph-based methods, making it complementary to such methodologies. The computational cost of 3-D similarity methodology on a wide scale, such as PubChem contents, is a considerable issue to overcome. Using a series of efficient filters, an effective throughput rate of more than 150,000 conformers per second per processor core was achieved, more than two orders of magnitude faster than without filtering.</p

    PubChem3D: Diversity of shape

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The shape diversity of 16.4 million biologically relevant molecules from the PubChem Compound database and their 1.46 billion diverse conformers was explored as a function of molecular volume.</p> <p>Results</p> <p>The diversity of shape space was investigated by determining the shape similarity threshold to achieve a maximum on the count of reference shapes per unit of conformer volume. The rate of growth in shape space, as represented by a decreasing shape similarity threshold, was found to be remarkably smooth as a function of volume. There was no apparent correlation between the count of conformers per unit volume and their diversity, meaning that a single reference shape can describe the shape space of many chemical structures. The ability of a volume to describe the shape space of lesser volumes was also examined. It was shown that a given volume was able to describe 40-70% of the shape diversity of lesser volumes, for the majority of the volume range considered in this study.</p> <p>Conclusion</p> <p>The relative growth of shape diversity as a function of volume and shape similarity is surprisingly uniform. Given the distribution of chemicals in PubChem versus what is theoretically synthetically possible, the results from this analysis should be considered a conservative estimate to the true diversity of shape space.</p

    PubChem3D: a new resource for scientists

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>PubChem is an open repository for small molecules and their experimental biological activity. PubChem integrates and provides search, retrieval, visualization, analysis, and programmatic access tools in an effort to maximize the utility of contributed information. There are many diverse chemical structures with similar biological efficacies against targets available in PubChem that are difficult to interrelate using traditional 2-D similarity methods. A new layer called PubChem3D is added to PubChem to assist in this analysis.</p> <p>Description</p> <p>PubChem generates a 3-D conformer model description for 92.3% of all records in the PubChem Compound database (when considering the parent compound of salts). Each of these conformer models is sampled to remove redundancy, guaranteeing a minimum (non-hydrogen atom pair-wise) RMSD between conformers. A diverse conformer ordering gives a maximal description of the conformational diversity of a molecule when only a subset of available conformers is used. A pre-computed search per compound record gives immediate access to a set of 3-D similar compounds (called "Similar Conformers") in PubChem and their respective superpositions. Systematic augmentation of PubChem resources to include a 3-D layer provides users with new capabilities to search, subset, visualize, analyze, and download data.</p> <p>A series of retrospective studies help to demonstrate important connections between chemical structures and their biological function that are not obvious using 2-D similarity but are readily apparent by 3-D similarity.</p> <p>Conclusions</p> <p>The addition of PubChem3D to the existing contents of PubChem is a considerable achievement, given the scope, scale, and the fact that the resource is publicly accessible and free. With the ability to uncover latent structure-activity relationships of chemical structures, while complementing 2-D similarity analysis approaches, PubChem3D represents a new resource for scientists to exploit when exploring the biological annotations in PubChem.</p

    Fundamental Concepts

    Full text link
    This chapter briefly discusses the fundamental properties of black holes in general relativity, the discovery of astrophysical black holes and their main astronomical observations, how X-ray and γ\gamma-ray facilities can study these objects, and ends with a list of open problems and future developments in the field.Comment: 14 pages, 4 figures. To appear in "Tutorial Guide to X-ray and Gamma-ray Astronomy: Data Reduction and Analysis" (Ed. C. Bambi, Springer Singapore, 2020). v2: fixed some typos and updated some parts. arXiv admin note: text overlap with arXiv:1711.1025

    Early star-forming galaxies and the reionization of the Universe

    Full text link
    Star forming galaxies represent a valuable tracer of cosmic history. Recent observational progress with Hubble Space Telescope has led to the discovery and study of the earliest-known galaxies corresponding to a period when the Universe was only ~800 million years old. Intense ultraviolet radiation from these early galaxies probably induced a major event in cosmic history: the reionization of intergalactic hydrogen. New techniques are being developed to understand the properties of these most distant galaxies and determine their influence on the evolution of the universe.Comment: Review article appearing in Nature. This posting reflects a submitted version of the review formatted by the authors, in accordance with Nature publication policies. For the official, published version of the review, please see http://www.nature.com/nature/archive/index.htm

    The guinea pig ileum lacks the direct, high-potency, M2-muscarinic, contractile mechanism characteristic of the mouse ileum

    Get PDF
    We explored whether the M2 muscarinic receptor in the guinea pig ileum elicits a highly potent, direct-contractile response, like that from the M3 muscarinic receptor knockout mouse. First, we characterized the irreversible receptor-blocking activity of 4-DAMP mustard in ileum from muscarinic receptor knockout mice to verify its M3 selectivity. Then, we used 4-DAMP mustard to inactivate M3 responses in the guinea pig ileum to attempt to reveal direct, M2 receptor-mediated contractions. The muscarinic agonist, oxotremorine-M, elicited potent contractions in ileum from wild-type, M2 receptor knockout, and M3 receptor knockout mice characterized by negative log EC50 (pEC50) values ± SEM of 6.75 ± 0.03, 6.26 ± 0.05, and 6.99 ± 0.08, respectively. The corresponding Emax values in wild-type and M2 receptor knockout mice were approximately the same, but that in the M3 receptor knockout mouse was only 36% of wild type. Following 4-DAMP mustard treatment, the concentration–response curve of oxotremorine-M in wild-type ileum resembled that of the M3 knockout mouse in terms of its pEC50, Emax, and inhibition by selective muscarinic antagonists. Thus, 4-DAMP mustard treatment appears to inactivate M3 responses selectively and renders the muscarinic contractile behavior of the wild-type ileum similar to that of the M3 knockout mouse. Following 4-DAMP mustard treatment, the contractile response of the guinea pig ileum to oxotremorine-M exhibited low potency and a competitive-antagonism profile consistent with an M3 response. The guinea pig ileum, therefore, lacks a direct, highly potent, M2-contractile component but may have a direct, lower potency M2 component

    Virtual Screening of Bioassay Data

    Get PDF
    Background: There are three main problems associated with the virtual screening of bioassay data. The first is access to freely-available curated data, the second is the number of false positives that occur in the physical primary screening process, and finally the data is highly-imbalanced with a low ratio of Active compounds to Inactive compounds. This paper first discusses these three problems and then a selection of Weka cost-sensitive classifiers (Naive Bayes, SVM, C4.5 and Random Forest) are applied to a variety of bioassay datasets. Results: Pharmaceutical bioassay data is not readily available to the academic community. The data held at PubChem is not curated and there is a lack of detailed cross-referencing between Primary and Confirmatory screening assays. With regard to the number of false positives that occur in the primary screening process, the analysis carried out has been shallow due to the lack of crossreferencing mentioned above. In six cases found, the average percentage of false positives from the High-Throughput Primary screen is quite high at 64%. For the cost-sensitive classification, Weka's implementations of the Support Vector Machine and C4.5 decision tree learner have performed relatively well. It was also found, that the setting of the Weka cost matrix is dependent on the base classifier used and not solely on the ratio of class imbalance. Conclusions: Understandably, pharmaceutical data is hard to obtain. However, it would be beneficial to both the pharmaceutical industry and to academics for curated primary screening and corresponding confirmatory data to be provided. Two benefits could be gained by employing virtual screening techniques to bioassay data. First, by reducing the search space of compounds to be screened and secondly, by analysing the false positives that occur in the primary screening process, the technology may be improved. The number of false positives arising from primary screening leads to the issue of whether this type of data should be used for virtual screening. Care when using Weka's cost-sensitive classifiers is needed - across the board misclassification costs based on class ratios should not be used when comparing differing classifiers for the same dataset
    • …
    corecore