136 research outputs found

    The PubChem chemical structure sketcher

    Get PDF
    PubChem is an important public, Web-based information source for chemical and bioactivity information. In order to provide convenient structure search methods on compounds stored in this database, one mandatory component is a Web-based drawing tool for interactive sketching of chemical query structures. Web-enabled chemical structure sketchers are not new, being in existence for years; however, solutions available rely on complex technology like Java applets or platform-dependent plug-ins. Due to general policy and support incident rate considerations, Java-based or platform-specific sketchers cannot be deployed as a part of public NCBI Web services. Our solution: a chemical structure sketching tool based exclusively on CGI server processing, client-side JavaScript functions, and image sequence streaming. The PubChem structure editor does not require the presence of any specific runtime support libraries or browser configurations on the client. It is completely platform-independent and verified to work on all major Web browsers, including older ones without support for Web2.0 JavaScript objects

    In silico assessment of potential druggable pockets on the surface of α1-Antitrypsin conformers

    Get PDF
    The search for druggable pockets on the surface of a protein is often performed on a single conformer, treated as a rigid body. Transient druggable pockets may be missed in this approach. Here, we describe a methodology for systematic in silico analysis of surface clefts across multiple conformers of the metastable protein α1-antitrypsin (A1AT). Pathological mutations disturb the conformational landscape of A1AT, triggering polymerisation that leads to emphysema and hepatic cirrhosis. Computational screens for small molecule inhibitors of polymerisation have generally focused on one major druggable site visible in all crystal structures of native A1AT. In an alternative approach, we scan all surface clefts observed in crystal structures of A1AT and in 100 computationally produced conformers, mimicking the native solution ensemble. We assess the persistence, variability and druggability of these pockets. Finally, we employ molecular docking using publicly available libraries of small molecules to explore scaffold preferences for each site. Our approach identifies a number of novel target sites for drug design. In particular one transient site shows favourable characteristics for druggability due to high enclosure and hydrophobicity. Hits against this and other druggable sites achieve docking scores corresponding to a Kd in the µM–nM range, comparing favourably with a recently identified promising lead. Preliminary ThermoFluor studies support the docking predictions. In conclusion, our strategy shows considerable promise compared with the conventional single pocket/single conformer approach to in silico screening. Our best-scoring ligands warrant further experimental investigation

    11th German Conference on Chemoinformatics (GCC 2015) : Fulda, Germany. 8-10 November 2015.

    Get PDF

    Fast 3D shape screening of large chemical databases through alignment-recycling

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Large chemical databases require fast, efficient, and simple ways of looking for similar structures. Although such tasks are now fairly well resolved for graph-based similarity queries, they remain an issue for 3D approaches, particularly for those based on 3D shape overlays. Inspired by a recent technique developed to compare molecular shapes, we designed a hybrid methodology, alignment-recycling, that enables efficient retrieval and alignment of structures with similar 3D shapes.</p> <p>Results</p> <p>Using a dataset of more than one million PubChem compounds of limited size (< 28 heavy atoms) and flexibility (< 6 rotatable bonds), we obtained a set of a few thousand diverse structures covering entirely the 3D shape space of the conformers of the dataset. Transformation matrices gathered from the overlays between these diverse structures and the 3D conformer dataset allowed us to drastically (100-fold) reduce the CPU time required for shape overlay. The alignment-recycling heuristic produces results consistent with <it>de novo </it>alignment calculation, with better than 80% hit list overlap on average.</p> <p>Conclusion</p> <p>Overlay-based 3D methods are computationally demanding when searching large databases. Alignment-recycling reduces the CPU time to perform shape similarity searches by breaking the alignment problem into three steps: selection of diverse shapes to describe the database shape-space; overlay of the database conformers to the diverse shapes; and non-optimized overlay of query and database conformers using common reference shapes. The precomputation, required by the first two steps, is a significant cost of the method; however, once performed, querying is two orders of magnitude faster. Extensions and variations of this methodology, for example, to handle more flexible and larger small-molecules are discussed.</p

    Seven Golden Rules for heuristic filtering of molecular formulas obtained by accurate mass spectrometry

    Get PDF
    BACKGROUND: Structure elucidation of unknown small molecules by mass spectrometry is a challenge despite advances in instrumentation. The first crucial step is to obtain correct elemental compositions. In order to automatically constrain the thousands of possible candidate structures, rules need to be developed to select the most likely and chemically correct molecular formulas. RESULTS: An algorithm for filtering molecular formulas is derived from seven heuristic rules: (1) restrictions for the number of elements, (2) LEWIS and SENIOR chemical rules, (3) isotopic patterns, (4) hydrogen/carbon ratios, (5) element ratio of nitrogen, oxygen, phosphor, and sulphur versus carbon, (6) element ratio probabilities and (7) presence of trimethylsilylated compounds. Formulas are ranked according to their isotopic patterns and subsequently constrained by presence in public chemical databases. The seven rules were developed on 68,237 existing molecular formulas and were validated in four experiments. First, 432,968 formulas covering five million PubChem database entries were checked for consistency. Only 0.6% of these compounds did not pass all rules. Next, the rules were shown to effectively reducing the complement all eight billion theoretically possible C, H, N, S, O, P-formulas up to 2000 Da to only 623 million most probable elemental compositions. Thirdly 6,000 pharmaceutical, toxic and natural compounds were selected from DrugBank, TSCA and DNP databases. The correct formulas were retrieved as top hit at 80–99% probability when assuming data acquisition with complete resolution of unique compounds and 5% absolute isotope ratio deviation and 3 ppm mass accuracy. Last, some exemplary compounds were analyzed by Fourier transform ion cyclotron resonance mass spectrometry and by gas chromatography-time of flight mass spectrometry. In each case, the correct formula was ranked as top hit when combining the seven rules with database queries. CONCLUSION: The seven rules enable an automatic exclusion of molecular formulas which are either wrong or which contain unlikely high or low number of elements. The correct molecular formula is assigned with a probability of 98% if the formula exists in a compound database. For truly novel compounds that are not present in databases, the correct formula is found in the first three hits with a probability of 65–81%. Corresponding software and supplemental data are available for downloads from the authors' website

    A taxonomic backbone for the global synthesis of species diversity in the angiosperm order Caryophyllales

    Full text link
    The Caryophyllales constitute a major lineage of flowering plants with approximately 12500 species in 39 families. A taxonomic backbone at the genus level is provided that reflects the current state of knowledge and accepts 749 genera for the order. A detailed review of the literature of the past two decades shows that enormous progress has been made in understanding overall phylogenetic relationships in Caryophyllales. The process of re-circumscribing families in order to be monophyletic appears to be largely complete and has led to the recognition of eight new families (Anacampserotaceae, Kewaceae, Limeaceae, Lophiocarpaceae, Macarthuriaceae, Microteaceae, Montiaceae and Talinaceae), while the phylogenetic evaluation of generic concepts is still well underway. As a result of this, the number of genera has increased by more than ten percent in comparison to the last complete treatments in the Families and genera of vascular plants” series. A checklist with all currently accepted genus names in Caryophyllales, as well as nomenclatural references, type names and synonymy is presented. Notes indicate how extensively the respective genera have been studied in a phylogenetic context. The most diverse families at the generic level are Cactaceae and Aizoaceae, but 28 families comprise only one to six genera. This synopsis represents a first step towards the aim of creating a global synthesis of the species diversity in the angiosperm order Caryophyllales integrating the work of numerous specialists around the world

    Epidermis structure in Mesembryanthemaceae

    No full text
    The Mesembryanthemaceae, which inhabit the deserts and semi-deserts of Southern Africa, exhibit two very different types of epidermis. The first is characterized by thick outer cell walls encrusted by oxalate crystals, a thick cuticle and thick wax layers. In the second type, the epidermal cells exhibit only very thin outer cell walls, not encrusted by oxalate crystals, a very thin cuticle, and wax layers are poorly developed; the most prominent feature of this type is the occurrence of huge idioblasts often protruding as ‘bladder cells’. With regard to the climatic conditions in the native habitats this type of epidermis must be called ‘abnormal’. Both types of epidermis may form sculptures of three distinct size classes, which are not necessarily homologous. The function of these sculptures is still poorly understood. There is evidence that the sculptures influence the optical properties of the surfaces and thereby the radiation balance (and heat stress) of the leaves.From recent research, it has become evident that the two different types of epidermis are the anatomical expression of two different strategies for survival under extreme conditions. Species with the idioblast type of epidermis exhibit a strategy that might be called ‘opportunistic’. These species have developed a mechanism of gaining water from the atmosphere during the night under favourable conditions. There are indications that the idioblasts are involved in this mechanism
    corecore