13 research outputs found

    The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching

    Get PDF
    open access articleBackground: The Chemistry Development Kit (CDK) is a widely used open source cheminformatics toolkit, providing data structures to represent chemical concepts along with methods to manipulate such structures and perform computations on them. The library implements a wide variety of cheminformatics algorithms ranging from chemical structure canonicalization to molecular descriptor calculations and pharmacophore perception. It is used in drug discovery, metabolomics, and toxicology. Over the last 10 years, the code base has grown significantly, however, resulting in many complex interdependencies among components and poor performance of many algorithms. Results: We report improvements to the CDK v2.0 since the v1.2 release series, specifically addressing the increased functional complexity and poor performance. We first summarize the addition of new functionality, such atom typing and molecular formula handling, and improvement to existing functionality that has led to significantly better performance for substructure searching, molecular fingerprints, and rendering of molecules. Second, we outline how the CDK has evolved with respect to quality control and the approaches we have adopted to ensure stability, including a code review mechanism. Conclusions: This paper highlights our continued efforts to provide a community driven, open source cheminformatics library, and shows that such collaborative projects can thrive over extended periods of time, resulting in a high-quality and performant library. By taking advantage of community support and contributions, we show that an open source cheminformatics project can act as a peer reviewed publishing platform for scientific computing software

    <scp>ReSurveyEurope</scp>: A database of resurveyed vegetation plots in Europe

    Get PDF
    AbstractAimsWe introduce ReSurveyEurope — a new data source of resurveyed vegetation plots in Europe, compiled by a collaborative network of vegetation scientists. We describe the scope of this initiative, provide an overview of currently available data, governance, data contribution rules, and accessibility. In addition, we outline further steps, including potential research questions.ResultsReSurveyEurope includes resurveyed vegetation plots from all habitats. Version 1.0 of ReSurveyEurope contains 283,135 observations (i.e., individual surveys of each plot) from 79,190 plots sampled in 449 independent resurvey projects. Of these, 62,139 (78%) are permanent plots, that is, marked in situ, or located with GPS, which allow for high spatial accuracy in resurvey. The remaining 17,051 (22%) plots are from studies in which plots from the initial survey could not be exactly relocated. Four data sets, which together account for 28,470 (36%) plots, provide only presence/absence information on plant species, while the remaining 50,720 (64%) plots contain abundance information (e.g., percentage cover or cover–abundance classes such as variants of the Braun‐Blanquet scale). The oldest plots were sampled in 1911 in the Swiss Alps, while most plots were sampled between 1950 and 2020.ConclusionsReSurveyEurope is a new resource to address a wide range of research questions on fine‐scale changes in European vegetation. The initiative is devoted to an inclusive and transparent governance and data usage approach, based on slightly adapted rules of the well‐established European Vegetation Archive (EVA). ReSurveyEurope data are ready for use, and proposals for analyses of the data set can be submitted at any time to the coordinators. Still, further data contributions are highly welcome.</jats:sec

    Photochemistry of Matrix-Isolated Phenylsilyl Azides

    No full text

    C8H6 thermal chemistry. 7-Methylenecyclohepta-1,3,5- dienyne (heptafulvyne) by flash vacuum thermolysis]matrix isolation. Chemical activation in the rearrangements of phenylenedicarbenes and of benzocyclobutadiene to phenylacetylene

    No full text
    Methylenecycloheptadienyne 11 (heptafulvyne) is obtained very cleanly by flash vacuum thermolysis (FVT) of the diazobenzocyclobutene precursor 8 at 400°C followed by isolation as a neat solid at 77K or in an Ar matrix at 7-10K. Compound 11 is a yellow solid, stable till ∼-100°C in the neat state. The diazo compound itself (2) is observable by IR spectroscopy following mild decomposition of the tosylhydrazone salt 1 at 115°C. FVT of 8 at 200°C also generates diazo compound 2 as observed by IR spectroscopy and on-line mass spectrometry. FVT of 8 at 600-800°C causes rearrangement of 11 to phenylacetylene 12 and benzocyclobutadiene 13. Mechanisms for the rearrangements are proposed. Facile rearrangement of benzocyclobutadiene to phenylacetylene is ascribed to chemical activation, which is also seen to be involved in the rearrangement of p-, m-, and o-phenylenebiscarbenes 25-27 to phenylacetylene 12

    Introduction à la biologie des Carapidae

    Get PDF
    Background: Contemporary biological research integrates neighboring scientific domains to answer complex questions in fields such as systems biology and drug discovery. This calls for tools that are intuitive to use, yet flexible to adapt to new tasks. Results: Bioclipse is a free, open source workbench with advanced features for the life sciences. Version 2.0 constitutes a complete rewrite of Bioclipse, and delivers a stable, scalable integration platform for developers and an intuitive workbench for end users. All functionality is available both from the graphical user interface and from a built-in novel domain-specific language, supporting the scientist in interdisciplinary research and reproducible analyses through advanced visualization of the inputs and the results. New components for Bioclipse 2 include a rewritten editor for chemical structures, a table for multiple molecules that supports gigabyte-sized files, as well as a graphical editor for sequences and alignments. Conclusion: Bioclipse 2 is equipped with advanced tools required to carry out complex analysis in the fields of bio- and cheminformatics. Developed as a Rich Client based on Eclipse, Bioclipse 2 leverages on today's powerful desktop computers for providing a responsive user interface, but also takes full advantage of the Web and networked (Web/Cloud) services for more demanding calculations or retrieval of data. The fact that Bioclipse 2 is based on an advanced and widely used service platform ensures wide extensibility, making it easy to add new algorithms, visualizations, as well as scripting commands. The intuitive tools for end users and the extensible architecture make Bioclipse 2 ideal for interdisciplinary and integrative research. Bioclipse 2 is released under the Eclipse Public License (EPL), a flexible open source license that allows additional plugins to be of any license. Bioclipse 2 is implemented in Java and supported on all major platforms; Source code and binaries are freely available at http://www.bioclipse.net

    Bioclipse 2: A scriptable integration platform for the life sciences

    No full text
    Background: Contemporary biological research integrates neighboring scientific domains to answer complex questions in fields such as systems biology and drug discovery. This calls for tools that are intuitive to use, yet flexible to adapt to new tasks. Results: Bioclipse is a free, open source workbench with advanced features for the life sciences. Version 2.0 constitutes a complete rewrite of Bioclipse, and delivers a stable, scalable integration platform for developers and an intuitive workbench for end users. All functionality is available both from the graphical user interface and from a built-in novel domain-specific language, supporting the scientist in interdisciplinary research and reproducible analyses through advanced visualization of the inputs and the results. New components for Bioclipse 2 include a rewritten editor for chemical structures, a table for multiple molecules that supports gigabyte-sized files, as well as a graphical editor for sequences and alignments. Conclusion: Bioclipse 2 is equipped with advanced tools required to carry out complex analysis in the fields of bio- and cheminformatics. Developed as a Rich Client based on Eclipse, Bioclipse 2 leverages on today's powerful desktop computers for providing a responsive user interface, but also takes full advantage of the Web and networked (Web/Cloud) services for more demanding calculations or retrieval of data. The fact that Bioclipse 2 is based on an advanced and widely used service platform ensures wide extensibility, making it easy to add new algorithms, visualizations, as well as scripting commands. The intuitive tools for end users and the extensible architecture make Bioclipse 2 ideal for interdisciplinary and integrative research. Bioclipse 2 is released under the Eclipse Public License (EPL), a flexible open source license that allows additional plugins to be of any license. Bioclipse 2 is implemented in Java and supported on all major platforms; Source code and binaries are freely available at http://www.bioclipse.net
    corecore