36 research outputs found

    MGmapper: Reference based mapping and taxonomy annotation of metagenomics sequence reads

    An increasing number of species and gene identification studies rely on next generation sequence analysis of either single isolate or metagenomics samples. Several methods are available to perform taxonomic annotations, and a previous metagenomics benchmark study has shown that a vast number of false positive species annotations are a problem unless thresholds or post-processing are applied to differentiate between correct and false annotations. MGmapper is a package to process raw next generation sequence data and perform reference based sequence assignment, followed by a post-processing analysis to produce reliable taxonomy annotation at species and strain level resolution. An in-vitro bacterial mock community sample comprising 8 genera, 11 species and 12 strains was previously used to benchmark metagenomics classification methods. After applying a post-processing filter, we obtained 100% correct taxonomy assignments at species and genus level. Sensitivity and precision of 75% were obtained for strain level annotations. A comparison between MGmapper and Kraken at species level shows that MGmapper assigns taxonomy using 84.8% of the sequence reads, compared with 70.5% for Kraken; both methods identified all species with no false positives. Extensive read count statistics are provided in plain text and Excel sheets for both rejected and accepted taxonomy annotations. The use of custom databases is possible for the command-line version of MGmapper, and the complete pipeline is freely available as a Bitbucket package (https://bitbucket.org/genomicepidemiology/mgmapper). A web version (https://cge.cbs.dtu.dk/services/MGmapper) provides the basic functionality for analysis of small fastq datasets.
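As a rough illustration of how strain-level sensitivity and precision figures like those quoted above are computed, the sketch below compares a set of expected strains with a set of predicted strains. The strain labels and the split into 9 correct calls, 3 missed strains and 3 false calls are hypothetical, chosen only so that both metrics come out at 75%; the abstract does not report the underlying counts, and this is not MGmapper's own code.

```python
def precision_and_sensitivity(expected, predicted):
    """Return (precision, sensitivity) for two collections of taxon labels."""
    expected, predicted = set(expected), set(predicted)
    tp = len(expected & predicted)   # taxa correctly reported
    fp = len(predicted - expected)   # taxa reported but not in the sample
    fn = len(expected - predicted)   # taxa in the sample but not reported
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, sensitivity


# Hypothetical mock community of 12 strains (labels are placeholders).
expected = {f"strain_{i}" for i in range(1, 13)}
# Hypothetical result: 9 correct strains plus 3 false positive calls.
predicted = {f"strain_{i}" for i in range(1, 10)} | {"other_a", "other_b", "other_c"}

precision, sensitivity = precision_and_sensitivity(expected, predicted)
print(f"precision={precision:.2f} sensitivity={sensitivity:.2f}")
```

Using sets keeps the comparison independent of read counts, so it describes only presence or absence of an annotation after any filtering has been applied.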

    Reads2Type: a web application for rapid microbial taxonomy identification

    BACKGROUND: Identification of bacteria may be based on sequencing and molecular analysis of a specific locus such as 16S rRNA, or a set of loci such as in multilocus sequence typing. In the near future, healthcare institutions and routine diagnostic microbiology laboratories may need to sequence the entire genome of microbial isolates. Therefore we have developed Reads2Type, a web-based tool for taxonomy identification based on whole bacterial genome sequence data. RESULTS: Raw sequencing data provided by the user are mapped against a set of marker probes derived from currently available complete bacterial genomes. Using a dataset of 1003 whole genome sequenced bacteria from various sequencing platforms, Reads2Type was able to identify the species with 99.5% accuracy within minutes. CONCLUSIONS: In comparison with other tools, Reads2Type offers the advantage of not needing to transfer sequencing files, as the entire computational analysis is done on the computer of the person using the web application. This also prevents data privacy issues from arising. The Reads2Type tool is available at http://www.cbs.dtu.dk/~dhany/reads2type.html

    Phase behavior of supported lipid bilayers: A systematic study by coarse-grained molecular dynamics simulations

    Solid-supported lipid bilayers are utilized by experimental scientists as models for biological membranes because of their stability. However, compared to free-standing bilayers, their close proximity to the substrate may affect their phase behavior. As this is still poorly understood, and few computational studies have been performed on such systems thus far, here we present the results from a systematic study based on molecular dynamics simulations of an implicit-solvent model for solid-supported lipid bilayers with varying lipid-substrate interactions. The attractive interaction between the substrate and the lipid head groups that are closest to the substrate leads to an increased translocation of the lipids from the distal to the proximal bilayer leaflet. This leads to a transbilayer imbalance of the lipid density, with the lipid density of the proximal leaflet higher than that of the distal leaflet. Consequently, the order parameter of the proximal leaflet is found to be higher than that of the distal leaflet; the stronger the lipid interaction, the stronger the effect. The proximal leaflet exhibits gel and fluid phases with an abrupt melting transition between the two phases. In contrast, below the melting temperature of the proximal leaflet, the distal leaflet is inhomogeneous with coexisting gel and fluid domains. The size of the fluid domains increases with the strength of the lipid interaction. At low temperatures, the inhomogeneity of the distal leaflet is due to its reduced lipid density.

    Tools and data services registry: a community effort to document bioinformatics resources

    Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand. Here we present a community-driven curation effort, supported by ELIXIR—the European infrastructure for biological information—that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners. As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools