14 research outputs found

    Detailed evaluation of data analysis tools for subtyping of bacterial isolates based on whole genome sequencing : Neisseria meningitidis as a proof of concept

    Whole genome sequencing is increasingly recognized as the most informative approach for characterization of bacterial isolates. Success of the routine use of this technology in public health laboratories depends on the availability of well-characterized and verified data analysis methods. However, multiple subtyping workflows are now often being used for a single organism, and differences between them are not always well described. Moreover, methodologies for comparison of subtyping workflows and assessment of their performance are only beginning to emerge. The current work focuses on the detailed comparison of WGS-based subtyping workflows and evaluation of their suitability for the organism and the research context in question. We evaluated the performance of pipelines used for subtyping of Neisseria meningitidis, including the currently widely applied cgMLST approach and different SNP-based methods. In addition, the impact of the use of different tools for detection and filtering of recombinant regions and of different reference genomes was tested. Our benchmarking analysis included both assessment of technical performance of the pipelines and functional comparison of the generated genetic distance matrices and phylogenetic trees. It was carried out using replicate sequencing datasets of high and low coverage, consisting mainly of isolates belonging to the clonal complex 269. We demonstrated that cgMLST and some of the SNP-based subtyping workflows showed very good performance characteristics and highly similar genetic distance matrices and phylogenetic trees with isolates belonging to the same clonal complex. However, only two of the tested workflows demonstrated reproducible results for a group of more closely related isolates. Additionally, results of the SNP-based subtyping workflows were to some extent dependent on the reference genome used. Interestingly, the use of recombination-filtering software generally reduced the similarity between the gene-by-gene and SNP-based methodologies for subtyping of N. meningitidis. Our study, where N. meningitidis was taken as an example, clearly highlights the need for more comparative benchmarking studies to eventually contribute to a justified use of a specific WGS data analysis workflow within an international public health laboratory context.

    Strain-level metagenomic data analysis of enriched in vitro and in silico spiked food samples : paving the way towards a culture-free foodborne outbreak investigation using STEC as a case study

    Culture-independent diagnostics, such as metagenomic shotgun sequencing of food samples, could not only reduce the turnaround time of samples in an outbreak investigation, but also allow the detection of multi-species and multi-strain outbreaks. For successful foodborne outbreak investigation using a metagenomic approach, it is, however, necessary to bioinformatically separate the genomes of individual strains, including strains belonging to the same species, present in a microbial community, which has up until now not been demonstrated for this application. The current work shows the feasibility of strain-level metagenomics of enriched food matrix samples making use of data analysis tools that classify reads against a sequence database. It includes a brief comparison of two database-based read classification tools, Sigma and Sparse, using a mock community obtained by in vitro spiking minced meat with a Shiga toxin-producing Escherichia coli (STEC) isolate originating from a described outbreak. The better-performing tool, Sigma, was further evaluated using in silico simulated metagenomic data to explore the possibilities and limitations of this data analysis approach. The performed analysis allowed us to link the pathogenic strains from food samples to human isolates previously collected during the same outbreak, demonstrating that the metagenomic approach could be applied for the rapid source tracking of foodborne outbreaks. To our knowledge, this is the first study demonstrating a data analysis approach for detailed characterization and phylogenetic placement of multiple bacterial strains of one species from shotgun metagenomic WGS data of an enriched food sample.

    A practical method to implement strain-level metagenomics-based foodborne outbreak investigation and source tracking in routine

    The management of a foodborne outbreak depends on the rapid and accurate identification of the responsible food source. Conventional methods based on isolation of the pathogen from the food matrix and target-specific real-time polymerase chain reactions (qPCRs) are routinely used. In recent years, the use of whole genome sequencing (WGS) of bacterial isolates has proven its value to collect relevant information for strain characterization as well as tracing the origin of the contamination by linking the food isolate with the patient’s isolate with high resolution. However, the isolation of a bacterial pathogen from food matrices is often time-consuming and not always successful. Therefore, we aimed to improve outbreak investigation by developing a method that can be implemented in reference laboratories to characterize the pathogen in the food vehicle without its prior isolation and link it back to human cases. We tested and validated a shotgun metagenomics approach by spiking food pathogens in specific food matrices using the Shiga toxin-producing Escherichia coli (STEC) as a case study. Different DNA extraction kits and enrichment procedures were investigated to obtain the most practical workflow. We demonstrated the feasibility of shotgun metagenomics to obtain the same information as in ISO/TS 13136:2012 and WGS of the isolate in parallel by inferring the genome of the contaminant and characterizing it in a shorter timeframe. This was achieved in food samples containing different E. coli strains, including a combination of different STEC strains. For the first time, we also managed to link individual strains from a food product to isolates from human cases, demonstrating the power of shotgun metagenomics for rapid outbreak investigation and source tracking.

    Combining short and long read sequencing to characterize antimicrobial resistance genes on plasmids applied to an unauthorized genetically modified Bacillus

    Antimicrobial resistance (AMR) is a major public health threat. Plasmids are able to transfer AMR genes among bacterial isolates. Whole genome sequencing (WGS) is a powerful tool to monitor AMR determinants. However, plasmids are difficult to reconstruct from WGS data. This study aimed to improve the characterization, including the localization, of AMR genes using short and long read WGS strategies. We used a genetically modified (GM) Bacillus subtilis isolated as unexpected contamination in a feed additive, and therefore considered unauthorized (RASFF 2014.1249), as a case study. In GM organisms, AMR genes are used as selection markers. Because of the concern of spread of these AMR genes when present on mobile genetic elements, it is crucial to characterize their location. Our approach resulted in an assembly of one chromosome and one plasmid, each with several AMR determinants of which five are against critically important antibiotics. Interestingly, we found several plasmids, containing AMR genes, integrated in the chromosome in a repetitive region of at least 53 kb. Our findings would have been impossible using short reads only. We illustrated the added value of long read sequencing in addressing the challenges of plasmid reconstruction within the context of evaluating the risk of AMR spread.

    Application of a strain-level shotgun metagenomics approach on food samples : resolution of the source of a Salmonella food-borne outbreak

    Food-borne outbreak investigation currently relies on the time-consuming and challenging bacterial isolation from food, to be able to link food-derived strains to more easily obtained isolates from infected people. When no food isolate can be obtained, the source of the outbreak cannot be unambiguously determined. Shotgun metagenomics approaches applied to the food samples could circumvent this need for isolation from the suspected source, but require downstream strain-level data analysis to be able to accurately link to the human isolate. Until now, this approach has not yet been applied outside research settings to analyse real food-borne outbreak samples. In September 2019, a Salmonella outbreak occurred in a hotel school in Bruges, Belgium, affecting over 200 students and teachers. Following standard procedures, the Belgian National Reference Center for human salmonellosis and the National Reference Laboratory for Salmonella in food and feed used conventional analysis based on isolation, serotyping and MLVA (multilocus variable number tandem repeat analysis) comparison, followed by whole-genome sequencing, to confirm the source of the contamination over 2 weeks after receipt of the sample, which was freshly prepared tartar sauce in a meal cooked at the school. Our team used this outbreak as a case study to deliver a proof of concept for a short-read strain-level shotgun metagenomics approach for source tracking. We received two suspect food samples: the full meal and some freshly made tartar sauce served with this meal, requiring the use of raw eggs. After analysis, we could prove, without isolation, that Salmonella was present in both samples, and we obtained an inferred genome of a Salmonella enterica subsp. enterica serovar Enteritidis that could be linked back to the human isolates of the outbreak in a phylogenetic tree. These metagenomics-derived outbreak strains were separated from sporadic cases as well as from another outbreak circulating in Europe at the same time period. This is, to our knowledge, the first Salmonella food-borne outbreak investigation uniquely linking the food source using a metagenomics approach and this in a fast time frame.

    Development of a real-time PCR method for the genoserotyping of Salmonella Paratyphi B variant Java

    Discriminating between d-tartrate fermenting and non-fermenting strains of Salmonella enterica subsp. enterica serotype Paratyphi B is of major importance as these two variants have different pathogenic profiles. While d-tartrate non-fermenting S. Paratyphi B isolates are the causative agent of typhoid-like fever, d-tartrate fermenting isolates (also called variant Java) of the same serotype trigger the less dangerous gastroenteritis. The determination of S. Paratyphi B variants requires a time-consuming process and complex biochemical tests. Therefore, a quadruplex real-time PCR method, based on the allelic discrimination of molecular markers selected from the scientific literature and from whole genome sequencing data produced in-house, was developed in this study, to be applied to Salmonella isolates. This method was validated with the analysis of 178 S. Paratyphi B (d-tartrate fermenting and non-fermenting) and other serotypes reaching an accuracy, compared with the classical methods, of 98% for serotyping by slide agglutination and 100% for replacement of the biochemical test. The developed real-time PCR saves time and provides an accurate identification of a S. Paratyphi B serotype and its d-tartrate fermenting profile, which is needed in routine laboratories for fast and efficient diagnostics.

    Comparison of SNP-based subtyping workflows for bacterial isolates using WGS data, applied to Salmonella enterica serotype Typhimurium and serotype 1,4,[5],12:i:-

    Whole genome sequencing represents a promising new technology for subtyping of bacterial pathogens. Besides the technological advances which have pushed the approach forward, the last years have been marked by considerable evolution of the whole genome sequencing data analysis methods. Prior to application of the technology as a routine epidemiological typing tool, however, reliable and efficient data analysis strategies need to be identified among the wide variety of the emerged methodologies. In this work, we have compared three existing SNP-based subtyping workflows using a benchmark dataset of 32 Salmonella enterica subsp. enterica serovar Typhimurium and serovar 1,4,[5],12:i:- isolates including five isolates from a confirmed outbreak and three isolates obtained from the same patient at different time points. The analysis was carried out using the original (high-coverage) and a down-sampled (low-coverage) dataset and two different reference genomes. All three tested workflows, namely CSI Phylogeny-based workflow, CFSAN-based workflow and PHEnix-based workflow, were able to correctly group the confirmed outbreak isolates and isolates from the same patient with all combinations of reference genomes and datasets. However, the workflows differed strongly with respect to the SNP distances between isolates and sensitivity towards sequencing coverage, which could be linked to the specific data analysis strategies used therein. To demonstrate the effect of particular data analysis steps, several modifications of the existing workflows were also tested. This allowed us to propose data analysis schemes most suitable for routine SNP-based subtyping applied to S. Typhimurium and S. 1,4,[5],12:i:-. Results presented in this study illustrate the importance of using correct data analysis strategies and of defining, benchmarking and fine-tuning parameters applied within routine data analysis pipelines to obtain optimal results.