6 research outputs found

    Evaluation of WGS performance for bacterial pathogen characterization with the Illumina technology optimized for time-critical situations

    Get PDF
    Whole genome sequencing (WGS) has become the reference standard for bacterial outbreak investigation and pathogen typing, providing a resolution unattainable with conventional molecular methods. Data generated with Illumina sequencers can however only be analysed after the sequencing run has finished, thereby losing valuable time during emergency situations. We evaluated both the effect of decreasing overall run time, and also a protocol to transfer and convert intermediary files generated by Illumina sequencers enabling real-time data analysis for multiple samples part of the same ongoing sequencing run, as soon as the forward reads have been sequenced. To facilitate implementation for laboratories operating under strict quality systems, extensive validation of several bioinformatics assays (16S rRNA species confirmation, gene detection against virulence factor and antimicrobial resistance databases, SNP-based antimicrobial resistance detection, serotype determination, and core genome multilocus sequence typing) for three bacterial pathogens (Mycobacterium tuberculosis, Neisseria meningitidis, and Shiga-toxin producing Escherichia coli) was performed by evaluating performance in function of the two most critical sequencing parameters, i.e. read length and coverage. For the majority of evaluated bioinformatics assays, actionable results could be obtained between 14 and 22 h of sequencing, decreasing the overall sequencing-to- results time by more than half. This study aids in reducing the turn-around time of WGS analysis by facilitating a faster response in time-critical scenarios and provides recommendations for time-optimized WGS with respect to required read length and coverage to achieve a minimum level of performance for the considered bioinformatics assay(s), which can also be used to maximize the cost-effectiveness of routine surveillance sequencing when response time is not essential.The Belgian Federal Public Service of Health, Food Chain Safety and Environment and Sciensano RP-PJ - Belgium.https://www.microbiologyresearch.org/content/journal/mgenam2022Genetic

    Validation strategy of a bioinformatics whole genome sequencing workflow for Shiga toxin-producing Escherichia coli using a reference collection extensively characterized with conventional methods

    Get PDF
    Whole genome sequencing (WGS) enables complete characterization of bacterial pathogenic isolates at single nucleotide resolution, making it the ultimate tool for routine surveillance and outbreak investigation. The lack of standardization, and the variation regarding bioinformatics workflows and parameters, however, complicates interoperability among (inter)national laboratories. We present a validation strategy applied to a bioinformatics workflow for Illumina data that performs complete characterization of Shiga toxin-producing Escherichia coli (STEC) isolates including antimicrobial resistance prediction, virulence gene detection, serotype prediction, plasmid replicon detection and sequence typing. The workflow supports three commonly used bioinformatics approaches for the detection of genes and alleles: alignment with blast+, kmer-based read mapping with KMA, and direct read mapping with SRST2. A collection of 131 STEC isolates collected from food and human sources, extensively characterized with conventional molecular methods, was used as a validation dataset. Using a validation strategy specifically adopted to WGS, we demonstrated high performance with repeatability, reproducibility, accuracy, precision, sensitivity and specificity above 95 % for the majority of all assays. The WGS workflow is publicly available as a ‘push-button’ pipeline at https://galaxy.sciensano.be. Our validation strategy and accompanying reference dataset consisting of both conventional and WGS data can be used for characterizing the performance of various bioinformatics workflows and assays, facilitating interoperability between laboratories with different WGS and bioinformatics set-ups

    Validation of a Bioinformatics Workflow for Routine Analysis of Whole-Genome Sequencing Data and Related Challenges for Pathogen Typing in a European National Reference Center: Neisseria meningitidis as a Proof-of-Concept

    Get PDF
    Despite being a well-established research method, the use of whole-genome sequencing (WGS) for routine molecular typing and pathogen characterization remains a substantial challenge due to the required bioinformatics resources and/or expertise. Moreover, many national reference laboratories and centers, as well as other laboratories working under a quality system, require extensive validation to demonstrate that employed methods are “fit-for-purpose” and provide high-quality results. A harmonized framework with guidelines for the validation of WGS workflows does currently, however, not exist yet, despite several recent case studies highlighting the urgent need thereof. We present a validation strategy focusing specifically on the exhaustive characterization of the bioinformatics analysis of a WGS workflow designed to replace conventionally employed molecular typing methods for microbial isolates in a representative small-scale laboratory, using the pathogen Neisseria meningitidis as a proof-of-concept. We adapted several classically employed performance metrics specifically toward three different bioinformatics assays: resistance gene characterization (based on the ARG-ANNOT, ResFinder, CARD, and NDARO databases), several commonly employed typing schemas (including, among others, core genome multilocus sequence typing), and serogroup determination. We analyzed a core validation dataset of 67 well-characterized samples typed by means of classical genotypic and/or phenotypic methods that were sequenced in-house, allowing to evaluate repeatability, reproducibility, accuracy, precision, sensitivity, and specificity of the different bioinformatics assays. We also analyzed an extended validation dataset composed of publicly available WGS data for 64 samples by comparing results of the different bioinformatics assays against results obtained from commonly used bioinformatics tools. We demonstrate high performance, with values for all performance metrics >87%, >97%, and >90% for the resistance gene characterization, sequence typing, and serogroup determination assays, respectively, for both validation datasets. Our WGS workflow has been made publicly available as a “push-button” pipeline for Illumina data at https://galaxy.sciensano.be to showcase its implementation for non-profit and/or academic usage. Our validation strategy can be adapted to other WGS workflows for other pathogens of interest and demonstrates the added value and feasibility of employing WGS with the aim of being integrated into routine use in an applied public health setting

    Phylogenomic investigation of increasing fluoroquinolone resistance among Belgian cases of Shigellosis between 2013 and 2018 indicates both travel-related imports and domestic circulation

    No full text
    Shigellosis is an acute enteric infection caused mainly by the species Shigella flexneri and Shigella sonnei. Since surveillance of these pathogens indicated an increase in ciprofloxacin-resistant samples collected in Belgium between 2013 and 2018, a subset of 148 samples was analyzed with whole genome sequencing (WGS) to investigate their dispersion and underlying genomic features associated with ciprofloxacin resistance. A comparison between observed phenotypes and WGS-based resistance prediction to ciprofloxacin revealed perfect correspondence for all samples. Core genome multi-locus sequence typing and single nucleotide polymorphism-typing were used for phylogenomic investigation to characterize the spread of these infections within Belgium, supplemented with data from international reference collections to place the Belgian isolates within their global context. For S. flexneri, substantial diversity was observed with ciprofloxacin-resistant isolates assigned to several phylogenetic groups. Besides travel-related imports, several clusters of highly similar Belgian isolates could not be linked directly to international travel suggesting the presence of domestically circulating strains. For S. sonnei, Belgian isolates were all limited to lineage III, and could often be traced back to travel to countries in Asia and Africa, sometimes followed by domestic circulation. For both species, several clusters of isolates obtained exclusively from male patients were observed. Additionally, we illustrated the limitations of conventional serotyping of S. flexneri, which was impacted by serotype switching. This study contributes to a better understanding of the spread of shigellosis within Belgium and internationally, and highlights the added value of WGS for the surveillance of this pathogen.SUPPLEMENTARY MATERIAL : FIGURE S1: Occurrence of gyrA p83, gyrA p87 and parC p80 mutations associated with ciprofloxacin resistance in the background collection. FIGURE S2: Network representation of minimum spanning trees for S. sonnei (A) and S. flexneri (B), TABLE S1: Accession number and metadata for the Belgian samples, TABLE S2: Read trimming statistics, TABLE S3: Assembly and coverage statistics, TABLE S4: Percentage of cgMLST alleles identified, TABLE S5: In silico determined serotypes for all S. flexneri samples, TABLE S6: Detected AMR genes for all samples, TABLE S7: Detected AMR point mutations for all samples, TABLE S8: Detected qnr genes, TABLE S9: Classification of contigs carrying qnr genes, TABLE S10: Lineage and PG classification, and ciprofloxacin resistance for the Belgian samples and background collection.The NRC is partially supported by the Belgian Ministry of Social Affairs through a fund within the Health Insurance System.https://www.mdpi.com/journal/microorganismsam2022Genetic

    Targeting the 16S rRNA gene for bacterial identification in complex mixed samples : comparative evaluation of second (Illumina) and third (Oxford Nanopore Technologies) generation sequencing technologies

    No full text
    Rapid, accurate bacterial identification in biological samples is an important task for microbiology laboratories, for which 16S rRNA gene Sanger sequencing of cultured isolates is frequently used. In contrast, next-generation sequencing does not require intermediate culturing steps and can be directly applied on communities, but its performance has not been extensively evaluated. We present a comparative evaluation of second (Illumina) and third (Oxford Nanopore Technologies (ONT)) generation sequencing technologies for 16S targeted genomics using a well-characterized reference sample. Different 16S gene regions were amplified and sequenced using the Illumina MiSeq, and analyzed with Mothur. Correct classification was variable, depending on the region amplified. Using a majority vote over all regions, most false positives could be eliminated at the genus level but not the species level. Alternatively, the entire 16S gene was amplified and sequenced using the ONT MinION, and analyzed with Mothur, EPI2ME, and GraphMap. Although >99% of reads were correctly classified at the genus level, up to approximate to 40% were misclassified at the species level. Both technologies, therefore, allow reliable identification of bacterial genera, but can potentially misguide identification of bacterial species, and constitute viable alternatives to Sanger sequencing for rapid analysis of mixed samples without requiring any culturing steps

    Galaxy@Sciensano: A toolkit for NGS analysis for applied microbial genomics

    No full text
    <p>Poster on the Sciensano Galaxy publicly available instance, presented at the ELIXIR Belgium Conference, 2023: "ELIXIR Belgium: Your data, Our services, European success".</p&gt
    corecore