46,916 research outputs found

    Detailed evaluation of data analysis tools for subtyping of bacterial isolates based on whole genome sequencing : Neisseria meningitidis as a proof of concept

    Get PDF
    Whole genome sequencing is increasingly recognized as the most informative approach for characterization of bacterial isolates. Success of the routine use of this technology in public health laboratories depends on the availability of well-characterized and verified data analysis methods. However, multiple subtyping workflows are now often being used for a single organism, and differences between them are not always well described. Moreover, methodologies for comparison of subtyping workflows, and assessment of their performance are only beginning to emerge. Current work focuses on the detailed comparison of WGS-based subtyping workflows and evaluation of their suitability for the organism and the research context in question. We evaluated the performance of pipelines used for subtyping of Neisseria meningitidis, including the currently widely applied cgMLST approach and different SNP-based methods. In addition, the impact of the use of different tools for detection and filtering of recombinant regions and of different reference genomes were tested. Our benchmarking analysis included both assessment of technical performance of the pipelines and functional comparison of the generated genetic distance matrices and phylogenetic trees. It was carried out using replicate sequencing datasets of high- and low-coverage, consisting mainly of isolates belonging to the clonal complex 269. We demonstrated that cgMLST and some of the SNP-based subtyping workflows showed very good performance characteristics and highly similar genetic distance matrices and phylogenetic trees with isolates belonging to the same clonal complex. However, only two of the tested workflows demonstrated reproducible results for a group of more closely related isolates. Additionally, results of the SNP-based subtyping workflows were to some level dependent on the reference genome used. Interestingly, the use of recombination-filtering software generally reduced the similarity between the gene-by-gene and SNP-based methodologies for subtyping of N. meningitidis. Our study, where N. meningitidis was taken as an example, clearly highlights the need for more benchmarking comparative studies to eventually contribute to a justified use of a specific WGS data analysis workflow within an international public health laboratory context

    Taxonomy of anaerobic digestion microbiome reveals biases associated with the applied high throughput sequencing strategies

    Get PDF
    In the past few years, many studies investigated the anaerobic digestion microbiome by means of 16S rRNA amplicon sequencing. Results obtained from these studies were compared to each other without taking into consideration the followed procedure for amplicons preparation and data analysis. This negligence was mainly due to the lack of knowledge regarding the biases influencing specific steps of the microbiome investigation process. In the present study, the main technical aspects of the 16S rRNA analysis were checked giving special attention to the approach used for high throughput sequencing. More specifically, the microbial compositions of three laboratory scale biogas reactors were analyzed before and after addition of sodium oleate by sequencing the microbiome with three different approaches: 16S rRNA amplicon sequencing, shotgun DNA and shotgun RNA. This comparative analysis revealed that, in amplicon sequencing, abundance of some taxa (Euryarchaeota and Spirochaetes) was biased by the inefficiency of universal primers to hybridize all the templates. Reliability of the results obtained was also influenced by the number of hypervariable regions under investigation. Finally, amplicon sequencing and shotgun DNA underestimated the Methanoculleus genus, probably due to the low 16S rRNA gene copy number encoded in this taxon

    Evaluation of WGS-subtyping methods for epidemiological surveillance of foodborne salmonellosis

    Get PDF
    Background: Salmonellosis is one of the most common foodborne diseases worldwide. Although human infection by non-typhoidal Salmonella (NTS) enterica subspecies enterica is associated primarily with a self-limiting diarrhoeal illness, invasive bacterial infections (such as septicaemia, bacteraemia and meningitis) were also reported. Human outbreaks of NTS were reported in several countries all over the world including developing as well as high-income countries. Conventional laboratory methods such as pulsed field gel electrophoresis (PFGE) do not display adequate discrimination and have their limitations in epidemiological surveillance. It is therefore very crucial to use accurate, reliable and highly discriminative subtyping methods for epidemiological characterisation and outbreak investigation. Methods: Here, we used different whole genome sequence (WGS)-based subtyping methods for retrospective investigation of two different outbreaks of Salmonella Typhimurium and Salmonella Dublin that occurred in 2013 in UK and Ireland respectively. Results: Single nucleotide polymorphism (SNP)-based cluster analysis of Salmonella Typhimurium genomes revealed well supported clades, that were concordant with epidemiologically defined outbreak and confirmed the source of outbreak is due to consumption of contaminated mayonnaise. SNP-analyses of Salmonella Dublin genomes confirmed the outbreak however the source of infection could not be determined. The core genome multilocus sequence typing (cgMLST) was discriminatory and separated the outbreak strains of Salmonella Dublin from the non-outbreak strains that were concordant with the epidemiological data however cgMLST could neither discriminate between the outbreak and non-outbreak strains of Salmonella Typhimurium nor confirm that contaminated mayonnaise is the source of infection, On the other hand, other WGS-based subtyping methods including multilocus sequence typing (MLST), ribosomal MLST (rMLST), whole genome MLST (wgMLST), clustered regularly interspaced short palindromic repeats (CRISPRs), prophage sequence profiling, antibiotic resistance profile and plasmid typing methods were less discriminatory and could not confirm the source of the outbreak. Conclusions: Foodborne salmonellosis is an important concern for public health therefore, it is crucial to use accurate, reliable and highly discriminative subtyping methods for epidemiological surveillance and outbreak investigation. In this study, we showed that SNP-based analyses do not only have the ability to confirm the occurrence of the outbreak but also to provide definitive evidence of the source of the outbreak in real-time

    Microbial genomic taxonomy

    Get PDF
    A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups

    Detection of genetic diversity among Indian strains of _Xanthomonas campestris_ pv. _mangiferaeindicae_ using PCR-RAPD

    Get PDF
    The randomly amplified polymorphic DNA (RAPD) technique was used to investigate the genetic diversity in 6 strains of _Xanthomonas campestris_ pv. _mangiferaeindicae_ (_Xcmi_), the causal pathogen of mango bacterial canker disease (MBCD). The RAPD analysis was also intended to identify molecular markers, specific to the species to develop PCR-based markers for detection of _Xcmi_ in mango field and planting materials. Twenty RAPD primers (CP 1-CP 20) were evaluated to establish molecular characters and genetic variability in the genome of _Xcmi_. Among these, only 4 were found efficient for development of reproducible banding pattern. It has been observed that the largest and smallest amplified RAPD products were of 2.036 and 0.201 kbp. A total of 136 bands were scored against 6 strains of _Xcmi_. There was 7.66 per cent polymorphism in individual isolates which indicates significant polymorphism among the evaluated strains, with mean difference of 0.33 (_Xcmi_ 2 vs. _Xcmi_ 8) and 0.29 (_Xcmi_ 10 vs. _Xcmi_ 12). However, the single linkage euclidean distances were statistically significant (P>0.05), i.e., 0.58. The markers CP 5, 10, 16 and 19 were amplified in all the strains with polymorphic alleles, which indicates that these markers could be used for rapid detection of genetic variability in _Xcmi_ strains

    Oligotyping reveals stronger relationship of organic soil bacterial community structure with N-amendments and soil chemistry in comparison to that of mineral soil at Harvard Forest, MA, USA

    Get PDF
    The impact of chronic nitrogen amendments on bacterial communities was evaluated at Harvard Forest, Petersham, MA, USA. Thirty soil samples (3 treatments × 2 soil horizons × 5 subplots) were collected in 2009 from untreated (control), low nitrogen-amended (LN; 50 kg NH4NO3 ha-1 yr-1) and high nitrogen-amended (HN; 150 kg NH4NO3 ha-1 yr-1) plots. PCR-amplified partial 16S rRNA gene sequences made from soil DNA were subjected to pyrosequencing (Turlapati et al., 2013) and analyses using oligotyping. The parameters M (the minimum count of the most abundant unique sequence in an oligotype) and s (the minimum number of samples in which an oligotype is expected to be present) had to be optimized for forest soils because of high diversity and the presence of rare organisms. Comparative analyses of the pyrosequencing data by oligotyping and operational taxonomic unit clustering tools indicated that the former yields more refined units of taxonomy with sequence similarity of ≥99.5%. Sequences affiliated with four new phyla and 73 genera were identified in the present study as compared to 27 genera reported earlier from the same data (Turlapati et al., 2013). Significant rearrangements in the bacterial community structure were observed with N-amendments revealing the presence of additional genera in N-amended plots with the absence of some that were present in the control plots. Permutational MANOVA analyses indicated significant variation associated with soil horizon and N treatment for a majority of the phyla. In most cases soil horizon partitioned more variation relative to treatment and treatment effects were more evident for the organic (Org) horizon. Mantel test results for Org soil showed significant positive correlations between bacterial communities and most soil parameters including NH4 and NO3. In mineral soil, correlations were seen only with pH, NH4, and NO3. Regardless of the pipeline used, a major hindrance for such a study remains to be the lack of reference databases for forest soils
    corecore