154 research outputs found

    NASP: an accurate, rapid method for the identification of SNPs in WGS datasets that supports flexible input and output formats

    Get PDF
    Whole-genome sequencing (WGS) of bacterial isolates has become standard practice in many laboratories. Applications for WGS analysis include phylogeography and molecular epidemiology, using single nucleotide polymorphisms (SNPs) as the unit of evolution. NASP was developed as a reproducible method that scales well with the hundreds to thousands of WGS data typically used in comparative genomics applications. In this study, we demonstrate how NASP compares with other tools in the analysis of two real bacterial genomics datasets and one simulated dataset. Our results demonstrate that NASP produces similar, and often better, results in comparison with other pipelines, but is much more flexible in terms of data input types, job management systems, diversity of supported tools and output formats. We also demonstrate differences in results based on the choice of the reference genome and choice of inferring phylogenies from concatenated SNPs or alignments including monomorphic positions. NASP represents a source-available, version-controlled, unit-tested method and can be obtained from tgennorth.github.io/NASP

    The large-scale blast score ratio (LS-BSR) pipeline: a method to rapidly compare genetic content between bacterial genomes

    Get PDF
    Background. As whole genome sequence data from bacterial isolates becomes cheaper to generate, computational methods are needed to correlate sequence data with biological observations. Here we present the large-scale BLAST score ratio (LS-BSR) pipeline, which rapidly compares the genetic content of hundreds to thousands of bacterial genomes, and returns a matrix that describes the relatedness of all coding sequences (CDSs) in all genomes surveyed. This matrix can be easily parsed in order to identify genetic relationships between bacterial genomes. Although pipelines have been published that group peptides by sequence similarity, no other software performs the rapid, large-scale, full-genome comparative analyses carried out by LS-BSR. Results. To demonstrate the utility of the method, the LS-BSR pipeline was tested on 96 Escherichia coli and Shigella genomes; the pipeline ran in 163 min using 16 processors, which is a greater than 7-fold speedup compared to using a single processor. The BSR values for each CDS, which indicate a relative level of relatedness, were then mapped to each genome on an independent core genome single nucleotide polymorphism (SNP) based phylogeny. Comparisons were then used to identify clade specific CDS markers and validate the LS-BSR pipeline based on molecular markers that delineate between classical E. coli pathogenic variant (pathovar) designations. Scalability tests demonstrated that the LS-BSR pipeline can process 1,000 E. coli genomes in 27-57 h, depending upon the alignment method, using 16 processors. Conclusions. LS-BSR is an open-source, parallel implementation of the BSR algorithm, enabling rapid comparison of the genetic content of large numbers of genomes. The results of the pipeline can be used to identify specific markers between user-defined phylogenetic groups, and to identify the loss and/or acquisition of genetic information between bacterial isolates. Taxa-specific genetic markers can then be translated into clinical diagnostics, or can be used to identify broadly conserved putative therapeutic candidates

    Evolution of a pathogen: a comparative genomics analysis identifies a genetic pathway to pathogenesis in acinetobacter.

    Get PDF
    Acinetobacter baumannii is an emergent and global nosocomial pathogen. In addition to A. baumannii, other Acinetobacter species, especially those in the Acinetobacter calcoaceticus-baumannii (Acb) complex, have also been associated with serious human infection. Although mechanisms of attachment, persistence on abiotic surfaces, and pathogenesis in A. baumannii have been identified, the genetic mechanisms that explain the emergence of A. baumannii as the most widespread and virulent Acinetobacter species are not fully understood. Recent whole genome sequencing has provided insight into the phylogenetic structure of the genus Acinetobacter. However, a global comparison of genomic features between Acinetobacter spp. has not been described in the literature. In this study, 136 Acinetobacter genomes, including 67 sequenced in this study, were compared to identify the acquisition and loss of genes in the expansion of the Acinetobacter genus. A whole genome phylogeny confirmed that A. baumannii is a monophyletic clade and that the larger Acb complex is also a well-supported monophyletic group. The whole genome phylogeny provided the framework for a global genomic comparison based on a blast score ratio (BSR) analysis. The BSR analysis demonstrated that specific genes have been both lost and acquired in the evolution of A. baumannii. In addition, several genes associated with A. baumannii pathogenesis were found to be more conserved in the Acb complex, and especially in A. baumannii, than in other Acinetobacter genomes; until recently, a global analysis of the distribution and conservation of virulence factors across the genus was not possible. The results demonstrate that the acquisition of specific virulence factors has likely contributed to the widespread persistence and virulence of A. baumannii. The identification of novel features associated with transcriptional regulation and acquired by clades in the Acb complex presents targets for better understanding the evolution of pathogenesis and virulence in the expansion of the genus

    Draft genome sequences of two Bulgarian Bacillus anthracis strains

    Get PDF
    Bacillus anthracis strains previously isolated from Bulgaria form a unique subcluster within the A1.a cluster that is typical for isolates from southeastern Europe. Here, we report the draft genome sequences of two Bulgarian B. anthracis strains belonging to the A branch (A.Br.) 008/009 canonical single nucleotide polymorphism (SNP) group of the major A branch

    Using Whole Genome Analysis to Examine Recombination across Diverse Sequence Types of Staphylococcus aureus

    Get PDF
    Staphylococcus aureus is an important clinical pathogen worldwide and understanding this organism\u27s phylogeny and, in particular, the role of recombination, is important both to understand the overall spread of virulent lineages and to characterize outbreaks. To further elucidate the phylogeny of S. aureus, 35 diverse strains were sequenced using whole genome sequencing. In addition, 29 publicly available whole genome sequences were included to create a single nucleotide polymorphism (SNP)-based phylogenetic tree encompassing 11 distinct lineages. All strains of a particular sequence type fell into the same clade with clear groupings of the major clonal complexes of CC8, CC5, CC30, CC45 and CC1. Using a novel analysis method, we plotted the homoplasy density and SNP density across the whole genome and found evidence of recombination throughout the entire chromosome, but when we examined individual clonal lineages we found very little recombination. However, when we analyzed three branches of multiple lineages, we saw intermediate and differing levels of recombination between them. These data demonstrate that in S. aureus, recombination occurs across major lineages that subsequently expand in a clonal manner. Estimated mutation rates for the CC8 and CC5 lineages were different from each other. While the CC8 lineage rate was similar to previous studies, the CC5 lineage was 100-fold greater. Fifty known virulence genes were screened in all genomes in silico to determine their distribution across major clades. Thirty-three genes were present variably across clades, most of which were not constrained by ancestry, indicating horizontal gene transfer or gene loss

    Genome sequence of Burkholderia pseudomallei NCTC 13392

    Get PDF
    Here, we describe the draft genome sequence of Burkholderia pseudomallei NCTC 13392. This isolate has been distributed as K96243, but distinct genomic differences have been identified. The genomic sequence of this isolate will provide the genomic context for previously conducted functional studies

    Resolving antagonistic interactions among clinical isolates of Acinetobacter baumannii with phylogenetic analysis

    Get PDF
    Acinetobacter baumannii is a nosocomial species frequently isolated from the traumatic wounds of injured military personnel and increasingly detected in civilian healthcare facilities. Many clinical isolates of A. baumannii are drug resistant, so new treatments are needed for infections. Recently, the ability of strains of conspecific bacteria to inhibit the growth of other strains has been observed in increasing numbers of species. We previously reported on intraspecific semisolid-phase growth inhibition (antagonism) among 94 clinical isolates of A. baumannii. These antagonistic interactions may be the result of genetically-encoded molecules, so more closely related isolates would be expected to produce similar patterns of interactions caused by identically active gene products. However, the phylogeny of clinical A. baumannii below the species level has not been established for this set of isolates.In this study, we used Phylomark to identify three genetic loci that recapitulated a whole-genome phylogeny of published A. baumannii genomes and we created a parsimony-based phylogeny from the 1.2 kilobase concatenated sequences. One clade appeared to exhibit the highest incidence of antagonistic interactions against all other isolates screened, except for itself and one relatively distant clade. This clade's nearest neighbor was susceptible to the most consistent antagonistic activity by the first clade; both of these clades appear to belong to MLST ST1 with reference strain AYE. Other isolates with high rates of antagonistic activity fall outside of ST1. Future studies aim to elucidate the genetic basis of the antagonism phenotype in clinical Acinetobacter, particularly in the most antagonistic isolates. The next step will be to mine these interactions to identify expressed antimicrobial molecules with potential for drug therapy

    Transcriptional modulation of enterotoxigenic Escherichia coli virulence genes in response to epithelial cell interactions

    Get PDF
    Enterotoxigenic Escherichia coli (ETEC) strains are a leading cause of morbidity and mortality due to diarrheal illness in developing countries. There is currently no effective vaccine against these important pathogens. Because genes modulated by pathogen-host interactions potentially encode putative vaccine targets, we investigated changes in gene expression and surface morphology of ETEC upon interaction with intestinal epithelial cells in vitro. Pan-genome microarrays, quantitative reverse transcriptase PCR (qRT-PCR), and transcriptional reporter fusions of selected promoters were used to study changes in ETEC transcriptomes. Flow cytometry, immunofluorescence microscopy, and scanning electron microscopy were used to investigate alterations in surface antigen expression and morphology following pathogen-host interactions. Following host cell contact, genes for motility, adhesion, toxin production, immunodominant peptides, and key regulatory molecules, including cyclic AMP (cAMP) receptor protein (CRP) and c-di-GMP, were substantially modulated. These changes were accompanied by visible changes in both ETEC architecture and the expression of surface antigens, including a novel highly conserved adhesin molecule, EaeH. The studies reported here suggest that pathogen-host interactions are finely orchestrated by ETEC and are characterized by coordinated responses involving the sequential deployment of multiple virulence molecules. Elucidation of the molecular details of these interactions could highlight novel strategies for development of vaccines for these important pathogens

    Diversity, Virulence, and Antimicrobial Resistance in Isolates From the Newly Emerging Klebsiella pneumoniae ST101 Lineage

    Get PDF
    The global dissemination of Klebsiella pneumoniae and Klebsiella pneumoniae carbapenemase (KPC) has been largely attributed to a few high-risk sequence types (STs) (ST258, ST11, ST512) associated with human disease. ST101 is an emerging clone that has been identified in different parts of the world with the potential to become a global, persistent public health threat. Recent research suggests the ST101 lineage is associated with an 11% increase in mortality rate in comparison to non-ST101 infections. In this study, we generated a high-quality, near-finished genome assembly of a multidrug-resistant (MDR) isolate from Italy (isolate 4743) that is a single locus variant of ST101 (ST1685). We demonstrate that the 4743 genome contains virulence features such as an integrative conjugative element carrying the yersiniabactin siderophore (ICEKp3), the mannose-resistant Klebsiella-like (type III) fimbriae cluster (mrkABCDFHIJ), the ferric uptake system (kfuABC), the yersiniabactin receptor gene fyuA, a capsular K type K17, and an O antigen type of O1. K. pneumoniae 4743 carries the blaKPC-2 carbapenemase gene along with genes conferring resistance to aminoglycosides, beta-lactams, fluoroquinolones, fosfomycin, macrolides, lincosamides, and streptogramin B. A comparative genomics analysis of 44 ST101 genomes as well as newly sequenced isolate 4743 identified variable antimicrobial resistance (AMR) resistance profiles and incompatibility plasmid types, but similar virulence factor profiles. Using Bayesian methodologies, we estimate the common ancestor for the ST101 lineage emerged in 1990 (95% HPD: 1965 to 2007) and isolates within the lineage acquired blaKPC after the divergence from its parental clonal group and dissemination. The identification of virulence factors and antibiotic resistance genes acquired by this newly emerging clone provides insight into the reported increased mortality rates and highlights its potential success as a persistent nosocomial pathogen. With a combination of both colistin resistance, carbapenem resistance, and several known virulence factors, the ST101 genetic repertoire may be a “perfect storm” allowing for a newly emerging, high-risk, extensively antibiotic resistant clone. This high-risk clone appears adept at acquiring resistance and may perpetuate the dissemination of extensive antimicrobial resistance. Greater focus on the acquisition of virulence factors and antibiotic resistance genes is crucial for understanding the spread of antibiotic resistance

    Examination of the enterotoxigenic Escherichia coli population structure during human infection

    Get PDF
    Enterotoxigenic E. coli (ETEC) can cause severe diarrhea and death in children in developing countries; however, bacterial diversity in natural infection is uncharacterized. In this study, we explored the natural population variation of ETEC from individuals with cholera-like diarrhea. Genomic sequencing and comparative analysis of multiple ETEC isolates from twelve cases of severe diarrhea demonstrated clonal populations in the majority of subjects (10/12). In contrast, a minority of individuals (2/12) yielded phylogenomically divergent ETEC isolates. Detailed examination revealed that isolates also differed in virulence factor content. These genomic data suggest that severe, cholera-like ETEC infections are largely caused by a clonal population of organisms within individual patients. Additionally, the isolation of similar clones from geographically and temporally dispersed cases with similar clinical presentations suggests that some isolates are particularly suited for virulence. The identification of multiple genomically diverse isolates with variable virulence factor profiles from a single subject highlights the dynamic nature of ETEC, as well as a potential weakness in the examination of cultures obtained from a single colony in clinical settings. These findings have implications for vaccine design and provide a framework for the study of population variation in other human pathogens
    • …
    corecore