137 research outputs found

    The large-scale blast score ratio (LS-BSR) pipeline: a method to rapidly compare genetic content between bacterial genomes

    Get PDF
    Background. As whole genome sequence data from bacterial isolates becomes cheaper to generate, computational methods are needed to correlate sequence data with biological observations. Here we present the large-scale BLAST score ratio (LS-BSR) pipeline, which rapidly compares the genetic content of hundreds to thousands of bacterial genomes, and returns a matrix that describes the relatedness of all coding sequences (CDSs) in all genomes surveyed. This matrix can be easily parsed in order to identify genetic relationships between bacterial genomes. Although pipelines have been published that group peptides by sequence similarity, no other software performs the rapid, large-scale, full-genome comparative analyses carried out by LS-BSR. Results. To demonstrate the utility of the method, the LS-BSR pipeline was tested on 96 Escherichia coli and Shigella genomes; the pipeline ran in 163 min using 16 processors, which is a greater than 7-fold speedup compared to using a single processor. The BSR values for each CDS, which indicate a relative level of relatedness, were then mapped to each genome on an independent core genome single nucleotide polymorphism (SNP) based phylogeny. Comparisons were then used to identify clade specific CDS markers and validate the LS-BSR pipeline based on molecular markers that delineate between classical E. coli pathogenic variant (pathovar) designations. Scalability tests demonstrated that the LS-BSR pipeline can process 1,000 E. coli genomes in 27-57 h, depending upon the alignment method, using 16 processors. Conclusions. LS-BSR is an open-source, parallel implementation of the BSR algorithm, enabling rapid comparison of the genetic content of large numbers of genomes. The results of the pipeline can be used to identify specific markers between user-defined phylogenetic groups, and to identify the loss and/or acquisition of genetic information between bacterial isolates. Taxa-specific genetic markers can then be translated into clinical diagnostics, or can be used to identify broadly conserved putative therapeutic candidates

    Evolution of a pathogen: a comparative genomics analysis identifies a genetic pathway to pathogenesis in acinetobacter.

    Get PDF
    Acinetobacter baumannii is an emergent and global nosocomial pathogen. In addition to A. baumannii, other Acinetobacter species, especially those in the Acinetobacter calcoaceticus-baumannii (Acb) complex, have also been associated with serious human infection. Although mechanisms of attachment, persistence on abiotic surfaces, and pathogenesis in A. baumannii have been identified, the genetic mechanisms that explain the emergence of A. baumannii as the most widespread and virulent Acinetobacter species are not fully understood. Recent whole genome sequencing has provided insight into the phylogenetic structure of the genus Acinetobacter. However, a global comparison of genomic features between Acinetobacter spp. has not been described in the literature. In this study, 136 Acinetobacter genomes, including 67 sequenced in this study, were compared to identify the acquisition and loss of genes in the expansion of the Acinetobacter genus. A whole genome phylogeny confirmed that A. baumannii is a monophyletic clade and that the larger Acb complex is also a well-supported monophyletic group. The whole genome phylogeny provided the framework for a global genomic comparison based on a blast score ratio (BSR) analysis. The BSR analysis demonstrated that specific genes have been both lost and acquired in the evolution of A. baumannii. In addition, several genes associated with A. baumannii pathogenesis were found to be more conserved in the Acb complex, and especially in A. baumannii, than in other Acinetobacter genomes; until recently, a global analysis of the distribution and conservation of virulence factors across the genus was not possible. The results demonstrate that the acquisition of specific virulence factors has likely contributed to the widespread persistence and virulence of A. baumannii. The identification of novel features associated with transcriptional regulation and acquired by clades in the Acb complex presents targets for better understanding the evolution of pathogenesis and virulence in the expansion of the genus

    Draft genome sequences of two Bulgarian Bacillus anthracis strains

    Get PDF
    Bacillus anthracis strains previously isolated from Bulgaria form a unique subcluster within the A1.a cluster that is typical for isolates from southeastern Europe. Here, we report the draft genome sequences of two Bulgarian B. anthracis strains belonging to the A branch (A.Br.) 008/009 canonical single nucleotide polymorphism (SNP) group of the major A branch

    Using Whole Genome Analysis to Examine Recombination across Diverse Sequence Types of Staphylococcus aureus

    Get PDF
    Staphylococcus aureus is an important clinical pathogen worldwide and understanding this organism\u27s phylogeny and, in particular, the role of recombination, is important both to understand the overall spread of virulent lineages and to characterize outbreaks. To further elucidate the phylogeny of S. aureus, 35 diverse strains were sequenced using whole genome sequencing. In addition, 29 publicly available whole genome sequences were included to create a single nucleotide polymorphism (SNP)-based phylogenetic tree encompassing 11 distinct lineages. All strains of a particular sequence type fell into the same clade with clear groupings of the major clonal complexes of CC8, CC5, CC30, CC45 and CC1. Using a novel analysis method, we plotted the homoplasy density and SNP density across the whole genome and found evidence of recombination throughout the entire chromosome, but when we examined individual clonal lineages we found very little recombination. However, when we analyzed three branches of multiple lineages, we saw intermediate and differing levels of recombination between them. These data demonstrate that in S. aureus, recombination occurs across major lineages that subsequently expand in a clonal manner. Estimated mutation rates for the CC8 and CC5 lineages were different from each other. While the CC8 lineage rate was similar to previous studies, the CC5 lineage was 100-fold greater. Fifty known virulence genes were screened in all genomes in silico to determine their distribution across major clades. Thirty-three genes were present variably across clades, most of which were not constrained by ancestry, indicating horizontal gene transfer or gene loss

    Genome sequence of Burkholderia pseudomallei NCTC 13392

    Get PDF
    Here, we describe the draft genome sequence of Burkholderia pseudomallei NCTC 13392. This isolate has been distributed as K96243, but distinct genomic differences have been identified. The genomic sequence of this isolate will provide the genomic context for previously conducted functional studies

    Transcriptional modulation of enterotoxigenic Escherichia coli virulence genes in response to epithelial cell interactions

    Get PDF
    Enterotoxigenic Escherichia coli (ETEC) strains are a leading cause of morbidity and mortality due to diarrheal illness in developing countries. There is currently no effective vaccine against these important pathogens. Because genes modulated by pathogen-host interactions potentially encode putative vaccine targets, we investigated changes in gene expression and surface morphology of ETEC upon interaction with intestinal epithelial cells in vitro. Pan-genome microarrays, quantitative reverse transcriptase PCR (qRT-PCR), and transcriptional reporter fusions of selected promoters were used to study changes in ETEC transcriptomes. Flow cytometry, immunofluorescence microscopy, and scanning electron microscopy were used to investigate alterations in surface antigen expression and morphology following pathogen-host interactions. Following host cell contact, genes for motility, adhesion, toxin production, immunodominant peptides, and key regulatory molecules, including cyclic AMP (cAMP) receptor protein (CRP) and c-di-GMP, were substantially modulated. These changes were accompanied by visible changes in both ETEC architecture and the expression of surface antigens, including a novel highly conserved adhesin molecule, EaeH. The studies reported here suggest that pathogen-host interactions are finely orchestrated by ETEC and are characterized by coordinated responses involving the sequential deployment of multiple virulence molecules. Elucidation of the molecular details of these interactions could highlight novel strategies for development of vaccines for these important pathogens

    Diversity, Virulence, and Antimicrobial Resistance in Isolates From the Newly Emerging Klebsiella pneumoniae ST101 Lineage

    Get PDF
    The global dissemination of Klebsiella pneumoniae and Klebsiella pneumoniae carbapenemase (KPC) has been largely attributed to a few high-risk sequence types (STs) (ST258, ST11, ST512) associated with human disease. ST101 is an emerging clone that has been identified in different parts of the world with the potential to become a global, persistent public health threat. Recent research suggests the ST101 lineage is associated with an 11% increase in mortality rate in comparison to non-ST101 infections. In this study, we generated a high-quality, near-finished genome assembly of a multidrug-resistant (MDR) isolate from Italy (isolate 4743) that is a single locus variant of ST101 (ST1685). We demonstrate that the 4743 genome contains virulence features such as an integrative conjugative element carrying the yersiniabactin siderophore (ICEKp3), the mannose-resistant Klebsiella-like (type III) fimbriae cluster (mrkABCDFHIJ), the ferric uptake system (kfuABC), the yersiniabactin receptor gene fyuA, a capsular K type K17, and an O antigen type of O1. K. pneumoniae 4743 carries the blaKPC-2 carbapenemase gene along with genes conferring resistance to aminoglycosides, beta-lactams, fluoroquinolones, fosfomycin, macrolides, lincosamides, and streptogramin B. A comparative genomics analysis of 44 ST101 genomes as well as newly sequenced isolate 4743 identified variable antimicrobial resistance (AMR) resistance profiles and incompatibility plasmid types, but similar virulence factor profiles. Using Bayesian methodologies, we estimate the common ancestor for the ST101 lineage emerged in 1990 (95% HPD: 1965 to 2007) and isolates within the lineage acquired blaKPC after the divergence from its parental clonal group and dissemination. The identification of virulence factors and antibiotic resistance genes acquired by this newly emerging clone provides insight into the reported increased mortality rates and highlights its potential success as a persistent nosocomial pathogen. With a combination of both colistin resistance, carbapenem resistance, and several known virulence factors, the ST101 genetic repertoire may be a “perfect storm” allowing for a newly emerging, high-risk, extensively antibiotic resistant clone. This high-risk clone appears adept at acquiring resistance and may perpetuate the dissemination of extensive antimicrobial resistance. Greater focus on the acquisition of virulence factors and antibiotic resistance genes is crucial for understanding the spread of antibiotic resistance

    Examination of the enterotoxigenic Escherichia coli population structure during human infection

    Get PDF
    Enterotoxigenic E. coli (ETEC) can cause severe diarrhea and death in children in developing countries; however, bacterial diversity in natural infection is uncharacterized. In this study, we explored the natural population variation of ETEC from individuals with cholera-like diarrhea. Genomic sequencing and comparative analysis of multiple ETEC isolates from twelve cases of severe diarrhea demonstrated clonal populations in the majority of subjects (10/12). In contrast, a minority of individuals (2/12) yielded phylogenomically divergent ETEC isolates. Detailed examination revealed that isolates also differed in virulence factor content. These genomic data suggest that severe, cholera-like ETEC infections are largely caused by a clonal population of organisms within individual patients. Additionally, the isolation of similar clones from geographically and temporally dispersed cases with similar clinical presentations suggests that some isolates are particularly suited for virulence. The identification of multiple genomically diverse isolates with variable virulence factor profiles from a single subject highlights the dynamic nature of ETEC, as well as a potential weakness in the examination of cultures obtained from a single colony in clinical settings. These findings have implications for vaccine design and provide a framework for the study of population variation in other human pathogens

    Comparative genomic analyses reveal broad diversity in botulinum-toxin-producing Clostridia

    Get PDF
    Background: Clostridium botulinum is a diverse group of bacteria characterized by the production of botulinum neurotoxin. Botulinum neurotoxins are classified into serotypes (BoNT/A-G), which are produced by six species/Groups of Clostridia, but the genetic background of the bacteria remains poorly understood. The purpose of this study was to use comparative genomics to provide insights into the genetic diversity and evolutionary history of bacteria that produce the potent botulinum neurotoxin. Results: Comparative genomic analyses of over 170 Clostridia genomes, including our draft genome assemblies for 59 newly sequenced Clostridia strains from six continents and publicly available genomic data, provided in-depth insights into the diversity and distribution of BoNT-producing bacteria. These newly sequenced strains included Group I and II strains that express BoNT/A,/B,/E, or/F as well as bivalent strains. BoNT-producing Clostridia and closely related Clostridia species were delineated with a variety of methods including 16S rRNA gene, concatenated marker genes, core genome and concatenated multi-locus sequencing typing (MLST) gene phylogenies that related whole genome sequenced strains to publicly available strains and sequence types. These analyses illustrated the phylogenetic diversity in each Group and the diversity of genomic backgrounds that express the same toxin type or subtype. Comparisons of the botulinum neurotoxin genes did not identify novel toxin types or variants. Conclusions: This study represents one of the most comprehensive analyses of whole genome sequence data for Group I and II BoNT-producing strains. Read data and draft genome assemblies generated for 59 isolates will be a resource to the research community. Core genome phylogenies proved to be a powerful tool for differentiating BoNT-producing strains and can provide a framework for the study of these bacteria. Comparative genomic analyses of Clostridia species illustrate the diversity of botulinum-neurotoxin-producing strains and the plasticity of the genomic backgrounds in which bont genes are found.Peer reviewe

    Population Structure and Genetic Diversity among Isolates of Coccidioides posadasii in Venezuela and Surrounding Regions

    Get PDF
    Coccidioides posadasii is a pathogenic fungus that causes coccidioidomycosis in many arid regions of the Americas. One of these regions is bordered by the Caribbean Sea, and the surrounding landscape may play an important role in the dispersion of C. posadasii across South America through southeastern Mexico, Honduras, Guatemala, and Venezuela. Comparative phylogenomic analyses of C. posadasii reveal that clinical strains from Venezuela are genetically distinct from the North American populations found in (i) Arizona and (ii) Texas, Mexico, and the rest of South America (TX/MX/SA). We find evidence for admixture between the Venezuela and the North American populations of C. posadasii in Central America. Additionally, the proportion of Venezuelan alleles in the admixed population decreases as latitude (and distance from Venezuela) increases. Our results indicate that the population in Venezuela may have been subjected to a recent bottleneck and shows a strong population structure. This analysis provides insight into potential for Coccidioides spp. to invade new regions.IMPORTANCE Valley Fever is a fungal disease caused by two species of fungi: Coccidioides immitis and C. posadasii These fungi are found throughout the arid regions of North and South America; however, our understanding of genetic diversity and disease in South America is limited. In this report, we analyze 10 new genomes of Coccidioides posadasii from regions bordering the Caribbean Sea. We show that these populations are distinct and that isolates from Venezuela are likely a result of a recent bottleneck. These data point to patterns that might be observed when investigating recently established populations.NIH/NIAIDUnited States Department of Health & Human ServicesNational Institutes of Health (NIH) - USANIH National Institute of Allergy & Infectious Diseases (NIAID) [R21AI28536]; NIH/NIGMSUnited States Department of Health & Human ServicesNational Institutes of Health (NIH) - USANIH National Institute of General Medical Sciences (NIGMS) [R01GM121750]Open access journalThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]
    • …
    corecore