72 research outputs found

    BLASTPLOT: a PERL module to plot next generation sequencing NCBI-BLAST results

    Get PDF
    BACKGROUND: The development of Next Generation Sequencing (NGS) during the last decade has created an unprecedented amount of sequencing data, as well as the ability to rapidly sequence specimens of interest. Read-based BLAST analysis of NGS data is a common procedure especially in the case of metagenomic samples. However, coverage is usually not enough to allow for de novo assembly. This type of read-based analysis often creates the question of how the reads that align to the same sequence are distributed. The same question applies to preparation of primers or probes for microarray experiments. Although there are several packages that allow the visualization of DNA segments in relation to a reference, in most cases they require the visualization of one reference at a time and the capture of screen shots for each segment. Such a procedure could be tedious and time consuming. The field is in need of a solution that automates the capture of coverage plots for all the segments of interest. RESULTS: We have created BLASTPLOT, a PERL module to quickly plot the BLAST results from short sequences (primers, probes, reads) against reference targets. CONCLUSIONS: BLASTPLOT is a simple to use PERL module that allows the generation of PNG graphs for all the reference sequences associated with a BLAST result set

    Finishing genomes with limited resources: lessons from an ensemble of microbial genomes

    Get PDF
    While new sequencing technologies have ushered in an era where microbial genomes can be easily sequenced, the goal of routinely producing high-quality draft and finished genomes in a cost-effective fashion has still remained elusive. Due to shorter read lengths and limitations in library construction protocols, shotgun sequencing and assembly based on these technologies often results in fragmented assemblies. Correspondingly, while draft assemblies can be obtained in days, finishing can take many months and hence the time and effort can only be justified for high-priority genomes and in large sequencing centers. In this work, we revisit this issue in light of our own experience in producing finished and nearly-finished genomes for a range of microbial species in a small-lab setting. These genomes were finished with surprisingly little investments in terms of time, computational effort and lab work, suggesting that the increased access to sequencing might also eventually lead to a greater proportion of finished genomes from small labs and genomics cores

    Characterization of pPCP1 Plasmids in Yersinia pestis Strains Isolated from the Former Soviet Union

    Get PDF
    Complete sequences of 9.5-kb pPCP1 plasmids in three Yersinia pestis strains from the former Soviet Union (FSU) were determined and compared with those of pPCP1 plasmids in three well-characterized, non-FSU Y. pestis strains (KIM, CO92, and 91001). Two of the FSU plasmids were from strains C2614 and C2944, isolated from plague foci in Russia, and one plasmid was from strain C790 from Kyrgyzstan. Sequence analyses identified four sequence types among the six plasmids. The pPCP1 plasmids in the FSU strains were most genetically related to the pPCP1 plasmid in the KIM strain and least related to the pPCP1 plasmid in Y. pestis 91001. The FSU strains generally had larger pPCP1 plasmid copy numbers compared to strain CO92. Expression of the plasmid's pla gene was significantly (P ≤ .05) higher in strain C2944 than in strain CO92. Given pla's role in Y. pestis virulence, this difference may have important implications for the strain's virulence

    PheMaDB: A solution for storage, retrieval, and analysis of high throughput phenotype data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>OmniLog™ phenotype microarrays (PMs) have the capability to measure and compare the growth responses of biological samples upon exposure to hundreds of growth conditions such as different metabolites and antibiotics over a time course of hours to days. In order to manage the large amount of data produced from the OmniLog™ instrument, PheMaDB (Phenotype Microarray DataBase), a web-based relational database, was designed. PheMaDB enables efficient storage, retrieval and rapid analysis of the OmniLog™ PM data.</p> <p>Description</p> <p>PheMaDB allows the user to quickly identify records of interest for data analysis by filtering with a hierarchical ordering of Project, Strain, Phenotype, Replicate, and Temperature. PheMaDB then provides various statistical analysis options to identify specific growth pattern characteristics of the experimental strains, such as: outlier analysis, negative controls analysis (signal/background calibration), bar plots, pearson's correlation matrix, growth curve profile search, <it>k</it>-means clustering, and a heat map plot. This web-based database management system allows for both easy data sharing among multiple users and robust tools to phenotype organisms of interest.</p> <p>Conclusions</p> <p>PheMaDB is an open source system standardized for OmniLog™ PM data. PheMaDB could facilitate the banking and sharing of phenotype data. The source code is available for download at <url>http://phemadb.sourceforge.net</url>.</p

    Selective deforestation and exposure of African wildlife to bat-borne viruses

    Get PDF
    Proposed mechanisms of zoonotic virus spillover often posit that wildlife transmission and amplification precede human outbreaks. Between 2006 and 2012, the palm Raphia farinifera, a rich source of dietary minerals for wildlife, was nearly extirpated from Budongo Forest, Uganda. Since then, chimpanzees, black-and-white colobus, and red duiker were observed feeding on bat guano, a behavior not previously observed. Here we show that guano consumption may be a response to dietary mineral scarcity and may expose wildlife to bat-borne viruses. Videos from 2017–2019 recorded 839 instances of guano consumption by the aforementioned species. Nutritional analysis of the guano revealed high concentrations of sodium, potassium, magnesium and phosphorus. Metagenomic analyses of the guano identified 27 eukaryotic viruses, including a novel betacoronavirus. Our findings illustrate how “upstream” drivers such as socioeconomics and resource extraction can initiate elaborate chains of causation, ultimately increasing virus spillover risk.Peer reviewe

    Rapid Identification of Genetic Modifications in Bacillus anthracis Using Whole Genome Draft Sequences Generated by 454 Pyrosequencing

    Get PDF
    Background The anthrax letter attacks of 2001 highlighted the need for rapid identification of biothreat agents not only for epidemiological surveillance of the intentional outbreak but also for implementing appropriate countermeasures, such as antibiotic treatment, in a timely manner to prevent further casualties. It is clear from the 2001 cases that survival may be markedly improved by administration of antimicrobial therapy during the early symptomatic phase of the illness; i.e., within 3 days of appearance of symptoms. Microbiological detection methods are feasible only for organisms that can be cultured in vitro and cannot detect all genetic modifications with the exception of antibiotic resistance. Currently available immuno or nucleic acid-based rapid detection assays utilize known, organism-specific proteins or genomic DNA signatures respectively. Hence, these assays lack the ability to detect novel natural variations or intentional genetic modifications that circumvent the targets of the detection assays or in the case of a biological attack using an antibiotic resistant or virulence enhanced Bacillus anthracis, to advise on therapeutic treatments. Methodology/Principal Findings We show here that the Roche 454-based pyrosequencing can generate whole genome draft sequences of deep and broad enough coverage of a bacterial genome in less than 24 hours. Furthermore, using the unfinished draft sequences, we demonstrate that unbiased identification of known as well as heretofore-unreported genetic modifications that include indels and single nucleotide polymorphisms conferring antibiotic and phage resistances is feasible within the next 12 hours. Conclusions/Significance Second generation sequencing technologies have paved the way for sequence-based rapid identification of both known and previously undocumented genetic modifications in cultured, conventional and newly emerging biothreat agents. Our findings have significant implications in the context of whole genome sequencing-based routine clinical diagnostics as well as epidemiological surveillance of natural disease outbreaks caused by bacterial and viral agents

    Genomic Signatures of Strain Selection and Enhancement in Bacillus atrophaeus var. globigii, a Historical Biowarfare Simulant

    Get PDF
    (BG) as a simulant for biological warfare (BW) agents, knowledge of its genome composition is limited. Furthermore, the ability to differentiate signatures of deliberate adaptation and selection from natural variation is lacking for most bacterial agents. We characterized a lineage of BGwith a long history of use as a simulant for BW operations, focusing on classical bacteriological markers, metabolic profiling and whole-genome shotgun sequencing (WGS). on the nucleotide level. WGS of variants revealed that several strains were mixed but highly related populations and uncovered a progressive accumulation of mutations among the “military” isolates. Metabolic profiling and microscopic examination of bacterial cultures revealed enhanced growth of “military” isolates on lactate-containing media, and showed that the “military” strains exhibited a hypersporulating phenotype.Our analysis revealed the genomic and phenotypic signatures of strain adaptation and deliberate selection for traits that were desirable in a simulant organism. Together, these results demonstrate the power of whole-genome and modern systems-level approaches to characterize microbial lineages to develop and validate forensic markers for strain discrimination and reveal signatures of deliberate adaptation

    Whole genome sequencing of phage resistant Bacillus anthracis mutants reveals an essential role for cell surface anchoring protein CsaB in phage AP50c adsorption

    Get PDF
    BACKGROUND: Spontaneous Bacillus anthracis mutants resistant to infection by phage AP50c (AP50(R)) exhibit a mucoid colony phenotype and secrete an extracellular matrix. METHODS: Here we utilized a Roche/454-based whole genome sequencing approach to identify mutations that are candidates for conferring AP50c phage resistance, followed by genetic deletion and complementation studies to validate the whole genome sequence data and demonstrate that the implicated gene is necessary for AP50c phage infection. RESULTS: Using whole genome sequence data, we mapped the relevant mutations in six AP50(R) strains to csaB. Eleven additional spontaneous mutants, isolated in two different genetic backgrounds, were screened by PCR followed by Sanger sequencing of the csaB gene. In each spontaneous mutant, we found either a non-synonymous substitution, a nonsense mutation, or a frame-shift mutation caused by single nucleotide polymorphisms or a 5 base pair insertion in csaB. All together, 5 and 12 of the 17 spontaneous mutations are predicted to yield altered full length and truncated CsaB proteins respectively. As expected from these results, a targeted deletion or frame-shift mutations introduced into csaB in a different genetic background, in a strain not exposed to AP50c, resulted in a phage resistant phenotype. Also, substitution of a highly conserved histidine residue with an alanine residue (H270A) in CsaB resulted in phage resistance, suggesting that a functional CsaB is necessary for phage sensitivity. Conversely, introduction of the wild type allele of csaB in cis into the csaB deletion mutant by homologous recombination or supplying the wild type CsaB protein in trans from a plasmid restored phage sensitivity. The csaB mutants accumulated cell wall material and appeared to have a defective S-layer, whereas these phenotypes were reverted in the complemented strains. CONCLUSIONS: Taken together, these data suggest an essential role for csaB in AP50c phage infection, most likely in phage adsorption. (The whole genome sequences generated from this study have been submitted to GenBank under SRA project ID: SRA023659.1 and sample IDs: AP50 R1: SRS113675.1, AP50 R2: SRS113676.1, AP50 R3: SRS113728.1, AP50 R4: SRS113733.1, AP50 R6: SRS113734.1, JB220 Parent: SRS150209.1, JB220 Mutant: SRS150211.1)

    Crystal Structure of the Hendra Virus Attachment G Glycoprotein Bound to a Potent Cross-Reactive Neutralizing Human Monoclonal Antibody

    Get PDF
    The henipaviruses, represented by Hendra (HeV) and Nipah (NiV) viruses are highly pathogenic zoonotic paramyxoviruses with uniquely broad host tropisms responsible for repeated outbreaks in Australia, Southeast Asia, India and Bangladesh. The high morbidity and mortality rates associated with infection and lack of licensed antiviral therapies make the henipaviruses a potential biological threat to humans and livestock. Henipavirus entry is initiated by the attachment of the G envelope glycoprotein to host cell membrane receptors. Previously, henipavirus-neutralizing human monoclonal antibodies (hmAb) have been isolated using the HeV-G glycoprotein and a human naïve antibody library. One cross-reactive and receptor-blocking hmAb (m102.4) was recently demonstrated to be an effective post-exposure therapy in two animal models of NiV and HeV infection, has been used in several people on a compassionate use basis, and is currently in development for use in humans. Here, we report the crystal structure of the complex of HeV-G with m102.3, an m102.4 derivative, and describe NiV and HeV escape mutants. This structure provides detailed insight into the mechanism of HeV and NiV neutralization by m102.4, and serves as a blueprint for further optimization of m102.4 as a therapeutic agent and for the development of entry inhibitors and vaccines

    The evolving SARS-CoV-2 epidemic in Africa: Insights from rapidly expanding genomic surveillance

    Get PDF
    INTRODUCTION Investment in Africa over the past year with regard to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing has led to a massive increase in the number of sequences, which, to date, exceeds 100,000 sequences generated to track the pandemic on the continent. These sequences have profoundly affected how public health officials in Africa have navigated the COVID-19 pandemic. RATIONALE We demonstrate how the first 100,000 SARS-CoV-2 sequences from Africa have helped monitor the epidemic on the continent, how genomic surveillance expanded over the course of the pandemic, and how we adapted our sequencing methods to deal with an evolving virus. Finally, we also examine how viral lineages have spread across the continent in a phylogeographic framework to gain insights into the underlying temporal and spatial transmission dynamics for several variants of concern (VOCs). RESULTS Our results indicate that the number of countries in Africa that can sequence the virus within their own borders is growing and that this is coupled with a shorter turnaround time from the time of sampling to sequence submission. Ongoing evolution necessitated the continual updating of primer sets, and, as a result, eight primer sets were designed in tandem with viral evolution and used to ensure effective sequencing of the virus. The pandemic unfolded through multiple waves of infection that were each driven by distinct genetic lineages, with B.1-like ancestral strains associated with the first pandemic wave of infections in 2020. Successive waves on the continent were fueled by different VOCs, with Alpha and Beta cocirculating in distinct spatial patterns during the second wave and Delta and Omicron affecting the whole continent during the third and fourth waves, respectively. Phylogeographic reconstruction points toward distinct differences in viral importation and exportation patterns associated with the Alpha, Beta, Delta, and Omicron variants and subvariants, when considering both Africa versus the rest of the world and viral dissemination within the continent. Our epidemiological and phylogenetic inferences therefore underscore the heterogeneous nature of the pandemic on the continent and highlight key insights and challenges, for instance, recognizing the limitations of low testing proportions. We also highlight the early warning capacity that genomic surveillance in Africa has had for the rest of the world with the detection of new lineages and variants, the most recent being the characterization of various Omicron subvariants. CONCLUSION Sustained investment for diagnostics and genomic surveillance in Africa is needed as the virus continues to evolve. This is important not only to help combat SARS-CoV-2 on the continent but also because it can be used as a platform to help address the many emerging and reemerging infectious disease threats in Africa. In particular, capacity building for local sequencing within countries or within the continent should be prioritized because this is generally associated with shorter turnaround times, providing the most benefit to local public health authorities tasked with pandemic response and mitigation and allowing for the fastest reaction to localized outbreaks. These investments are crucial for pandemic preparedness and response and will serve the health of the continent well into the 21st century
    corecore