8 research outputs found
VGEA: an RNA viral assembly toolkit.
Next generation sequencing (NGS)-based studies have vastly increased our understanding of viral diversity. Viral sequence data obtained from NGS experiments are a rich source of information, these data can be used to study their epidemiology, evolution, transmission patterns, and can also inform drug and vaccine design. Viral genomes, however, represent a great challenge to bioinformatics due to their high mutation rate and forming quasispecies in the same infected host, bringing about the need to implement advanced bioinformatics tools to assemble consensus genomes well-representative of the viral population circulating in individual patients. Many tools have been developed to preprocess sequencing reads, carry-out de novo or reference-assisted assembly of viral genomes and assess the quality of the genomes obtained. Most of these tools however exist as standalone workflows and usually require huge computational resources. Here we present (Viral Genomes Easily Analyzed), a Snakemake workflow for analyzing RNA viral genomes. VGEA enables users to map sequencing reads to the human genome to remove human contaminants, split bam files into forward and reverse reads, carry out de novo assembly of forward and reverse reads to generate contigs, pre-process reads for quality and contamination, map reads to a reference tailored to the sample using corrected contigs supplemented by the user's choice of reference sequences and evaluate/compare genome assemblies. We designed a project with the aim of creating a flexible, easy-to-use and all-in-one pipeline from existing/stand-alone bioinformatics tools for viral genome analysis that can be deployed on a personal computer. VGEA was built on the Snakemake workflow management system and utilizes existing tools for each step: fastp (Chen et al., 2018) for read trimming and read-level quality control, BWA (Li & Durbin, 2009) for mapping sequencing reads to the human reference genome, SAMtools (Li et al., 2009) for extracting unmapped reads and also for splitting bam files into fastq files, IVA (Hunt et al., 2015) for de novo assembly to generate contigs, shiver (Wymant et al., 2018) to pre-process reads for quality and contamination, then map to a reference tailored to the sample using corrected contigs supplemented with the user's choice of existing reference sequences, SeqKit (Shen et al., 2016) for cleaning shiver assembly for QUAST, QUAST (Gurevich et al., 2013) to evaluate/assess the quality of genome assemblies and MultiQC (Ewels et al., 2016) for aggregation of the results from fastp, BWA and QUAST. Our pipeline was successfully tested and validated with SARS-CoV-2 (n = 20), HIV-1 (n = 20) and Lassa Virus (n = 20) datasets all of which have been made publicly available. VGEA is freely available on GitHub at: https://github.com/pauloluniyi/VGEA under the GNU General Public License
Emergence and spread of two SARS-CoV-2 variants of interest in Nigeria.
Identifying the dissemination patterns and impacts of a virus of economic or health importance during a pandemic is crucial, as it informs the public on policies for containment in order to reduce the spread of the virus. In this study, we integrated genomic and travel data to investigate the emergence and spread of the SARS-CoV-2 B.1.1.318 and B.1.525 (Eta) variants of interest in Nigeria and the wider Africa region. By integrating travel data and phylogeographic reconstructions, we find that these two variants that arose during the second wave in Nigeria emerged from within Africa, with the B.1.525 from Nigeria, and then spread to other parts of the world. Data from this study show how regional connectivity of Nigeria drove the spread of these variants of interest to surrounding countries and those connected by air-traffic. Our findings demonstrate the power of genomic analysis when combined with mobility and epidemiological data to identify the drivers of transmission, as bidirectional transmission within and between African nations are grossly underestimated as seen in our import risk index estimates
The evolving SARS-CoV-2 epidemic in Africa: Insights from rapidly expanding genomic surveillance.
Investment in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing in Africa over the past year has led to a major increase in the number of sequences that have been generated and used to track the pandemic on the continent, a number that now exceeds 100,000 genomes. Our results show an increase in the number of African countries that are able to sequence domestically and highlight that local sequencing enables faster turnaround times and more-regular routine surveillance. Despite limitations of low testing proportions, findings from this genomic surveillance study underscore the heterogeneous nature of the pandemic and illuminate the distinct dispersal dynamics of variants of concern-particularly Alpha, Beta, Delta, and Omicron-on the continent. Sustained investment for diagnostics and genomic surveillance in Africa is needed as the virus continues to evolve while the continent faces many emerging and reemerging infectious disease threats. These investments are crucial for pandemic preparedness and response and will serve the health of the continent well into the 21st century
The evolving SARS-CoV-2 epidemic in Africa: Insights from rapidly expanding genomic surveillance
INTRODUCTION
Investment in Africa over the past year with regard to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing has led to a massive increase in the number of sequences, which, to date, exceeds 100,000 sequences generated to track the pandemic on the continent. These sequences have profoundly affected how public health officials in Africa have navigated the COVID-19 pandemic.
RATIONALE
We demonstrate how the first 100,000 SARS-CoV-2 sequences from Africa have helped monitor the epidemic on the continent, how genomic surveillance expanded over the course of the pandemic, and how we adapted our sequencing methods to deal with an evolving virus. Finally, we also examine how viral lineages have spread across the continent in a phylogeographic framework to gain insights into the underlying temporal and spatial transmission dynamics for several variants of concern (VOCs).
RESULTS
Our results indicate that the number of countries in Africa that can sequence the virus within their own borders is growing and that this is coupled with a shorter turnaround time from the time of sampling to sequence submission. Ongoing evolution necessitated the continual updating of primer sets, and, as a result, eight primer sets were designed in tandem with viral evolution and used to ensure effective sequencing of the virus. The pandemic unfolded through multiple waves of infection that were each driven by distinct genetic lineages, with B.1-like ancestral strains associated with the first pandemic wave of infections in 2020. Successive waves on the continent were fueled by different VOCs, with Alpha and Beta cocirculating in distinct spatial patterns during the second wave and Delta and Omicron affecting the whole continent during the third and fourth waves, respectively. Phylogeographic reconstruction points toward distinct differences in viral importation and exportation patterns associated with the Alpha, Beta, Delta, and Omicron variants and subvariants, when considering both Africa versus the rest of the world and viral dissemination within the continent. Our epidemiological and phylogenetic inferences therefore underscore the heterogeneous nature of the pandemic on the continent and highlight key insights and challenges, for instance, recognizing the limitations of low testing proportions. We also highlight the early warning capacity that genomic surveillance in Africa has had for the rest of the world with the detection of new lineages and variants, the most recent being the characterization of various Omicron subvariants.
CONCLUSION
Sustained investment for diagnostics and genomic surveillance in Africa is needed as the virus continues to evolve. This is important not only to help combat SARS-CoV-2 on the continent but also because it can be used as a platform to help address the many emerging and reemerging infectious disease threats in Africa. In particular, capacity building for local sequencing within countries or within the continent should be prioritized because this is generally associated with shorter turnaround times, providing the most benefit to local public health authorities tasked with pandemic response and mitigation and allowing for the fastest reaction to localized outbreaks. These investments are crucial for pandemic preparedness and response and will serve the health of the continent well into the 21st century
Streamlined inactivation, amplification, and Cas13-based detection of SARS-CoV-2
The COVID-19 pandemic has highlighted that new diagnostic technologies are essential for controlling disease transmission. Here, we develop SHINE (Streamlined Highlighting of Infections to Navigate Epidemics), a sensitive and specific diagnostic tool that can detect SARS-CoV-2 RNA from unextracted samples. We identify the optimal conditions to allow RPA-based amplification and Cas13-based detection to occur in a single step, simplifying assay preparation and reducing run-time. We improve HUDSON to rapidly inactivate viruses in nasopharyngeal swabs and saliva in 10 min. SHINE’s results can be visualized with an in-tube fluorescent readout — reducing contamination risk as amplification reaction tubes remain sealed — and interpreted by a companion smartphone application. We validate SHINE on 50 nasopharyngeal patient samples, demonstrating 90% sensitivity and 100% specificity compared to RT-qPCR with a sample-to-answer time of 50 min. SHINE has the potential to be used outside of hospitals and clinical laboratories, greatly enhancing diagnostic capabilities
Deployable CRISPR-Cas13a diagnostic tools to detect and report Ebola and Lassa virus cases in real-time
Outbreaks of viral hemorrhagic fevers highlight the need for sensitive, field-deployable diagnostics. Here the authors present a CRISPR-based SHERLOCK platform with field protocol and mobile app for Ebola and Lassa fever outbreaks
The Origins and Future of Sentinel: An Early-Warning System for Pandemic Preemption and Response
While investigating a signal of adaptive evolution in humans at the gene LARGE, we encountered an intriguing finding by Dr. Stefan Kunz that the gene plays a critical role in Lassa virus binding and entry. This led us to pursue field work to test our hypothesis that natural selection acting on LARGE—detected in the Yoruba population of Nigeria—conferred resistance to Lassa Fever in some West African populations. As we delved further, we conjectured that the “emerging” nature of recently discovered diseases like Lassa fever is related to a newfound capacity for detection, rather than a novel viral presence, and that humans have in fact been exposed to the viruses that cause such diseases for much longer than previously suspected. Dr. Stefan Kunz’s critical efforts not only laid the groundwork for this discovery, but also inspired and catalyzed a series of events that birthed Sentinel, an ambitious and large-scale pandemic prevention effort in West Africa. Sentinel aims to detect and characterize deadly pathogens before they spread across the globe, through implementation of its three fundamental pillars: Detect, Connect, and Empower. More specifically, Sentinel is designed to detect known and novel infections rapidly, connect and share information in real time to identify emerging threats, and empower the public health community to improve pandemic preparedness and response anywhere in the world. We are proud to dedicate this work to Stefan Kunz, and eagerly invite new collaborators, experts, and others to join us in our efforts