39 research outputs found
Recurrent SARS-CoV-2 mutations in immunodeficient patients
Long-term severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections in immunodeficient patients are an important source of variation for the virus but are understudied. Many case studies have been published which describe one or a small number of long-term infected individuals but no study has combined these sequences into a cohesive dataset. This work aims to rectify this and study the genomics of this patient group through a combination of literature searches as well as identifying new case series directly from the COVID-19 Genomics UK (COG-UK) dataset. The spike gene receptor-binding domain and N-terminal domain (NTD) were identified as mutation hotspots. Numerous mutations associated with variants of concern were observed to emerge recurrently. Additionally a mutation in the envelope gene, T30I was determined to be the second most frequent recurrently occurring mutation arising in persistent infections. A high proportion of recurrent mutations in immunodeficient individuals are associated with ACE2 affinity, immune escape, or viral packaging optimisation.
There is an apparent selective pressure for mutations that aid cell–cell transmission within the host or persistence which are often different from mutations that aid inter-host transmission, although the fact that multiple recurrent de novo mutations are considered defining for variants of concern strongly indicates that this potential source of novel variants should not be discounted
A phylogenetics and variant calling pipeline to support SARS-CoV-2 genomic epidemiology in the UK
In response to the escalating SARS-CoV-2 pandemic, in March 2020 the COVID-19 Genomics UK (COG-UK) consortium was established to enable national-scale genomic surveillance in the United Kingdom. By the end of 2020, 49% of all SARS-CoV-2 genome sequences globally had been generated as part of the COG-UK programme and to date this system has generated more than 3 million SARS-CoV-2 genomes. Rapidly and reliably analysing this unprecedented number of genomes was an enormous challenge. To fulfil this need and to inform public health decision making, we developed a centralised pipeline that performs quality control, alignment and variant calling, and provides the global phylogenetic context of sequences. We present this pipeline and describe how we tailored it as the pandemic progressed to scale with the increasing amounts of data and to provide the most relevant analyses on a daily basis
CLIMB (the Cloud Infrastructure for Microbial Bioinformatics):an online resource for the medical microbiology community
The increasing availability and decreasing cost of high-throughput sequencing has transformed academic medical microbiology, delivering an explosion in available genomes while also driving advances in bioinformatics. However, many microbiologists are unable to exploit the resulting large genomics datasets because they do not have access to relevant computational resources and to an appropriate bioinformatics infrastructure. Here, we present the Cloud Infrastructure for Microbial Bioinformatics (CLIMB) facility, a shared computing infrastructure that has been designed from the ground up to provide an environment where microbiologists can share and reuse methods and data
CLIMB-COVID: continuous integration supporting decentralised sequencing for SARS-CoV-2 genomic surveillance.
Funder: Wellcome TrustIn response to the ongoing SARS-CoV-2 pandemic in the UK, the COVID-19 Genomics UK (COG-UK) consortium was formed to rapidly sequence SARS-CoV-2 genomes as part of a national-scale genomic surveillance strategy. The network consists of universities, academic institutes, regional sequencing centres and the four UK Public Health Agencies. We describe the development and deployment of CLIMB-COVID, an encompassing digital infrastructure to address the challenge of collecting and integrating both genomic sequencing data and sample-associated metadata produced across the COG-UK network
Generation and transmission of interlineage recombinants in the SARS-CoV-2 pandemic.
We present evidence for multiple independent origins of recombinant SARS-CoV-2 viruses sampled from late 2020 and early 2021 in the United Kingdom. Their genomes carry single-nucleotide polymorphisms and deletions that are characteristic of the B.1.1.7 variant of concern but lack the full complement of lineage-defining mutations. Instead, the remainder of their genomes share contiguous genetic variation with non-B.1.1.7 viruses circulating in the same geographic area at the same time as the recombinants. In four instances, there was evidence for onward transmission of a recombinant-origin virus, including one transmission cluster of 45 sequenced cases over the course of 2 months. The inferred genomic locations of recombination breakpoints suggest that every community-transmitted recombinant virus inherited its spike region from a B.1.1.7 parental virus, consistent with a transmission advantage for B.1.1.7's set of mutations.The COG-UK Consortium is supported by funding from the Medical Research Council (MRC) part of UK Research & Innovation (UKRI), the National Institute of Health Research (NIHR) (MC_PC_19027), and Genome Research Limited, operating as the Wellcome Sanger Institute. O.G.P. was supported by the Oxford Martin School. J.T.M., R.M.C., N.J.L., and A.R. acknowledge the support of the Wellcome Trust (Collaborators Award 206298/Z/17/Z – ARTIC network). D.L.R. acknowledges the support of the MRC (MC_UU_12014/12) and the Wellcome Trust (220977/Z/20/Z). E.S. and A.R. are supported by the European Research Council (grant agreement no. 725422 – ReservoirDOCS). T.R.C. and N.J.L. acknowledge the support of the MRC, which provided the funding for the MRC CLIMB infrastructure used to analyze, store, and share the UK sequencing dataset (MR/L015080/1 and MR/T030062/1). The samples sequenced in Wales were sequenced partly using funding provided by the Welsh Government
Rapid in-country sequencing of whole virus genomes to inform rabies elimination programmes.
Genomic surveillance is an important aspect of contemporary disease management but has yet to be used routinely to monitor endemic disease transmission and control in low- and middle-income countries. Rabies is an almost invariably fatal viral disease that causes a large public health and economic burden in Asia and Africa, despite being entirely vaccine preventable. With policy efforts now directed towards achieving a global goal of zero dog-mediated human rabies deaths by 2030, establishing effective surveillance tools is critical. Genomic data can provide important and unique insights into rabies spread and persistence that can direct control efforts. However, capacity for genomic research in low- and middle-income countries is held back by limited laboratory infrastructure, cost, supply chains and other logistical challenges. Here we present and validate an end-to-end workflow to facilitate affordable whole genome sequencing for rabies surveillance utilising nanopore technology. We used this workflow in Kenya, Tanzania and the Philippines to generate rabies virus genomes in two to three days, reducing costs to approximately £60 per genome. This is over half the cost of metagenomic sequencing previously conducted for Tanzanian samples, which involved exporting samples to the UK and a three- to six-month lag time. Ongoing optimization of workflows are likely to reduce these costs further. We also present tools to support routine whole genome sequencing and interpretation for genomic surveillance. Moreover, combined with training workshops to empower scientists in-country, we show that local sequencing capacity can be readily established and sustainable, negating the common misperception that cutting-edge genomic research can only be conducted in high resource laboratories. More generally, we argue that the capacity to harness genomic data is a game-changer for endemic disease surveillance and should precipitate a new wave of researchers from low- and middle-income countries
Genomics-informed outbreak investigations of SARS-CoV-2 using civet
The scale of data produced during the SARS-CoV-2 pandemic has been unprecedented, with more than 13 million sequences shared publicly at the time of writing. This wealth of sequence data provides important context for interpreting local outbreaks. However, placing sequences of interest into national and international context is difficult given the size of the global dataset. Often outbreak investigations and genomic surveillance efforts require running similar analyses again and again on the latest dataset and producing reports. We developed civet (cluster investigation and virus epidemiology tool) to aid these routine analyses and facilitate virus outbreak investigation and surveillance. Civet can place sequences of interest in the local context of background diversity, resolving the query into different ’catchments’ and presenting the phylogenetic results alongside metadata in an interactive, distributable report. Civet can be used on a fine scale for clinical outbreak investigation, for local surveillance and cluster discovery, and to routinely summarise the virus diversity circulating on a national level. Civet reports have helped researchers and public health bodies feedback genomic information in the appropriate context within a timeframe that is useful for public health
Genomics-informed outbreak investigations of SARS-CoV-2 using civet
The scale of data produced during the SARS-CoV-2 pandemic has been unprecedented, with more than 13 million sequences shared publicly at the time of writing. This wealth of sequence data provides important context for interpreting local outbreaks. However, placing sequences of interest into national and international context is difficult given the size of the global dataset. Often outbreak investigations and genomic surveillance efforts require running similar analyses again and again on the latest dataset and producing reports. We developed civet (cluster investigation and virus epidemiology tool) to aid these routine analyses and facilitate virus outbreak investigation and surveillance. Civet can place sequences of interest in the local context of background diversity, resolving the query into different ’catchments’ and presenting the phylogenetic results alongside metadata in an interactive, distributable report. Civet can be used on a fine scale for clinical outbreak investigation, for local surveillance and cluster discovery, and to routinely summarise the virus diversity circulating on a national level. Civet reports have helped researchers and public health bodies feedback genomic information in the appropriate context within a timeframe that is useful for public health
Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity.
Global dispersal and increasing frequency of the SARS-CoV-2 spike protein variant D614G are suggestive of a selective advantage but may also be due to a random founder effect. We investigate the hypothesis for positive selection of spike D614G in the United Kingdom using more than 25,000 whole genome SARS-CoV-2 sequences. Despite the availability of a large dataset, well represented by both spike 614 variants, not all approaches showed a conclusive signal of positive selection. Population genetic analysis indicates that 614G increases in frequency relative to 614D in a manner consistent with a selective advantage. We do not find any indication that patients infected with the spike 614G variant have higher COVID-19 mortality or clinical severity, but 614G is associated with higher viral load and younger age of patients. Significant differences in growth and size of 614G phylogenetic clusters indicate a need for continued study of this variant
Exponential growth, high prevalence of SARS-CoV-2, and vaccine effectiveness associated with the Delta variant
SARS-CoV-2 infections were rising during early summer 2021 in many countries associated with the Delta variant. We assessed RT-PCR swab-positivity in the REal-time Assessment of Community Transmission-1 (REACT-1) study in England. We observed sustained exponential growth with average doubling time (June-July 2021) of 25 days driven by complete replacement of Alpha variant by Delta, and by high prevalence at younger less-vaccinated ages. Unvaccinated people were three times more likely than double-vaccinated people to test positive. However, after adjusting for age and other variables, vaccine effectiveness for double-vaccinated people was estimated at between ~50% and ~60% during this period in England. Increased social mixing in the presence of Delta had the potential to generate sustained growth in infections, even at high levels of vaccination