26 research outputs found
Recommended from our members
The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
Shotgun metagenomics is increasingly used to characterise microbial communities, particularly for the investigation of antimicrobial resistance (AMR) in different animal and environmental contexts. There are many different approaches for inferring the taxonomic composition and AMR gene content of complex community samples from shotgun metagenomic data, but there has been little work establishing the optimum sequencing depth, data processing and analysis methods for these samples. In this study we used shotgun metagenomics and sequencing of cultured isolates from the same samples to address these issues. We sampled three potential environmental AMR gene reservoirs (pig caeca, river sediment, effluent) and sequenced samples with shotgun metagenomics at high depth (~ 200 million reads per sample). Alongside this, we cultured single-colony isolates of Enterobacteriaceae from the same samples and used hybrid sequencing (short- and long-reads) to create high- quality assemblies for comparison to the metagenomic data. To automate data processing, we developed an open- source software pipeline, ‘ResPipe’
Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes
Illumina sequencing allows rapid, cheap and accurate whole genome bacterial analyses, but short reads (<300 bp) do not usually enable complete genome assembly. Long-read sequencing greatly assists with resolving complex bacterial genomes, particularly when combined with short-read Illumina data (hybrid assembly). However, it is not clear how different long-read sequencing methods affect hybrid assembly accuracy. Relative automation of the assembly process is also crucial to facilitating high-throughput complete bacterial genome reconstruction, avoiding multiple bespoke filtering and data manipulation steps. In this study, we compared hybrid assemblies for 20 bacterial isolates, including two reference strains, using Illumina sequencing and long reads from either Oxford Nanopore Technologies (ONT) or SMRT Pacific Biosciences (PacBio) sequencing platforms. We chose isolates from the family Enterobacteriaceae, as these frequently have highly plastic, repetitive genetic structures, and complete genome reconstruction for these species is relevant for a precise understanding of the epidemiology of antimicrobial resistance. We de novo assembled genomes using the hybrid assembler Unicycler and compared different read processing strategies, as well as comparing to long-read-only assembly with Flye followed by short-read polishing with Pilon. Hybrid assembly with either PacBio or ONT reads facilitated high-quality genome reconstruction, and was superior to the long-read assembly and polishing approach evaluated with respect to accuracy and completeness. Combining ONT and Illumina reads fully resolved most genomes without additional manual steps, and at a lower consumables cost per isolate in our setting. Automated hybrid assembly is a powerful tool for complete and accurate bacterial genome assembly
SARS-CoV-2 RNA detected in blood products from patients with COVID-19 is not associated with infectious virus
Background: Laboratory diagnosis of SARS-CoV-2 infection (the cause of COVID-19) uses PCR to detect viral RNA (vRNA) in respiratory samples. SARS-CoV-2 RNA has also been detected in other sample types, but there is limited understanding of the clinical or laboratory significance of its detection in blood. Methods: We undertook a systematic literature review to assimilate the evidence for the frequency of vRNA in blood, and to identify associated clinical characteristics. We performed RT-PCR in serum samples from a UK clinical cohort of acute and convalescent COVID-19 cases (n=212), together with convalescent plasma samples collected by NHS Blood and Transplant (NHSBT) (n=462 additional samples). To determine whether PCR-positive blood samples could pose an infection risk, we attempted virus isolation from a subset of RNA-positive samples. Results: We identified 28 relevant studies, reporting SARS-CoV-2 RNA in 0-76% of blood samples; pooled estimate 10% (95%CI 5-18%). Among serum samples from our clinical cohort, 27/212 (12.7%) had SARS-CoV-2 RNA detected by RT-PCR. RNA detection occurred in samples up to day 20 post symptom onset, and was associated with more severe disease (multivariable odds ratio 7.5). Across all samples collected ≥28 days post symptom onset, 0/494 (0%, 95%CI 0-0.7%) had vRNA detected. Among our PCR-positive samples, cycle threshold (ct) values were high (range 33.5-44.8), suggesting low vRNA copy numbers. PCR-positive sera inoculated into cell culture did not produce any cytopathic effect or yield an increase in detectable SARS-CoV-2 RNA. Conclusions: vRNA was detectable at low viral loads in a minority of serum samples collected in acute infection, but was not associated with infectious SARS-CoV-2 (within the limitations of the assays used). This work helps to inform biosafety precautions for handling blood products from patients with current or previous COVID-19
Recommended from our members
Genomic network analysis of environmental and livestock F-type plasmid populations
F-type plasmids are diverse and of great clinical significance, often carrying genes conferring antimicrobial resistance (AMR) such as extended-spectrum β-lactamases, particularly in Enterobacterales. Organising this plasmid diversity is challenging, and current knowledge is largely based on plasmids from clinical settings. Here, we present a network community analysis of a large survey of F-type plasmids from environmental (influent, effluent, and upstream/downstream waterways surrounding wastewater treatment works) and livestock settings. We use a tractable and scalable methodology to examine the relationship between plasmid metadata and network communities. This reveals how niche (sampling compartment and host genera) partition and shape plasmid diversity. We also perform pangenome-style analyses on network communities. We show that such communities define unique combinations of core genes, with limited overlap. Building plasmid phylogenies based on alignments of these core genes, we demonstrate that plasmid accessory function is closely linked to core gene content. Taken together, our results suggest that stable F-type plasmid backbone structures can persist in environmental settings while allowing dramatic variation in accessory gene content that may be linked to niche adaptation. The association of F-type plasmids with AMR likely reflects their suitability for rapid niche adaptation
Recommended from our members
Niche and local geography shape the pangenome of wastewater- and livestock-associated Enterobacteriaceae
Escherichia coli and other Enterobacteriaceae are diverse species with “open” pangenomes, where genes move intra- and interspecies via horizontal gene transfer. However, most analyses focus on clinical isolates. The pangenome dynamics of natural populations remain understudied, despite their suggested role as reservoirs for antimicrobial resistance (AMR) genes. Here, we analyze near-complete genomes for 827 Enterobacteriaceae (553 Escherichia and 274 non-Escherichia spp.) with 2292 circularized plasmids in total, collected from 19 locations (livestock farms and wastewater treatment works in the United Kingdom) within a 30-km radius at three time points over a year. We find different dynamics for chromosomal and plasmid-borne genes. Plasmids have a higher burden of AMR genes and insertion sequences, and AMR-gene-carrying plasmids show evidence of being under stronger selective pressure. Environmental niche and local geography both play a role in shaping plasmid dynamics. Our results highlight the importance of local strategies for controlling the spread of AMR
A haemagglutination test for rapid detection of antibodies to SARS-CoV-2
Serological detection of antibodies to SARS-CoV-2 is essential for establishing rates of seroconversion in populations, and for seeking evidence for a level of antibody that may be protective against COVID-19 disease. Several high-performance commercial tests have been described, but these require centralised laboratory facilities that are comparatively expensive, and therefore not available universally. Red cell agglutination tests do not require special equipment, are read by eye, have short development times, low cost and can be applied at the Point of Care. Here we describe a quantitative Haemagglutination test (HAT) for the detection of antibodies to the receptor binding domain of the SARS-CoV-2 spike protein. The HAT has a sensitivity of 90% and specificity of 99% for detection of antibodies after a PCR diagnosed infection. We will supply aliquots of the test reagent sufficient for ten thousand test wells free of charge to qualified research groups anywhere in the world
Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity.
Global dispersal and increasing frequency of the SARS-CoV-2 spike protein variant D614G are suggestive of a selective advantage but may also be due to a random founder effect. We investigate the hypothesis for positive selection of spike D614G in the United Kingdom using more than 25,000 whole genome SARS-CoV-2 sequences. Despite the availability of a large dataset, well represented by both spike 614 variants, not all approaches showed a conclusive signal of positive selection. Population genetic analysis indicates that 614G increases in frequency relative to 614D in a manner consistent with a selective advantage. We do not find any indication that patients infected with the spike 614G variant have higher COVID-19 mortality or clinical severity, but 614G is associated with higher viral load and younger age of patients. Significant differences in growth and size of 614G phylogenetic clusters indicate a need for continued study of this variant
Community prevalence of SARS-CoV-2 in England from April to November, 2020: results from the ONS Coronavirus Infection Survey
Background: Decisions about the continued need for control measures to contain the spread of severe acute respiratory
syndrome coronavirus 2 (SARS-CoV-2) rely on accurate and up-to-date information about the number of people
testing positive for SARS-CoV-2 and risk factors for testing positive. Existing surveillance systems are generally not
based on population samples and are not longitudinal in design.
Methods: Samples were collected from individuals aged 2 years and older living in private households in England that
were randomly selected from address lists and previous Office for National Statistics surveys in repeated crosssectional household surveys with additional serial sampling and longitudinal follow-up. Participants completed a
questionnaire and did nose and throat self-swabs. The percentage of individuals testing positive for SARS-CoV-2 RNA
was estimated over time by use of dynamic multilevel regression and poststratification, to account for potential
residual non-representativeness. Potential changes in risk factors for testing positive over time were also assessed.
The study is registered with the ISRCTN Registry, ISRCTN21086382.
Findings: Between April 26 and Nov 1, 2020, results were available from 1 191 170 samples from 280327 individuals; 5231
samples were positive overall, from 3923 individuals. The percentage of people testing positive for SARS-CoV-2 changed
substantially over time, with an initial decrease between April 26 and June 28, 2020, from 0·40% (95% credible interval
0·29–0·54) to 0·06% (0·04–0·07), followed by low levels during July and August, 2020, before substantial increases at
the end of August, 2020, with percentages testing positive above 1% from the end of October, 2020. Having a patient facing role and working outside your home were important risk factors for testing positive for SARS-CoV-2 at the end of
the first wave (April 26 to June 28, 2020), but not in the second wave (from the end of August to Nov 1, 2020). Age (young
adults, particularly those aged 17–24 years) was an important initial driver of increased positivity rates in the second
wave. For example, the estimated percentage of individuals testing positive was more than six times higher in those
aged 17–24 years than in those aged 70 years or older at the end of September, 2020. A substantial proportion of
infections were in individuals not reporting symptoms around their positive test (45–68%, dependent on calendar time.
Interpretation: Important risk factors for testing positive for SARS-CoV-2 varied substantially between the part of the
first wave that was captured by the study (April to June, 2020) and the first part of the second wave of increased
positivity rates (end of August to Nov 1, 2020), and a substantial proportion of infections were in individuals not
reporting symptoms, indicating that continued monitoring for SARS-CoV-2 in the community will be important for
managing the COVID-19 pandemic moving forwards
The 2021 WHO catalogue of Mycobacterium tuberculosis complex mutations associated with drug resistance: a genotypic analysis.
Background: Molecular diagnostics are considered the most promising route to achievement of rapid, universal drug susceptibility testing for Mycobacterium tuberculosis complex (MTBC). We aimed to generate a WHO-endorsed catalogue of mutations to serve as a global standard for interpreting molecular information for drug resistance prediction. Methods: In this systematic analysis, we used a candidate gene approach to identify mutations associated with resistance or consistent with susceptibility for 13 WHO-endorsed antituberculosis drugs. We collected existing worldwide MTBC whole-genome sequencing data and phenotypic data from academic groups and consortia, reference laboratories, public health organisations, and published literature. We categorised phenotypes as follows: methods and critical concentrations currently endorsed by WHO (category 1); critical concentrations previously endorsed by WHO for those methods (category 2); methods or critical concentrations not currently endorsed by WHO (category 3). For each mutation, we used a contingency table of binary phenotypes and presence or absence of the mutation to compute positive predictive value, and we used Fisher's exact tests to generate odds ratios and Benjamini-Hochberg corrected p values. Mutations were graded as associated with resistance if present in at least five isolates, if the odds ratio was more than 1 with a statistically significant corrected p value, and if the lower bound of the 95% CI on the positive predictive value for phenotypic resistance was greater than 25%. A series of expert rules were applied for final confidence grading of each mutation. Findings: We analysed 41 137 MTBC isolates with phenotypic and whole-genome sequencing data from 45 countries. 38 215 MTBC isolates passed quality control steps and were included in the final analysis. 15 667 associations were computed for 13 211 unique mutations linked to one or more drugs. 1149 (7·3%) of 15 667 mutations were classified as associated with phenotypic resistance and 107 (0·7%) were deemed consistent with susceptibility. For rifampicin, isoniazid, ethambutol, fluoroquinolones, and streptomycin, the mutations' pooled sensitivity was more than 80%. Specificity was over 95% for all drugs except ethionamide (91·4%), moxifloxacin (91·6%) and ethambutol (93·3%). Only two resistance mutations were identified for bedaquiline, delamanid, clofazimine, and linezolid as prevalence of phenotypic resistance was low for these drugs. Interpretation: We present the first WHO-endorsed catalogue of molecular targets for MTBC drug susceptibility testing, which is intended to provide a global standard for resistance interpretation. The existence of this catalogue should encourage the implementation of molecular diagnostics by national tuberculosis programmes. Funding: Unitaid, Wellcome Trust, UK Medical Research Council, and Bill and Melinda Gates Foundation
Evaluating the Effects of SARS-CoV-2 Spike Mutation D614G on Transmissibility and Pathogenicity
Global dispersal and increasing frequency of the SARS-CoV-2 spike protein variant D614G are suggestive of a selective advantage but may also be due to a random founder effect. We investigate the hypothesis for positive selection of spike D614G in the United Kingdom using more than 25,000 whole genome SARS-CoV-2 sequences. Despite the availability of a large dataset, well represented by both spike 614 variants, not all approaches showed a conclusive signal of positive selection. Population genetic analysis indicates that 614G increases in frequency relative to 614D in a manner consistent with a selective advantage. We do not find any indication that patients infected with the spike 614G variant have higher COVID-19 mortality or clinical severity, but 614G is associated with higher viral load and younger age of patients. Significant differences in growth and size of 614G phylogenetic clusters indicate a need for continued study of this variant