15 research outputs found

    Putative novel cps loci in a large global collection of pneumococci

    Get PDF
    The pneumococcus produces a polysaccharide capsule, encoded by the cps locus, that provides protection against phagocytosis and determines serotype. Nearly 100 serotypes have been identified with new serotypes still being discovered, especially in previously understudied regions. Here we present an analysis of the cps loci of more than 18  000 genomes from the Global Pneumococcal Sequencing (GPS) project with the aim of identifying novel cps loci with the potential to produce previously unrecognized capsule structures. Serotypes were assigned using whole genome sequence data and 66 of the approximately 100 known serotypes were included in the final dataset. Closer examination of each serotype’s sequences identified nine putative novel cps loci (9X, 11X, 16X, 18X1, 18X2, 18X3, 29X, 33X and 36X) found in ~2.6  % of the genomes. The large number and global distribution of GPS genomes provided an unprecedented opportunity to identify novel cps loci and consider their phylogenetic and geographical distribution. Nine putative novel cps loci were identified and examples of each will undergo subsequent structural and immunological analysis

    Visualizing variation within Global Pneumococcal Sequence Clusters (GPSCs) and country population snapshots to contextualize pneumococcal isolates.

    Get PDF
    Knowledge of pneumococcal lineages, their geographic distribution and antibiotic resistance patterns, can give insights into global pneumococcal disease. We provide interactive bioinformatic outputs to explore such topics, aiming to increase dissemination of genomic insights to the wider community, without the need for specialist training. We prepared 12 country-specific phylogenetic snapshots, and international phylogenetic snapshots of 73 common Global Pneumococcal Sequence Clusters (GPSCs) previously defined using PopPUNK, and present them in Microreact. Gene presence and absence defined using Roary, and recombination profiles derived from Gubbins are presented in Phandango for each GPSC. Temporal phylogenetic signal was assessed for each GPSC using BactDating. We provide examples of how such resources can be used. In our example use of a country-specific phylogenetic snapshot we determined that serotype 14 was observed in nine unrelated genetic backgrounds in South Africa. The international phylogenetic snapshot of GPSC9, in which most serotype 14 isolates from South Africa were observed, highlights that there were three independent sub-clusters represented by South African serotype 14 isolates. We estimated from the GPSC9-dated tree that the sub-clusters were each established in South Africa during the 1980s. We show how recombination plots allowed the identification of a 20 kb recombination spanning the capsular polysaccharide locus within GPSC97. This was consistent with a switch from serotype 6A to 19A estimated to have occured in the 1990s from the GPSC97-dated tree. Plots of gene presence/absence of resistance genes (tet, erm, cat) across the GPSC23 phylogeny were consistent with acquisition of a composite transposon. We estimated from the GPSC23-dated tree that the acquisition occurred between 1953 and 1975. Finally, we demonstrate the assignment of GPSC31 to 17 externally generated pneumococcal serotype 1 assemblies from Utah via Pathogenwatch. Most of the Utah isolates clustered within GPSC31 in a USA-specific clade with the most recent common ancestor estimated between 1958 and 1981. The resources we have provided can be used to explore to data, test hypothesis and generate new hypotheses. The accessible assignment of GPSCs allows others to contextualize their own collections beyond the data presented here

    Pneumococcal lineages associated with serotype replacement and antibiotic resistance in childhood invasive pneumococcal disease in the post-PCV13 era: an international whole-genome sequencing study.

    Get PDF
    BACKGROUND: Invasive pneumococcal disease remains an important health priority owing to increasing disease incidence caused by pneumococci expressing non-vaccine serotypes. We previously defined 621 Global Pneumococcal Sequence Clusters (GPSCs) by analysing 20 027 pneumococcal isolates collected worldwide and from previously published genomic data. In this study, we aimed to investigate the pneumococcal lineages behind the predominant serotypes, the mechanism of serotype replacement in disease, as well as the major pneumococcal lineages contributing to invasive pneumococcal disease in the post-vaccine era and their antibiotic resistant traits. METHODS: We whole-genome sequenced 3233 invasive pneumococcal disease isolates from laboratory-based surveillance programmes in Hong Kong (n=78), Israel (n=701), Malawi (n=226), South Africa (n=1351), The Gambia (n=203), and the USA (n=674). The genomes represented pneumococci from before and after pneumococcal conjugate vaccine (PCV) introductions and were from children younger than 3 years. We identified predominant serotypes by prevalence and their major contributing lineages in each country, and assessed any serotype replacement by comparing the incidence rate between the pre-PCV and PCV periods for Israel, South Africa, and the USA. We defined the status of a lineage as vaccine-type GPSC (≥50% 13-valent PCV [PCV13] serotypes) or non-vaccine-type GPSC (>50% non-PCV13 serotypes) on the basis of its initial serotype composition detected in the earliest vaccine period to measure their individual contribution toward serotype replacement in each country. Major pneumococcal lineages in the PCV period were identified by pooled incidence rate using a random effects model. FINDINGS: The five most prevalent serotypes in the PCV13 period varied between countries, with only serotypes 5, 12F, 15B/C, 19A, 33F, and 35B/D common to two or more countries. The five most prevalent serotypes in the PCV13 period varied between countries, with only serotypes 5, 12F, 15B/C, 19A, 33F, and 35B/D common to two or more countries. These serotypes were associated with more than one lineage, except for serotype 5 (GPSC8). Serotype replacement was mainly mediated by expansion of non-vaccine serotypes within vaccine-type GPSCs and, to a lesser extent, by increases in non-vaccine-type GPSCs. A globally spreading lineage, GPSC3, expressing invasive serotypes 8 in South Africa and 33F in the USA and Israel, was the most common lineage causing non-vaccine serotype invasive pneumococcal disease in the PCV13 period. We observed that same prevalent non-vaccine serotypes could be associated with distinctive lineages in different countries, which exhibited dissimilar antibiotic resistance profiles. In non-vaccine serotype isolates, we detected significant increases in the prevalence of resistance to penicillin (52 [21%] of 249 vs 169 [29%] of 575, p=0·0016) and erythromycin (three [1%] of 249 vs 65 [11%] of 575, p=0·0031) in the PCV13 period compared with the pre-PCV period. INTERPRETATION: Globally spreading lineages expressing invasive serotypes have an important role in serotype replacement, and emerging non-vaccine serotypes associated with different pneumococcal lineages in different countries might be explained by local antibiotic-selective pressures. Continued genomic surveillance of the dynamics of the pneumococcal population with increased geographical representation in the post-vaccine period will generate further knowledge for optimising future vaccine design. FUNDING: Bill & Melinda Gates Foundation, Wellcome Sanger Institute, and the US Centers for Disease Control

    Emergence of a multidrug-resistant and virulent Streptococcus pneumoniae lineage mediates serotype replacement after PCV13: an international whole-genome sequencing study.

    Get PDF
    BACKGROUND Serotype 24F is one of the emerging pneumococcal serotypes after the introduction of pneumococcal conjugate vaccine (PCV). We aimed to identify lineages driving the increase of serotype 24F in France and place these findings into a global context. METHODS Whole-genome sequencing was performed on a collection of serotype 24F pneumococci from asymptomatic colonisation (n=229) and invasive disease (n=190) isolates among individuals younger than 18 years in France, from 2003 to 2018. To provide a global context, we included an additional collection of 24F isolates in the Global Pneumococcal Sequencing (GPS) project database for analysis. A Global Pneumococcal Sequence Cluster (GPSC) and a clonal complex (CC) were assigned to each genome. Phylogenetic, evolutionary, and spatiotemporal analysis were conducted using the same 24F collection and supplemented with a global collection of genomes belonging to the lineage of interest from the GPS project database (n=25 590). FINDINGS Serotype 24F was identified in numerous countries mainly due to the clonal spread of three lineages: GPSC10 (CC230), GPSC16 (CC156), and GPSC206 (CC7701). GPSC10 was the only multidrug-resistant lineage. GPSC10 drove the increase in 24F in France and had high invasive disease potential. The international dataset of GPSC10 (n=888) revealed that this lineage expressed 16 other serotypes, with only six included in 13-valent PCV (PCV13). All serotype 24F isolates were clustered in a single clade within the GPSC10 phylogeny and long-range transmissions were detected from Europe to other continents. Spatiotemporal analysis showed GPSC10-24F took 3-5 years to spread across France and a rapid change of serotype composition from PCV13 serotype 19A to 24F during the introduction of PCV13 was observed in neighbouring country Spain. INTERPRETATION Our work reveals that GPSC10 alone is a challenge for serotype-based vaccine strategy. More systematic investigation to identify lineages like GPSC10 will better inform and improve next-generation preventive strategies against pneumococcal diseases

    A Streptococcus pneumoniae lineage usually associated with pneumococcal conjugate vaccine (PCV) serotypes is the most common cause of serotype 35B invasive disease in South Africa, following routine use of PCV.

    Get PDF
    Pneumococcal serotype 35B is an important non-conjugate vaccine (non-PCV) serotype. Its continued emergence, post-PCV7 in the USA, was associated with expansion of a pre-existing 35B clone (clonal complex [CC] 558) along with post-PCV13 emergence of a non-35B clone previously associated with PCV serotypes (CC156). This study describes lineages circulating among 35B isolates in South Africa before and after PCV introduction. We also compared 35B isolates belonging to a predominant 35B lineage in South Africa (GPSC5), with isolates belonging to the same lineage in other parts of the world. Serotype 35B isolates that caused invasive pneumococcal disease in South Africa in 2005-2014 were characterized by whole-genome sequencing (WGS). Multi-locus sequence types and global pneumococcal sequence clusters (GPSCs) were derived from WGS data of 63 35B isolates obtained in 2005-2014. A total of 262 isolates that belong to GPSC5 (115 isolates from South Africa and 147 from other countries) that were sequenced as part of the global pneumococcal sequencing (GPS) project were included for comparison. Serotype 35B isolates from South Africa were differentiated into seven GPSCs and GPSC5 was most common (49 %, 31/63). While 35B was the most common serotype among GPSC5/CC172 isolates in South Africa during the PCV13 period (66 %, 29/44), 23F was the most common serotype during both the pre-PCV (80 %, 37/46) and PCV7 period (32 %, 8/25). Serotype 35B represented 15 % (40/262) of GPSC5 isolates within the global GPS database and 75 % (31/40) were from South Africa. The predominance of the GPSC5 lineage within non-vaccine serotype 35B, is possibly unique to South Africa and warrants further molecular surveillance of pneumococci

    International links between Streptococcus pneumoniae vaccine serotype 4 sequence type (ST) 801 in Northern European shipyard outbreaks of invasive pneumococcal disease

    No full text
    Copyright © 2021 The Author(s). Published by Elsevier Ltd.. All rights reserved.Background: Pneumococcal disease outbreaks of vaccine preventable serotype 4 sequence type (ST)801 in shipyards have been reported in several countries. We aimed to use genomics to establish any international links between them. Methods: Sequence data from ST801-related outbreak isolates from Norway (n = 17), Finland (n = 11) and Northern Ireland (n = 2) were combined with invasive pneumococcal disease surveillance from the respective countries, and ST801-related genomes from an international collection (n = 41 of > 40,000), totalling 106 genomes. Raw data were mapped and recombination excluded before phylogenetic dating. Results: Outbreak isolates were relatively diverse, with up to 100 SNPs (single nucleotide polymorphisms) and a common ancestor estimated around the year 2000. However, 19 Norwegian and Finnish isolates were nearly indistinguishable (0–2 SNPs) with the common ancestor dated around 2017. Conclusion: The total diversity of ST801 within the outbreaks could not be explained by recent transmission alone, suggesting that harsh environmental and associated living conditions reported in the shipyards may facilitate invasion of colonising pneumococci. However, near identical strains in the Norwegian and Finnish outbreaks does suggest that transmission between international shipyards also contributed to those outbreaks. This indicates the need for improved preventative measures in this working population including pneumococcal vaccination.Peer reviewe

    A mosaic tetracycline resistance gene tet(S/M) detected in an MDR pneumococcal CC230 lineage that underwent capsular switching in South Africa

    No full text
    Objectives: we reported tet(S/M) in Streptococcus pneumoniae and investigated its temporal spread in relation to nationwide clinical interventions. Methods: we whole-genome sequenced 12 254 pneumococcal isolates from 29 countries on an Illumina HiSeq sequencer. Serotype, multilocus ST and antibiotic resistance were inferred from genomes. An SNP tree was built using Gubbins. Temporal spread was reconstructed using a birth–death model. Results: we identified tet(S/M) in 131 pneumococcal isolates and none carried other known tet genes. Tetracycline susceptibility testing results were available for 121 tet(S/M)-positive isolates and all were resistant. A majority (74%) of tet(S/M)-positive isolates were from South Africa and caused invasive diseases among young children (59% HIV positive, where HIV status was available). All but two tet(S/M)-positive isolates belonged to clonal complex (CC) 230. A global phylogeny of CC230 (n=389) revealed that tet(S/M)-positive isolates formed a sublineage predicted to exhibit resistance to penicillin, co-trimoxazole, erythromycin and tetracycline. The birth–death model detected an unrecognized outbreak of this sublineage in South Africa between 2000 and 2004 with expected secondary infections (effective reproductive number, R) of ∼2.5. R declined to ∼1.0 in 2005 and <1.0 in 2012. The declining epidemic could be related to improved access to ART in 2004 and introduction of pneumococcal conjugate vaccine (PCV) in 2009. Capsular switching from vaccine serotype 14 to non-vaccine serotype 23A was observed within the sublineage. Conclusions: the prevalence of tet(S/M) in pneumococci was low and its dissemination was due to an unrecognized outbreak of CC230 in South Africa. Capsular switching in this MDR sublineage highlighted its potential to continue to cause disease in the post-PCV13 era

    SeroBA: rapid high-throughput serotyping of Streptococcus pneumoniae from whole genome sequence data

    No full text
    Streptococcus pneumoniae is responsible for 240 000–460 000 deaths in children under 5 years of age each year. Accurate identification of pneumococcal serotypes is important for tracking the distribution and evolution of serotypes following the introduction of effective vaccines. Recent efforts have been made to infer serotypes directly from genomic data but current software approaches are limited and do not scale well. Here, we introduce a novel method, SeroBA, which uses a k-mer approach. We compare SeroBA against real and simulated data and present results on the concordance and computational performance against a validation dataset, the robustness and scalability when analysing a large dataset, and the impact of varying the depth of coverage on sequence-based serotyping. SeroBA can predict serotypes, by identifying the cps locus, directly from raw whole genome sequencing read data with 98 % concordance using a k-mer-based method, can process 10 000 samples in just over 1 day using a standard server and can call serotypes at a coverage as low as 15–21×. SeroBA is implemented in Python3 and is freely available under an open source GPLv3 licence from: https://github.com/sanger-pathogens/serob

    SeroBA: rapid high-throughput serotyping of <i>Streptococcus pneumoniae</i> from whole genome sequence data

    Get PDF
    Streptococcus pneumoniae is responsible for 240 000–460 000 deaths in children under 5 years of age each year. Accurate identification of pneumococcal serotypes is important for tracking the distribution and evolution of serotypes following the introduction of effective vaccines. Recent efforts have been made to infer serotypes directly from genomic data but current software approaches are limited and do not scale well. Here, we introduce a novel method, SeroBA, which uses a k-mer approach. We compare SeroBA against real and simulated data and present results on the concordance and computational performance against a validation dataset, the robustness and scalability when analysing a large dataset, and the impact of varying the depth of coverage on sequence-based serotyping. SeroBA can predict serotypes, by identifying the cps locus, directly from raw whole genome sequencing read data with 98 % concordance using a k-mer-based method, can process 10 000 samples in just over 1 day using a standard server and can call serotypes at a coverage as low as 15–21×. SeroBA is implemented in Python3 and is freely available under an open source GPLv3 licence from: https://github.com/sanger-pathogens/serob

    Pneumococcal lineages associated with serotype replacement and antibiotic resistance in childhood invasive pneumococcal disease in the post-PCV13 era: An international whole-genome sequencing study

    No full text
    Background: Invasive pneumococcal disease remains an important health priority owing to increasing disease incidence caused by pneumococci expressing non-vaccine serotypes. We previously defined 621 Global Pneumococcal Sequence Clusters (GPSCs) by analysing 20 027 pneumococcal isolates collected worldwide and from previously published genomic data. In this study, we aimed to investigate the pneumococcal lineages behind the predominant serotypes, the mechanism of serotype replacement in disease, as well as the major pneumococcal lineages contributing to invasive pneumococcal disease in the post-vaccine era and their antibiotic resistant traits.Methods: We whole-genome sequenced 3233 invasive pneumococcal disease isolates from laboratory-based surveillance programmes in Hong Kong (n=78), Israel (n=701), Malawi (n=226), South Africa (n=1351), The Gambia (n=203), and the USA (n=674). The genomes represented pneumococci from before and after pneumococcal conjugate vaccine (PCV) introductions and were from children younger than 3 years. We identified predominant serotypes by prevalence and their major contributing lineages in each country, and assessed any serotype replacement by comparing the incidence rate between the pre-PCV and PCV periods for Israel, South Africa, and the USA. We defined the status of a lineage as vaccine-type GPSC (≥50% 13-valent PCV [PCV13] serotypes) or non-vaccine-type GPSC (\u3e50% non-PCV13 serotypes) on the basis of its initial serotype composition detected in the earliest vaccine period to measure their individual contribution toward serotype replacement in each country. Major pneumococcal lineages in the PCV period were identified by pooled incidence rate using a random effects model.Findings: The five most prevalent serotypes in the PCV13 period varied between countries, with only serotypes 5, 12F, 15B/C, 19A, 33F, and 35B/D common to two or more countries. The five most prevalent serotypes in the PCV13 period varied between countries, with only serotypes 5, 12F, 15B/C, 19A, 33F, and 35B/D common to two or more countries. These serotypes were associated with more than one lineage, except for serotype 5 (GPSC8). Serotype replacement was mainly mediated by expansion of non-vaccine serotypes within vaccine-type GPSCs and, to a lesser extent, by increases in non-vaccine-type GPSCs. A globally spreading lineage, GPSC3, expressing invasive serotypes 8 in South Africa and 33F in the USA and Israel, was the most common lineage causing non-vaccine serotype invasive pneumococcal disease in the PCV13 period. We observed that same prevalent non-vaccine serotypes could be associated with distinctive lineages in different countries, which exhibited dissimilar antibiotic resistance profiles. In non-vaccine serotype isolates, we detected significant increases in the prevalence of resistance to penicillin (52 [21%] of 249 vs 169 [29%] of 575, p=0·0016) and erythromycin (three [1%] of 249 vs 65 [11%] of 575, p=0·0031) in the PCV13 period compared with the pre-PCV period.Interpretation: Globally spreading lineages expressing invasive serotypes have an important role in serotype replacement, and emerging non-vaccine serotypes associated with different pneumococcal lineages in different countries might be explained by local antibiotic-selective pressures. Continued genomic surveillance of the dynamics of the pneumococcal population with increased geographical representation in the post-vaccine period will generate further knowledge for optimising future vaccine design.Funding: Bill & Melinda Gates Foundation, Wellcome Sanger Institute, and the US Centers for Disease Control
    corecore