636 research outputs found

    International genomic definition of pneumococcal lineages, to contextualise disease, antibiotic resistance and vaccine impact

    Background: Pneumococcal conjugate vaccines have reduced the incidence of invasive pneumococcal disease, caused by vaccine serotypes, but non-vaccine-serotypes remain a concern. We used whole genome sequencing to study pneumococcal serotype, antibiotic resistance and invasiveness, in the context of genetic background. / Methods: Our dataset of 13,454 genomes, combined with four published genomic datasets, represented Africa (40%), Asia (25%), Europe (19%), North America (12%), and South America (5%). These 20,027 pneumococcal genomes were clustered into lineages using PopPUNK, and named Global Pneumococcal Sequence Clusters (GPSCs). From our dataset, we additionally derived serotype and sequence type, and predicted antibiotic sensitivity. We then measured invasiveness using odds ratios relating prevalence in invasive pneumococcal disease to carriage. / Findings: The combined collections (n = 20,027) were clustered into 621 GPSCs. Thirty-five GPSCs observed in our dataset were represented by >100 isolates, and subsequently classed as dominant-GPSCs. In 22/35 (63%) of dominant-GPSCs both non-vaccine serotypes and vaccine serotypes were observed in the years up until, and including, the first year of pneumococcal conjugate vaccine introduction. Penicillin and multidrug resistance were higher (p < .05) in a subset of dominant-GPSCs (14/35, 9/35 respectively), and resistance to an increasing number of antibiotic classes was associated with increased recombination (R² = 0.27, p < .0001). In 28/35 dominant-GPSCs, the country of isolation was a significant predictor (p < .05) of its antibiogram (mean misclassification error 0.28, SD ± 0.13). We detected increased invasiveness of six genetic backgrounds, when compared to other genetic backgrounds expressing the same serotype. Up to 1.6-fold changes in invasiveness odds ratio were observed. / Interpretation: We define GPSCs that can be assigned to any pneumococcal genomic dataset, to aid international comparisons. 
Existing non-vaccine-serotypes in most GPSCs preclude the removal of these lineages by pneumococcal conjugate vaccines, leaving potential for serotype replacement. A subset of GPSCs have increased resistance, and/or serotype-independent invasiveness
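
The invasiveness measure mentioned in this abstract, an odds ratio relating prevalence in invasive disease to prevalence in carriage, can be illustrated with a 2x2 table of isolate counts. The sketch below is only an illustration under stated assumptions and is not the authors' pipeline: the function name `invasiveness_odds_ratio`, its parameters, and the counts in the usage line are all hypothetical.

```python
def invasiveness_odds_ratio(disease_in, carriage_in, disease_out, carriage_out):
    """Odds ratio of invasive disease versus carriage for one lineage.

    disease_in / carriage_in: isolate counts for the lineage of interest.
    disease_out / carriage_out: isolate counts for all other lineages.
    All four arguments are hypothetical names chosen for this sketch.
    """
    if carriage_in == 0 or disease_out == 0:
        # The ratio is undefined when either denominator cell is empty.
        raise ValueError("carriage_in and disease_out must be non-zero")
    # (disease_in / carriage_in) divided by (disease_out / carriage_out)
    return (disease_in * carriage_out) / (carriage_in * disease_out)


# Made-up counts: 30 disease and 10 carriage isolates in the lineage,
# 60 disease and 120 carriage isolates elsewhere.
example_or = invasiveness_odds_ratio(30, 10, 60, 120)
```

A value above 1 would indicate that the lineage is over-represented in disease relative to carriage. A real analysis would also need confidence intervals and a strategy for empty cells, which this sketch leaves out.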

    Designing a physical activity parenting course : parental views on recruitment, content and delivery

    Background: Many children do not engage in sufficient levels of physical activity (PA) and spend too much time screen-viewing (SV). High levels of SV (e.g. watching TV, playing video games and surfing the internet) and low levels of PA have been associated with adverse health outcomes. Parenting courses may hold promise as an intervention medium to change children’s PA and SV. The current study was formative work conducted to design a new parenting programme to increase children’s PA and reduce their SV. Specifically, we focussed on interest in a course, desired content and delivery style, barriers and facilitators to participation and opinions on control group provision. Methods: In-depth telephone interviews were conducted with thirty-two parents (29 female) of 6–8 year olds. Data were analysed thematically. An anonymous online survey was also completed by 750 parents of 6–8 year old children and descriptive statistics calculated. Results: Interview participants were interested in a parenting course because they wanted general parenting advice and ideas to help their children be physically active. Parents indicated that they would benefit from knowing how to quantify their child’s PA and SV levels. Parents wanted practical ideas of alternatives to SV. Most parents would be unable to attend unless childcare was provided. Schools were perceived to be a trusted source of information about parenting courses and the optimal recruitment location. In terms of delivery style, the majority of parents stated they would prefer a group-based approach that provided opportunities for peer learning and support with professional input. Survey participants reported the timing of classes and the provision of childcare were essential factors that would affect participation. In terms of designing an intervention, the most preferred control group option was the opportunity to attend the same course at a later date. 
Conclusions: Parents are interested in PA/SV parenting courses but the provision of childcare is essential for attendance. Recruitment is likely to be facilitated via trusted sources. Parents want practical advice on how to overcome barriers and suggest advice is provided in a mutually supportive group experience with expert input

    Putative novel cps loci in a large global collection of pneumococci

    The pneumococcus produces a polysaccharide capsule, encoded by the cps locus, that provides protection against phagocytosis and determines serotype. Nearly 100 serotypes have been identified with new serotypes still being discovered, especially in previously understudied regions. Here we present an analysis of the cps loci of more than 18 000 genomes from the Global Pneumococcal Sequencing (GPS) project with the aim of identifying novel cps loci with the potential to produce previously unrecognized capsule structures. Serotypes were assigned using whole genome sequence data and 66 of the approximately 100 known serotypes were included in the final dataset. Closer examination of each serotype’s sequences identified nine putative novel cps loci (9X, 11X, 16X, 18X1, 18X2, 18X3, 29X, 33X and 36X) found in ~2.6 % of the genomes. The large number and global distribution of GPS genomes provided an unprecedented opportunity to identify novel cps loci and consider their phylogenetic and geographical distribution. Nine putative novel cps loci were identified and examples of each will undergo subsequent structural and immunological analysis

    Cinteny: flexible analysis and visualization of synteny and genome rearrangements in multiple organisms

    BACKGROUND: Identifying syntenic regions, i.e., blocks of genes or other markers with evolutionarily conserved order, and quantifying evolutionary relatedness between genomes in terms of chromosomal rearrangements is one of the central goals in comparative genomics. However, the analysis of synteny and the resulting assessment of genome rearrangements are sensitive to the choice of a number of arbitrary parameters that affect the detection of synteny blocks. In particular, the choice of a set of markers and the effect of different aggregation strategies, which enable coarse graining of synteny blocks and exclusion of micro-rearrangements, need to be assessed. Therefore, existing tools and resources that facilitate identification, visualization and analysis of synteny need to be further improved to provide a flexible platform for such analysis, especially in the context of multiple genomes. RESULTS: We present a new tool, Cinteny, for fast identification and analysis of synteny with different sets of markers and various levels of coarse graining of syntenic blocks. Using the Hannenhalli-Pevzner approach and its extensions, Cinteny also enables interactive determination of evolutionary relationships between genomes in terms of the number of rearrangements (the reversal distance). In particular, Cinteny provides: i) integration of synteny browsing with assessment of evolutionary distances for multiple genomes; ii) flexibility to adjust the parameters and re-compute the results on-the-fly; iii) ability to work with user provided data, such as orthologous genes, sequence tags or other conserved markers. In addition, Cinteny provides many annotated mammalian, invertebrate and fungal genomes that are pre-loaded and available for analysis at . CONCLUSION: Cinteny allows one to automatically compare multiple genomes and perform sensitivity analysis for synteny block detection and for the subsequent computation of reversal distances. 
Cinteny can also be used to interactively browse syntenic blocks conserved in multiple genomes, to facilitate genome annotation and validation of assemblies for newly sequenced genomes, and to construct and assess phylogenomic trees
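
The reversal distance referred to in this abstract is the minimum number of segment reversals (each of which also flips the orientation of the markers it contains) needed to turn one signed marker order into another. The sketch below is not the Hannenhalli-Pevzner algorithm and is not Cinteny's code. It is a brute-force breadth-first search, practical only for a handful of markers, meant to make the definition concrete; the function name `reversal_distance` is hypothetical.

```python
from collections import deque


def reversal_distance(perm):
    """Minimum number of signed reversals turning `perm` into 1, 2, ..., n.

    Brute-force BFS over all signed permutations, so only usable for small n.
    This illustrates the definition; it is not the Hannenhalli-Pevzner method.
    """
    start = tuple(perm)
    target = tuple(range(1, len(start) + 1))
    if sorted(abs(x) for x in start) != list(target):
        raise ValueError("perm must be a signed permutation of 1..n")
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        current, dist = queue.popleft()
        if current == target:
            return dist
        n = len(current)
        for i in range(n):
            for j in range(i, n):
                # Reverse the segment i..j and flip the sign of each marker in it.
                flipped = tuple(-x for x in reversed(current[i:j + 1]))
                nxt = current[:i] + flipped + current[j + 1:]
                if nxt not in seen:
                    seen.add(nxt)
                    queue.append((nxt, dist + 1))
    raise RuntimeError("unreachable: every signed permutation can be sorted")


# A fully inverted three-marker block is one reversal away from the identity.
inverted_block = reversal_distance([-3, -2, -1])
```

The search space grows as 2^n · n!, which is why polynomial-time methods such as the one named in the abstract are needed for real genomes.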

    Mammalian cell entry genes in Streptomyces may provide clues to the evolution of bacterial virulence

    Understanding the evolution of virulence is key to appreciating the role specific loci play in pathogenicity. Streptomyces species are generally non-pathogenic soil saprophytes, yet within their genome we can find homologues of virulence loci. One example of this is the mammalian cell entry (mce) locus, which has been characterised in Mycobacterium tuberculosis. To investigate the role in Streptomyces we deleted the mce locus and studied its impact on cell survival, morphology and interaction with other soil organisms. Disruption of the mce cluster resulted in virulence towards amoebae (Acanthamoeba polyphaga) and reduced colonization of plant (Arabidopsis) models, indicating these genes may play an important role in Streptomyces survival in the environment. Our data suggest that loss of mce in Streptomyces spp. may have profound effects on survival in a competitive soil environment, and provide insight into the evolution and selection of these genes as virulence factors in related pathogenic organisms

    Reconstructing cancer genomes from paired-end sequencing data

    Background: A cancer genome is derived from the germline genome through a series of somatic mutations. Somatic structural variants - including duplications, deletions, inversions, translocations, and other rearrangements - result in a cancer genome that is a scrambling of intervals, or "blocks" of the germline genome sequence. We present an efficient algorithm for reconstructing the block organization of a cancer genome from paired-end DNA sequencing data. Results: By aligning paired reads from a cancer genome - and a matched germline genome, if available - to the human reference genome, we derive: (i) a partition of the reference genome into intervals; (ii) adjacencies between these intervals in the cancer genome; (iii) an estimated copy number for each interval. We formulate the Copy Number and Adjacency Genome Reconstruction Problem of determining the cancer genome as a sequence of the derived intervals that is consistent with the measured adjacencies and copy numbers. We design an efficient algorithm, called Paired-end Reconstruction of Genome Organization (PREGO), to solve this problem by reducing it to an optimization problem on an interval-adjacency graph constructed from the data. The solution to the optimization problem results in an Eulerian graph, containing an alternating Eulerian tour that corresponds to a cancer genome that is consistent with the sequencing data. We apply our algorithm to five ovarian cancer genomes that were sequenced as part of The Cancer Genome Atlas. We identify numerous rearrangements, or structural variants, in these genomes, analyze reciprocal vs. non-reciprocal rearrangements, and identify rearrangements consistent with known mechanisms of duplication such as tandem duplications and breakage/fusion/bridge (B/F/B) cycles. Conclusions: We demonstrate that PREGO efficiently identifies complex and biologically relevant rearrangements in cancer genome sequencing data. 
An implementation of the PREGO algorithm is available at http://compbio.cs.brown.edu/software/.

    Visualizing variation within Global Pneumococcal Sequence Clusters (GPSCs) and country population snapshots to contextualize pneumococcal isolates

    Knowledge of pneumococcal lineages, their geographic distribution and antibiotic resistance patterns, can give insights into global pneumococcal disease. We provide interactive bioinformatic outputs to explore such topics, aiming to increase dissemination of genomic insights to the wider community, without the need for specialist training. We prepared 12 country-specific phylogenetic snapshots, and international phylogenetic snapshots of 73 common Global Pneumococcal Sequence Clusters (GPSCs) previously defined using PopPUNK, and present them in Microreact. Gene presence and absence defined using Roary, and recombination profiles derived from Gubbins are presented in Phandango for each GPSC. Temporal phylogenetic signal was assessed for each GPSC using BactDating. We provide examples of how such resources can be used. In our example use of a country-specific phylogenetic snapshot we determined that serotype 14 was observed in nine unrelated genetic backgrounds in South Africa. The international phylogenetic snapshot of GPSC9, in which most serotype 14 isolates from South Africa were observed, highlights that there were three independent sub-clusters represented by South African serotype 14 isolates. We estimated from the GPSC9-dated tree that the sub-clusters were each established in South Africa during the 1980s. We show how recombination plots allowed the identification of a 20 kb recombination spanning the capsular polysaccharide locus within GPSC97. This was consistent with a switch from serotype 6A to 19A estimated to have occurred in the 1990s from the GPSC97-dated tree. Plots of gene presence/absence of resistance genes (tet, erm, cat) across the GPSC23 phylogeny were consistent with acquisition of a composite transposon. We estimated from the GPSC23-dated tree that the acquisition occurred between 1953 and 1975. Finally, we demonstrate the assignment of GPSC31 to 17 externally generated pneumococcal serotype 1 assemblies from Utah via Pathogenwatch. 
Most of the Utah isolates clustered within GPSC31 in a USA-specific clade with the most recent common ancestor estimated between 1958 and 1981. The resources we have provided can be used to explore data, test hypotheses and generate new hypotheses. The accessible assignment of GPSCs allows others to contextualize their own collections beyond the data presented here

    Meraculous: De Novo Genome Assembly with Short Paired-End Reads

    We describe a new algorithm, meraculous, for whole genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4 megabase genome of the haploid yeast Pichia stipitis. More than 95% of the genome is recovered, with no errors; half the assembled sequence is in contigs longer than 101 kilobases and in scaffolds longer than 269 kilobases. Incorporating fosmid ends recovers entire chromosomes. Meraculous relies on an efficient and conservative traversal of the subgraph of the k-mer (de Bruijn) graph of oligonucleotides with unique high quality extensions in the dataset, avoiding an explicit error correction step as used in other short-read assemblers. A novel memory-efficient hashing scheme is introduced. The resulting contigs are ordered and oriented using paired reads separated by ∼280 bp or ∼3.2 kbp, and many gaps between contigs can be closed using paired-end placements. Practical issues with the dataset are described, and prospects for assembling larger genomes are discussed
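
The contiguity figures quoted in this abstract ("half the assembled sequence is in contigs longer than 101 kilobases") correspond to the statistic commonly called N50, although the abstract does not use that name. The sketch below shows one common way to compute it from a list of contig lengths. It is not part of meraculous; the function name `n50` and the example lengths are hypothetical.

```python
def n50(lengths):
    """Return the N50 of a collection of contig (or scaffold) lengths.

    N50 is the length L such that contigs of length >= L together contain
    at least half of the total assembled sequence.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    total = sum(lengths)
    running = 0
    # Walk from the longest contig down until half the total is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Made-up contig lengths in kilobases: total is 20, and the two longest
# contigs (6 + 5 = 11) already cover half of it.
example_n50 = n50([2, 3, 4, 5, 6])
```

The same function applies to scaffold lengths, which is how a separate scaffold-level figure like the one in the abstract would be obtained.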

    A mosaic tetracycline resistance gene tet(S/M) detected in an MDR pneumococcal CC230 lineage that underwent capsular switching in South Africa

    Objectives: We reported tet(S/M) in Streptococcus pneumoniae and investigated its temporal spread in relation to nationwide clinical interventions. Methods: We whole-genome sequenced 12 254 pneumococcal isolates from 29 countries on an Illumina HiSeq sequencer. Serotype, multilocus ST and antibiotic resistance were inferred from genomes. An SNP tree was built using Gubbins. Temporal spread was reconstructed using a birth–death model. Results: We identified tet(S/M) in 131 pneumococcal isolates and none carried other known tet genes. Tetracycline susceptibility testing results were available for 121 tet(S/M)-positive isolates and all were resistant. A majority (74%) of tet(S/M)-positive isolates were from South Africa and caused invasive diseases among young children (59% HIV positive, where HIV status was available). All but two tet(S/M)-positive isolates belonged to clonal complex (CC) 230. A global phylogeny of CC230 (n=389) revealed that tet(S/M)-positive isolates formed a sublineage predicted to exhibit resistance to penicillin, co-trimoxazole, erythromycin and tetracycline. The birth–death model detected an unrecognized outbreak of this sublineage in South Africa between 2000 and 2004 with expected secondary infections (effective reproductive number, R) of ∼2.5. R declined to ∼1.0 in 2005 and <1.0 in 2012. The declining epidemic could be related to improved access to ART in 2004 and introduction of pneumococcal conjugate vaccine (PCV) in 2009. Capsular switching from vaccine serotype 14 to non-vaccine serotype 23A was observed within the sublineage. Conclusions: The prevalence of tet(S/M) in pneumococci was low and its dissemination was due to an unrecognized outbreak of CC230 in South Africa. Capsular switching in this MDR sublineage highlighted its potential to continue to cause disease in the post-PCV13 era