267 research outputs found

    Genome-wide association, prediction and heritability in bacteria with application to Streptococcus pneumoniae

    Get PDF
    Whole-genome sequencing has facilitated genome-wide analyses of association, prediction and heritability in many organisms. However, such analyses in bacteria are still in their infancy, being limited by difficulties including genome plasticity and strong population structure. Here we propose a suite of methods including linear mixed models, elastic net and LD-score regression, adapted to bacterial traits using innovations such as frequency-based allele coding, both insertion/deletion and nucleotide testing and heritability partitioning. We compare and validate our methods against the current state-of-art using simulations, and analyse three phenotypes of the major human pathogen Streptococcus pneumoniae, including the first analyses of minimum inhibitory concentrations (MIC) for penicillin and ceftriaxone. We show that the MIC traits are highly heritable with high prediction accuracy, explained by many genetic associations under good population structure control. In ceftriaxone MIC, this is surprising because none of the isolates are resistant as per the inhibition zone criteria. We estimate that half of the heritability of penicillin MIC is explained by a known drug-resistance region, which also contributes a quarter of the ceftriaxone MIC heritability. For the within-host carriage duration phenotype, no associations were observed, but the moderate heritability and prediction accuracy indicate a moderately polygenic trait.Peer reviewe

    PANINI : Pangenome Neighbour Identification for Bacterial Populations

    Get PDF
    The standard workhorse for genomic analysis of the evolution of bacterial populations is phylogenetic modelling of mutations in the core genome. However, a notable amount of information about evolutionary and transmission processes in diverse populations can be lost unless the accessory genome is also taken into consideration. Here, we introduce PANINI (Pangenome Neighbour Identification for Bacterial Populations), a computationally scalable method for identifying the neighbours for each isolate in a data set using unsupervised machine learning with stochastic neighbour embedding based on the t-SNE (t-distributed stochastic neighbour embedding) algorithm. PANINI is browser-based and integrates with the Microreact platform for rapid online visualization and exploration of both core and accessory genome evolutionary signals, together with relevant epidemiological, geographical, temporal and other metadata. Several case studies with single- and multi-clone pneumococcal populations are presented to demonstrate the ability to identify biologically important signals from gene content data. PANINI is available at http://panini.pathogen.watch and code at http://gitlab.com/cgps/panini.Peer reviewe

    Genome-wide identification of lineage and locus specific variation associated with pneumococcal carriage duration.

    Get PDF
    Streptococcus pneumoniae is a leading cause of invasive disease in infants, especially in low-income settings. Asymptomatic carriage in the nasopharynx is a prerequisite for disease, but variability in its duration is currently only understood at the serotype level. Here we developed a model to calculate the duration of carriage episodes from longitudinal swab data, and combined these results with whole genome sequence data. We estimated that pneumococcal genomic variation accounted for 63% of the phenotype variation, whereas the host traits considered here (age and previous carriage) accounted for less than 5%. We further partitioned this heritability into both lineage and locus effects, and quantified the amount attributable to the largest sources of variation in carriage duration: serotype (17%), drug-resistance (9%) and other significant locus effects (7%). A pan-genome-wide association study identified prophage sequences as being associated with decreased carriage duration independent of serotype, potentially by disruption of the competence mechanism. These findings support theoretical models of pneumococcal competition and antibiotic resistance

    Time contracts and temporal precision declines when the mind wanders

    Get PDF
    Our perception of time varies considerably from moment to moment but how this variability relates to endogenous fluctuations in attentional states remains poorly understood. Here we tested the hypothesis that perceptual decoupling during mind wandering would distort interval timing. In two studies with different visual interval timing paradigms, we found that mind wandering states were characterized by underestimation of intervals and a decline in temporal discrimination. Further analyses suggested that temporal contraction during mind wandering, but not a decline in temporal discrimination, could be attributed in part to attentional lapses. These results highlight the role of transient fluctuations in attentional states in intra-individual variability in time perception and have implications for the behavioral markers, and costs and benefits, of mind wandering

    Diversification of Colonization Factors in a Multidrug-Resistant Escherichia coli Lineage Evolving under Negative Frequency-Dependent Selection

    Get PDF
    Escherichia coli is a major cause of bloodstream and urinary tract infections globally. The wide dissemination of multidrug-resistant (MDR) strains of extraintestinal pathogenic E. coli (ExPEC) poses a rapidly increasing public health burden due to narrowed treatment options and increased risk of failure to clear an infection. Here, we present a detailed population genomic analysis of the ExPEC ST131 clone, in which we seek explanations for its success as an emerging pathogenic strain beyond the acquisition of antimicrobial resistance (AMR) genes. We show evidence for evolution toward separate ecological niches for the main clades of ST131 and differential evolution of anaerobic metabolism, key colonization, and virulence factors. We further demonstrate that negative frequency-dependent selection acting across accessory loci is a major mechanism that has shaped the population evolution of this pathogen.IMPORTANCE Infections with multidrug-resistant (MDR) strains of Escherichia coli are a significant global public health concern. To combat these pathogens, we need a deeper understanding of how they evolved from their background populations. By understanding the processes that underpin their emergence, we can design new strategies to limit evolution of new clones and combat existing clones. By combining population genomics with modelling approaches, we show that dominant MDR clones of E. coli are under the influence of negative frequency-dependent selection, preventing them from rising to fixation in a population. Furthermore, we show that this selection acts on genes involved in anaerobic metabolism, suggesting that this key trait, and the ability to colonize human intestinal tracts, is a key step in the evolution of MDR clones of E. coli.Peer reviewe

    Intravital FRAP imaging using an E-cadherin-GFP mouse reveals disease- and drug-dependent dynamic regulation of cell-cell junctions in live tissue

    Get PDF
    E-cadherin-mediated cell-cell junctions play a prominent role in maintaining the epithelial architecture. The disruption or deregulation of these adhesions in cancer can lead to the collapse of tumor epithelia that precedes invasion and subsequent metastasis. Here we generated an E-cadherin-GFP mouse that enables intravital photobleaching and quantification of E-cadherin mobility in live tissue without affecting normal biology. We demonstrate the broad applications of this mouse by examining E-cadherin regulation in multiple tissues, including mammary, brain, liver, and kidney tissue, while specifically monitoring E-cadherin mobility during disease progression in the pancreas. We assess E-cadherin stability in native pancreatic tissue upon genetic manipulation involving Kras and p53 or in response to anti-invasive drug treatment and gain insights into the dynamic remodeling of E-cadherin during in situ cancer progression. FRAP in the E-cadherin-GFP mouse, therefore, promises to be a valuable tool to fundamentally expand our understanding of E-cadherin-mediated events in native microenvironments

    Genomic and panproteomic analysis of the development of infant immune responses to antigenically-diverse pneumococci

    Get PDF
    Streptococcus pneumoniae (pneumococcus) is a nasopharyngeal commensal and respiratory pathogen. This study characterises the immunoglobulin G (IgG) repertoire recognising pneumococci from birth to 24 months old (mo) in a prospectively-sampled cohort of 63 children using a panproteome array. IgG levels are highest at birth, due to transplacental transmission of maternal antibodies. The subsequent emergence of responses to individual antigens exhibit distinct kinetics across the cohort. Stable differences in the strength of individuals’ responses, correlating with maternal IgG concentrations, are established by 6 mo. By 12 mo, children develop unique antibody profiles that are boosted by re-exposure. However, some proteins only stimulate substantial responses in adults. Integrating genomic data on nasopharyngeal colonisation demonstrates rare pneumococcal antigens can elicit strong IgG levels post-exposure. Quantifying such responses to the diverse core loci (DCL) proteins is complicated by cross-immunity between variants. In particular, the conserved N terminus of DCL protein zinc metalloprotease B provokes the strongest early IgG responses. DCL proteins’ ability to inhibit mucosal immunity likely explains continued pneumococcal carriage despite hosts’ polyvalent antibody repertoire. Yet higher IgG levels are associated with reduced incidence, and severity, of pneumonia, demonstrating the importance of the heterogeneity in response strength and kinetics across antigens and individuals

    Detecting co-selection through excess linkage disequilibrium in bacterial genomes

    Get PDF
    Population genomics has revolutionized our ability to study bacterial evolution by enabling data-driven discovery of the genetic architecture of trait variation. Genome-wide association studies (GWAS) have more recently become accompanied by genome-wide epistasis and co-selection (GWES) analysis, which offers a phenotype-free approach to generating hypotheses about selective processes that simultaneously impact multiple loci across the genome. However, existing GWES methods only consider associations between distant pairs of loci within the genome due to the strong impact of linkage-disequilibrium (LD) over short distances. Based on the general functional organisation of genomes it is nevertheless expected that majority of co-selection and epistasis will act within relatively short genomic proximity, on co-variation occurring within genes and their promoter regions, and within operons. Here, we introduce LDWeaver, which enables an exhaustive GWES across both short- and long-range LD, to disentangle likely neutral co-variation from selection. We demonstrate the ability of LDWeaver to efficiently generate hypotheses about co-selection using large genomic surveys of multiple major human bacterial pathogen species and validate several findings using functional annotation and phenotypic measurements. Our approach will facilitate the study of bacterial evolution in the light of rapidly expanding population genomic data
    • …
    corecore