29 research outputs found

The diversity of a distributed genome in bacterial populations

Author: Baumdicker F.
Hess W. R.
Pfaffelhuber P.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 05/11/2010

The distributed genome hypothesis states that the set of genes in a population of bacteria is distributed over all individuals that belong to the specific taxon. It implies that certain genes can be gained and lost from generation to generation. We use the random genealogy given by a Kingman coalescent in order to superimpose events of gene gain and loss along ancestral lines. Gene gains occur at a constant rate along ancestral lines. We assume that gained genes have never been present in the population before. Gene losses occur at a rate proportional to the number of genes present along the ancestral line. In this infinitely many genes model, we derive moments for several statistics within a sample: the average number of genes per individual, the average number of genes differing between individuals, the number of incongruent pairs of genes, the total number of different genes in the sample, and the gene frequency spectrum. We demonstrate that the model gives a reasonable fit with gene frequency data from marine cyanobacteria. Comment: Published at http://dx.doi.org/10.1214/09-AAP657 in the Annals of Applied Probability (http://www.imstat.org/aap/) by the Institute of Mathematical Statistics (http://www.imstat.org
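The gain and loss dynamics described in this abstract can be illustrated with a minimal sketch. It is not the authors' method: it ignores the Kingman coalescent entirely and follows a single ancestral line, on which genes are gained at a constant rate and each gene present is lost independently at a fixed rate. The function name `simulate_gene_count` and the parameters `gain_rate` and `loss_rate` are hypothetical and do not come from the paper.

```python
import random


def simulate_gene_count(gain_rate, loss_rate, t_max, seed=0):
    """Time-averaged number of genes along one ancestral line.

    Assumptions (illustrative only, not taken from the paper):
    - genes are gained at constant rate `gain_rate` (must be > 0);
    - each gene present is lost independently at rate `loss_rate`,
      so the total loss rate is proportional to the gene count;
    - the line starts with no genes at time 0.
    """
    rng = random.Random(seed)
    t = 0.0
    n_genes = 0
    area = 0.0  # integral of the gene count over time

    while True:
        total_rate = gain_rate + loss_rate * n_genes
        wait = rng.expovariate(total_rate)  # time until the next event
        if t + wait >= t_max:
            area += n_genes * (t_max - t)
            break
        area += n_genes * wait
        t += wait
        # Choose the event type in proportion to its rate.
        if rng.random() < gain_rate / total_rate:
            n_genes += 1
        else:
            n_genes -= 1

    return area / t_max


if __name__ == "__main__":
    print(simulate_gene_count(gain_rate=10.0, loss_rate=1.0, t_max=2000.0, seed=1))
```

Under these assumptions the gene count is an immigration-death process, so its long-run average approaches `gain_rate / loss_rate`. The statistics listed in the abstract concern a sample of several individuals related by a genealogy, which this single-line sketch does not attempt to reproduce.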

arXiv.org e-Print Archive

Crossref

Horizontal gene transfer-mediated bacterial strain variation affects host fitness in Drosophila

Author: Baumdicker F.
Kuenzel S.
Schweiger P.
Staubach F.
Wang Y.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021

How microbes affect host fitness and environmental adaptation has become a fundamental research question in evolutionary biology. To better understand the role of microbial genomic variation for host fitness, we tested for associations of bacterial genomic variation and Drosophila melanogaster offspring number in a microbial Genome Wide Association Study (GWAS)

MPG.PuRe

Prokaryote genome fluidity is dependent on effective population size

Author: A Knöppel
AO Kislyuk
B Haegeman
Elze Hesse
F Baumdicker
H Ochman
H Tettelin
HP Narra
JP Gogarten
M Lynch
M Touchon
M Vos
MF Polz
Michiel Vos
Nadia Andrea Andreani
RW Nowell
S Wielgoss
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/03/2017

Many prokaryote species are known to have fluid genomes, with different strains varying markedly in accessory gene content through the combined action of gene loss, gene gain via lateral transfer, as well as gene duplication. However, the evolutionary forces determining genome fluidity are not yet well understood. We here for the first time systematically analyse the degree to which this distinctive genomic feature differs between bacterial species. We find that genome fluidity is positively correlated with synonymous nucleotide diversity of the core genome, a measure of effective population size Ne. No effects of genome size, phylogeny or homologous recombination rate on genome fluidity were found. Our findings are consistent with a scenario where accessory gene content turnover is for a large part dictated by neutral evolution

University of Lincoln Institutional Repository

Crossref

Open Research Exeter

Efficient ancestry and mutation simulation with msprime 1.0

Author: Baumdicker Franz
Bisschop Gertjan
Eldon Bjarki
Ellerman Castedo E.
Galloway Jared G.
Gladstein Ariella L.
Goldstein Daniel
Gorjanc Gregor
Gower Graham
Gravel Simon
Guo Bing
Jeffery Ben
Kelleher Jerome
Kern Andrew D.
Koskela Jere
Kretzschmar Warren W.
Lohse Konrad
Matschiner Michael
Nelson Dominic
Pope Nathaniel S.
Quinto-Cortés Consuelo D.
Ragsdale Aaron P.
Ralph Peter L.
Rodrigues Murillo F.
Saunack Kumar
Sellinger Thibaut
Thornton Kevin
Tsambos Georgia
van Kemenade Hugo
Wohns Anthony W.
Wong H. Yan
Zhu Sha
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 01/09/2021

Edinburgh Research Explorer

Efficient ancestry and mutation simulation with msprime 1.0

Author: Baumdicker Franz
Bisschop Gertjan
Eldon Bjarki
Ellerman Castedo E.
Galloway Jared G.
Gladstein Ariella L.
Goldstein Daniel
Gorjanc Gregor
Gower Graham
Gravel Simon
Guo Bing
Jeffery Ben
Kelleher Jerome
Kern Andrew D.
Koskela Jere
Kretzschmar Warren W.
Lohse Konrad
Matschiner Michael
Nelson Dominic
Pope Nathaniel S.
Quinto-Cortés Consuelo D.
Ragsdale Aaron P.
Ralph Peter L.
Rodrigues Murillo F.
Saunack Kumar
Sellinger Thibaut
Thornton Kevin
Tsambos Georgia
van Kemenade Hugo
Wohns Anthony W.
Wong H. Yan
Zhu Sha
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/09/2021

Stochastic simulation is a key tool in population genetics, since the models involved are often analytically intractable and simulation is usually the only way of obtaining ground-truth data to evaluate inferences. Because of this, a large number of specialized simulation programs have been developed, each filling a particular niche, but with largely overlapping functionality and a substantial duplication of effort. Here, we introduce msprime version 1.0, which efficiently implements ancestry and mutation simulations based on the succinct tree sequence data structure and the tskit library. We summarize msprime’s many features, and show that its performance is excellent, often many times faster and more memory efficient than specialized alternatives. These high-performance features have been thoroughly tested and validated, and built using a collaborative, open source development model, which reduces duplication of effort and promotes software quality via community engagement

Copenhagen University Research Information System

PubMed Central

Edinburgh Research Explorer

eScholarship - University of California

Warwick Research Archives Portal Repository

Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations

Simulation is a key tool in population genetics for both methods development and empirical research, but producing simulations that recapitulate the main features of genomic datasets remains a major obstacle. Today, more realistic simulations are possible thanks to large increases in the quantity and quality of available genetic data, and the sophistication of inference and simulation software. However, implementing these simulations still requires substantial time and specialized knowledge. These challenges are especially pronounced for simulating genomes for species that are not well-studied, since it is not always clear what information is required to produce simulations with a level of realism sufficient to confidently answer a given question. The community-developed framework stdpopsim seeks to lower this barrier by facilitating the simulation of complex population genetic models using up-to-date information. The initial version of stdpopsim focused on establishing this framework using six well-characterized model species (Adrion et al., 2020). Here, we report on major improvements made in the new release of stdpopsim (version 0.2), which includes a significant expansion of the species catalog and substantial additions to simulation capabilities. Features added to improve the realism of the simulated genomes include non-crossover recombination and provision of species-specific genomic annotations. Through community-driven efforts, we expanded the number of species in the catalog more than threefold and broadened coverage across the tree of life. During the process of expanding the catalog, we have identified common sticking points and developed the best practices for setting up genome-scale simulations. We describe the input data required for generating a realistic simulation, suggest good practices for obtaining the relevant information from the literature, and discuss common pitfalls and major considerations. 
These improvements to stdpopsim aim to further promote the use of realistic whole-genome population genetic simulations, especially in non-model organisms, making them available, transparent, and accessible to everyone

Publikationer från Uppsala Universitet

Edinburgh Research Explorer

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Frequency-dependent selection in vaccine-associated pneumococcal population dynamics

Author: A Pikis
AE Lobkovsky
AJH Cremers
AKC Wong
B Haegeman
B Henriques-Normark
B Kerr
BJ Shapiro
BR Levin
C Bogaardt
C Camacho
C Chewapreecha
D Hyatt
DM Kristensen
DR Zerbino
EL Miller
ER Watkins
F Bagnoli
F Baumdicker
F Cohan
FM Cohan
FM Stewart
G Regev-Yochay
H Goossens
H Li
H Rinta-Kokko
Health Protection Agency COVER programme
J Lintusaari
JE Stajich
JO McInerney
JP Maskell
JP Nuorti
JS Hogg
K Katoh
K Lagesen
L Cheng
M Lipsitch
MN Price
N Takeuchi
NJ Croucher
OX Cordero
P Danecek
P Dixon
P Turner
R Der
R Durrett
RA Fisher
RA Gladstone
RE Collins
S Cobey
S Dawid
S Gupta
S Lehtinen
S Wright
SE Hoover
SS Huang
T Carver
TM Lowe
WJ Kent
Y Haasum
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017

Many bacterial species are composed of multiple lineages distinguished by extensive variation in gene content. These often cocirculate in the same habitat, but the evolutionary and ecological processes that shape these complex populations are poorly understood. Addressing these questions is particularly important for Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen, because the changes in population structure associated with the recent introduction of partial-coverage vaccines have substantially reduced pneumococcal disease. Here we show that pneumococcal lineages from multiple populations each have a distinct combination of intermediate-frequency genes. Functional analysis suggested that these loci may be subject to negative frequency-dependent selection (NFDS) through interactions with other bacteria, hosts or mobile elements. Correspondingly, these genes had similar frequencies in four populations with dissimilar lineage compositions. These frequencies were maintained following substantial alterations in lineage prevalences once vaccination programmes began. Fitting a multilocus NFDS model of post-vaccine population dynamics to three genomic datasets using Approximate Bayesian Computation generated reproducible estimates of the influence of NFDS on pneumococcal evolution, the strength of which varied between loci. Simulations replicated the stable frequency of lineages unperturbed by vaccination, patterns of serotype switching and clonal replacement. This framework highlights how bacterial ecology affects the impact of clinical interventions. Accessory loci are shown to have similar frequencies in diverse Streptococcus pneumoniae populations, suggesting negative frequency-dependent selection drives post-vaccination population restructuring

Crossref

Harvard University - DASH

Edinburgh Research Explorer

Oxford University Research Archive

Spiral - Imperial College Digital Repository

The Turbulent Network Dynamics of Microbial Evolution and the Statistical Tree of Life

Author: AE Lobkovsky
AS Lang
B Snel
BG Mirkin
C Darwin
C Johnston
CR Woese
D Charlesworth
E Bapteste
E Bosi
Eugene V. Koonin
EV Koonin
F Baumdicker
F Bushman
FD Ciccarelli
H Tettelin
HJ Muller
JM Allen
JO Andersson
JP Claverys
LD McDaniel
M Csuros
M Land
M Lynch
MA O’Malley
N Takeuchi
NT Perna
P Puigbo
RE Collins
S Maslov
S Nelson-Sathi
S Ohno
T Dagan
T Dobzhansky
TJ Treangen
V Kunin
V Merhej
WF Doolittle
YI Wolf
Publication venue: 'Springer Science and Business Media LLC'

Crossref

panX: pan-genome analysis and exploration

Author: Baumdicker F.
Ding W.
Neher R.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2018

Horizontal transfer, gene loss, and duplication result in dynamic bacterial genomes shaped by a complex mixture of different modes of evolution. Closely related strains can differ in the presence or absence of many genes, and the total number of distinct genes found in a set of related isolates (the pan-genome) is often many times larger than the genome of individual isolates. We have developed a pipeline that efficiently identifies orthologous gene clusters in the pan-genome. This pipeline is coupled to a powerful yet easy-to-use web-based visualization for interactive exploration of the pan-genome. The visualization consists of connected components that allow rapid filtering and searching of genes and inspection of their evolutionary history. For each gene cluster, panX displays an alignment and a phylogenetic tree, maps mutations within that cluster to the branches of the tree, and infers gain and loss of genes on the core-genome phylogeny. PanX is available at pangenome.de. Custom pan-genomes can be visualized either using a web server or by serving panX locally as a browser-based application

MPG.PuRe

Efficient ancestry and mutation simulation with msprime 1.0

Author: Baumdicker F
Bisschop G
Goldstein D
Jeffery B
Kelleher J
Wohns AW
Wong Y
Zhu S
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2021

Oxford University Research Archive