Search CORE

63 research outputs found

Bayesian inference of population size history from multiple loci

Author: Drummond Alexei J
Heled Joseph
Publication venue: BioMed Central
Publication date: 01/10/2008
Field of study

Abstract Background Effective population size (<it>N</it><it>e</it>) is related to genetic variability and is a basic parameter in many models of population genetics. A number of methods for inferring current and past population sizes from genetic data have been developed since JFC Kingman introduced the n-coalescent in 1982. Here we present the Extended Bayesian Skyline Plot, a non-parametric Bayesian Markov chain Monte Carlo algorithm that extends a previous coalescent-based method in several ways, including the ability to analyze multiple loci. Results Through extensive simulations we show the accuracy and limitations of inferring population size as a function of the amount of data, including recovering information about evolutionary bottlenecks. We also analyzed two real data sets to demonstrate the behavior of the new method; a single gene Hepatitis C virus data set sampled from Egypt and a 10 locus <it>Drosophila ananassae </it>data set representing 16 different populations. Conclusion The results demonstrate the essential role of multiple loci in recovering population size dynamics. Multi-locus data from a small number of individuals can precisely recover past bottlenecks in population size which can not be characterized by analysis of a single locus. We also demonstrate that sequence data quality is important because even moderate levels of sequencing errors result in a considerable decrease in estimation accuracy for realistic levels of population genetic variability.</p

Directory of Open Access Journals

PubMed Central

Calibrated Tree Priors for Relaxed Phylogenetics and Divergence Time Estimation

Author: Alexei J. Drummond
Drummond
Drummond
Drummond
Drummond
Gernhard
Griffiths
Joseph Heled
Kingman
Phillips
Rannala
Sithaldeen
Stadler
Stadler
Publication venue
Publication date: 29/03/2011
Field of study

The use of fossil evidence to calibrate divergence time estimation has a long history. More recently Bayesian MCMC has become the dominant method of divergence time estimation and fossil evidence has been re-interpreted as the specification of prior distributions on the divergence times of calibration nodes. These so-called "soft calibrations" have become widely used but the statistical properties of calibrated tree priors in a Bayesian setting has not been carefully investigated. Here we clarify that calibration densities, such as those defined in BEAST 1.5, do not represent the marginal prior distribution of the calibration node. We illustrate this with a number of analytical results on small trees. We also describe an alternative construction for a calibrated Yule prior on trees that allows direct specification of the marginal prior distribution of the calibrated divergence time, with or without the restriction of monophyly. This method requires the computation of the Yule prior conditional on the height of the divergence being calibrated. Unfortunately, a practical solution for multiple calibrations remains elusive. Our results suggest that direct estimation of the prior induced by specifying multiple calibration densities should be a prerequisite of any divergence time dating analysis

arXiv.org e-Print Archive

Crossref

PubMed Central

CALIBRATING DIVERGENCE TIMES ON SPECIES TREES VERSUS GENE TREES: IMPLICATIONS FOR SPECIATION HISTORY OF APHELOCOMA JAYS

Author: Delaney Kathleen S.
Heled Joseph
Knowles L. Lacey
Mccormack John E.
Peterson A. Townsend
Publication venue: 'Wiley'
Publication date: 01/01/2011
Field of study

Estimates of the timing of divergence are central to testing the underlying causes of speciation. Relaxed molecular clocks and fossil calibration have improved these estimates; however, these advances are implemented in the context of gene trees, which can overestimate divergence times. Here we couple recent innovations for dating speciation events with the analytical power of species trees, where multilocus data are considered in a coalescent context. Divergence times are estimated in the bird genus Aphelocoma to test whether speciation in these jays coincided with mountain uplift or glacial cycles. Gene trees and species trees show general agreement that diversification began in the Miocene amid mountain uplift. However, dates from the multilocus species tree are more recent, occurring predominately in the Pleistocene, consistent with theory that divergence times can be significantly overestimated with gene-tree based approaches that do not correct for genetic divergence that predates speciation. In addition to coalescent stochasticity, Haldane's rule could account for some differences in timing estimates between mitochondrial DNA and nuclear genes. By incorporating a fossil calibration applied to the species tree, in addition to the process of gene lineage coalescence, the present approach provides a more biologically realistic framework for dating speciation events, and hence for testing the links between diversification and specific biogeographic and geologic events.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/79292/1/j.1558-5646.2010.01097.x.pd

Deep Blue Documents at the University of Michigan

How Many Subpopulations is Too Many? Exponential Lower Bounds for Inferring Population Histories

Author: A Bhaskar
A Bhaskar
A Drummond
EJ Candès
FL Nazarov
GA McVean
H Li
J Heled
J Kim
J Terhorst
J Terhorst
L Excoffier
M Kimura
M Nordborg
P Turán
R Nielsen
RA Blythe
S Myers
S Schiffels
S Sheehan
TA Joseph
W Gautschi
Y Hua
Publication venue
Publication date: 08/05/2019
Field of study

Reconstruction of population histories is a central problem in population genetics. Existing coalescent-based methods, like the seminal work of Li and Durbin (Nature, 2011), attempt to solve this problem using sequence data but have no rigorous guarantees. Determining the amount of data needed to correctly reconstruct population histories is a major challenge. Using a variety of tools from information theory, the theory of extremal polynomials, and approximation theory, we prove new sharp information-theoretic lower bounds on the problem of reconstructing population structure -- the history of multiple subpopulations that merge, split and change sizes over time. Our lower bounds are exponential in the number of subpopulations, even when reconstructing recent histories. We demonstrate the sharpness of our lower bounds by providing algorithms for distinguishing and learning population histories with matching dependence on the number of subpopulations. Along the way and of independent interest, we essentially determine the optimal number of samples needed to learn an exponential mixture distribution information-theoretically, proving the upper bound by analyzing natural (and efficient) algorithms for this problem.Comment: 38 pages, Appeared in RECOMB 201

arXiv.org e-Print Archive

DSpace@MIT

Crossref

BEAST 2:A Software Platform for Bayesian Evolutionary Analysis

Author: A Alekseyenko
AJ Drummond
AJ Drummond
AJ Drummond
AJ Drummond
Alexei J. Drummond
Andreas Prlic
Andrew Rambaut
CH Wu
Chieh-Hsi Wu
D Ayres
D Bryant
D Kühnert
Denise Kühnert
Dong Xie
G Baele
J Felsenstein
J Heled
J Heled
Joseph Heled
K Hayasaka
K Tamura
L Kuo
M Hasegawa
M Kimura
M Suchard
Marc A. Suchard
P Beerli
P Beerli
P Beerli
P Lemey
P Lemey
R Bouckaert
Remco Bouckaert
RR Bouckaert
S Tavaré
T Stadler
T van de Laar
T Vaughan
T Vaughan
Tim Vaughan
V Minin
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/04/2014
Field of study

We present a new open source, extensible and flexible software platform for Bayesian evolutionary analysis called BEAST 2. This software platform is a re-design of the popular BEAST 1 platform to correct structural deficiencies that became evident as the BEAST 1 software evolved. Key among those deficiencies was the lack of post-deployment extensibility. BEAST 2 now has a fully developed package management system that allows third party developers to write additional functionality that can be directly installed to the BEAST 2 analysis platform via a package manager without requiring a new software release of the platform. This package architecture is showcased with a number of recently published new models encompassing birth-death-sampling tree priors, phylodynamics and model averaging for substitution models and site partitioning. A second major improvement is the ability to read/write the entire state of the MCMC chain to/from disk allowing it to be easily shared between multiple instances of the BEAST software. This facilitates checkpointing and better support for multi-processor and high-end computing extensions. Finally, the functionality in new packages can be easily added to the user interface (BEAUti 2) by a simple XML template-based mechanism because BEAST 2 has been re-designed to provide greater integration between the analysis engine and the user interface so that, for example BEAST and BEAUti use exactly the same XML file format

Public Library of Science (PLOS)

Repository for Publications and Research Data

Crossref

Southampton (e-Prints Soton)

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

eScholarship - University of California

FigShare

The Probability of a Gene Tree Topology within a Phylogenetic Network with Applications to Hybridization Detection

Author: A Rokas
AD Leaché
B Carstens
B Holland
B Rannala
C Ané
C Ané
C Ané
C Meng
C Than
C Than
C Than
CR Linder
CV Than
D Huson
D Posada
D Ruths
D Swofford
DA Pollard
DL Swofford
ES Allman
ES Allman
EW Bloomquist
G Schwarz
H Akaike
H Huang
H Lanier
J Heled
J Mallet
J Mallet
J Wakeley
James H. Degnan
JH Degnan
JH Degnan
JH Degnan
JJ Doyle
Joseph Felsenstein
JP Huelsenbeck
K Burnham
L Liu
L Liu
L Nakhleh
LL Knowles
LS Kubatko
LS Kubatko
LS Kubatko
Luay Nakhleh
M DeGiorgio
M Nei
M Slatkin
ML Arnold
NA Rosenberg
SM Ross
SV Edwards
SV Edwards
TC Bruen
W Maddison
Y Wang
Y Wu
Y Yu
Yun Yu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Gene tree topologies have proven a powerful data source for various tasks, including species tree inference and species delimitation. Consequently, methods for computing probabilities of gene trees within species trees have been developed and widely used in probabilistic inference frameworks. All these methods assume an underlying multispecies coalescent model. However, when reticulate evolutionary events such as hybridization occur, these methods are inadequate, as they do not account for such events. Methods that account for both hybridization and deep coalescence in computing the probability of a gene tree topology currently exist for very limited cases. However, no such methods exist for general cases, owing primarily to the fact that it is currently unknown how to compute the probability of a gene tree topology within the branches of a phylogenetic network. Here we present a novel method for computing the probability of gene tree topologies on phylogenetic networks and demonstrate its application to the inference of hybridization in the presence of incomplete lineage sorting. We reanalyze a Saccharomyces species data set for which multiple analyses had converged on a species tree candidate. Using our method, though, we show that an evolutionary hypothesis involving hybridization in this group has better support than one of strict divergence. A similar reanalysis on a group of three Drosophila species shows that the data is consistent with hybridization. Further, using extensive simulation studies, we demonstrate the power of gene tree topologies at obtaining accurate estimates of branch lengths and hybridization probabilities of a given phylogenetic network. Finally, we discuss identifiability issues with detecting hybridization, particularly in cases that involve extinction or incomplete sampling of taxa

Public Library of Science (PLOS)

Crossref

UC Research Repository

Directory of Open Access Journals

PubMed Central

DSpace at Rice University

FigShare

Evolution in Australasian Mangrove Forests: Multilocus Phylogenetic Analysis of the Gerygone Warblers (Aves: Acanthizidae)

Author: A Flórez-Rodríguez
A Rambaut
A Toon
AJ Drummond
AJ Drummond
C Li
CE Filardi
CR Primmer
D Posada
D Zwickl
E Pasquet
F Ronquist
FE Rheindt
FK Barker
H Shimodaira
HK Voris
IJ Lovette
J Fjeldså
J Ford
J Ford
J Ford
J Heled
JA McGuire
JA Nicholls
JA Norman
JD Thompson
JH Degnan
JL Gardner
JL Parra
JT Weir
JY Lee
K Loynes
KA Jønsson
KA Jønsson
L Christidis
L Liu
L Liu
L Liu
L Liu
Leo Joseph
M Byrne
M Marini
MD Sorenson
MD Sorenson
N Backström
R Schodde
R Schodde
R Schodde
R Schodde
R Schodde
RA Noske
RE Johnstone
RE Johnstone
RG Moyle
Robert C. Fleischer
S Fregin
SJ Hackett
SV Edwards
SV Edwards
TF Wright
WB Jennings
WP Maddison
Árpád S. Nyári
ÁS Nyári
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The mangrove forests of Australasia have many endemic bird species but their evolution and radiation in those habitats has been little studied. One genus with several mangrove specialist species is Gerygone (Passeriformes: Acanthizidae). The phylogeny of the Acanthizidae is reasonably well understood but limited taxon sampling for Gerygone has constrained understanding of its evolution and historical biogeography in mangroves. Here we report on a phylogenetic analysis of Gerygone based on comprehensive taxon sampling and a multilocus dataset of thirteen loci spread across the avian genome (eleven nuclear and two mitochondrial loci). Since Gerygone includes three species restricted to Australia's coastal mangrove forests, we particularly sought to understand the biogeography of their evolution in that ecosystem. Analyses of individual loci, as well as of a concatenated dataset drawn from previous molecular studies indicates that the genus as currently defined is not monophyletic, and that the Grey Gerygone (G. cinerea) from New Guinea should be transferred to the genus Acanthiza. The multilocus approach has permitted the nuanced view of the group's evolution into mangrove ecosystems having occurred on multiple occasions, in three non-overlapping time frames, most likely first by the G. magnirostris lineage, and subsequently followed by those of G. tenebrosa and G. levigaster

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

KU ScholarWorks

PubMed Central

FigShare