162 research outputs found

    MTML-msBayes: Approximate Bayesian comparative phylogeographic inference from multiple taxa and multiple loci with rate heterogeneity

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>MTML-msBayes uses hierarchical approximate Bayesian computation (HABC) under a coalescent model to infer temporal patterns of divergence and gene flow across codistributed taxon-pairs. Under a model of multiple codistributed taxa that diverge into taxon-pairs with subsequent gene flow or isolation, one can estimate hyper-parameters that quantify the mean and variability in divergence times or test models of migration and isolation. The software uses multi-locus DNA sequence data collected from multiple taxon-pairs and allows variation across taxa in demographic parameters as well as heterogeneity in DNA mutation rates across loci. The method also allows a flexible sampling scheme: different numbers of loci of varying length can be sampled from different taxon-pairs.</p> <p>Results</p> <p>Simulation tests reveal increasing power with increasing numbers of loci when attempting to distinguish temporal congruence from incongruence in divergence times across taxon-pairs. These results are robust to DNA mutation rate heterogeneity. Estimating mean divergence times and testing simultaneous divergence was less accurate with migration, but improved if one specified the correct migration model. Simulation validation tests demonstrated that one can detect the correct migration or isolation model with high probability, and that this HABC model testing procedure was greatly improved by incorporating a summary statistic originally developed for this task (Wakeley's <it>Ψ<sub>W</sub></it>). The method is applied to an empirical data set of three Australian avian taxon-pairs and a result of simultaneous divergence with some subsequent gene flow is inferred.</p> <p>Conclusions</p> <p>To retain flexibility and compatibility with existing bioinformatics tools, MTML-msBayes is a pipeline software package consisting of Perl, C and R programs that are executed via the command line. Source code and binaries are available for download at <url>http://msbayes.sourceforge.net/</url> under an open source license (GNU Public License).</p

    Testing comparative phylogeographic models of marine vicariance and dispersal using a hierarchical Bayesian approach

    Get PDF
    Background Marine allopatric speciation is an enigma because pelagic larval dispersal can potentially connect disjunct populations thereby preventing reproductive and morphological divergence. Here we present a new hierarchical approximate Bayesian computation model (HABC) that tests two hypotheses of marine allopatric speciation: 1.) soft vicariance , where a speciation involves fragmentation of a large widespread ancestral species range that was previously connected by long distance gene flow; and 2.) peripatric colonization, where speciations in peripheral archipelagos emerge from sweepstakes colonizations from central source regions. The HABC approach analyzes all the phylogeographic datasets at once in order to make across taxon-pair inferences about biogeographic processes while explicitly allowing for uncertainty in the demographic differences within each taxon-pair. Our method uses comparative phylogeographic data that consists of single locus mtDNA sequences from multiple co-distributed taxa containing pairs of central and peripheral populations. We use the method on two comparative phylogeographic data sets consisting of cowrie gastropod endemics co-distributed in the Hawaiian (11 taxon-pairs) and Marquesan archipelagos (7 taxon-pairs). Results Given the Marquesan data, we find strong evidence of simultaneous colonization across all seven cowrie gastropod endemics co-distributed in the Marquesas. In contrast, the lower sample sizes in the Hawaiian data lead to greater uncertainty associated with the Hawaiian estimates. Although, the hyper-parameter estimates point to soft vicariance in a subset of the 11 Hawaiian taxon-pairs, the hyper-prior and hyper-posterior are too similar to make a definitive conclusion. Both results are not inconsistent with what is known about the geologic history of the archipelagos. Simulations verify that our method can successfully distinguish these two histories across a wide range of conditions given sufficient sampling. Conclusion Although soft vicariance and colonization are likely to produce similar genetic patterns when a single taxon-pair is used, our hierarchical Bayesian model can potentially detect if either history is a dominant process across co-distributed taxon-pairs. As comparative phylogeographic datasets grow to include \u3e 100 co-distributed taxon-pairs, the HABC approach will be well suited to dissect temporal patterns in community assembly and evolution, thereby providing a bridge linking comparative phylogeography with community ecology

    Contrasting quaternary histories in an ecologically divergent sister pair of low-dispersing intertidal fish (Xiphister) revealed by multilocus DNA analysis. Evolution

    Get PDF
    Abstract. Recurrent glacial advances have shaped community histories across the planet. While biogeographic responses to glaciations likely varied with latitude, the consequences for temperate marine communities histories are less clear. By coalescent analyses of multiloci DNA sequence data (mitochondrial DNA control region, ␣-enolase intron, and ␣-tropomyosin intron) collected from a low-dispersing sister pair of rocky intertidal fishes commonly found from southeastern Alaska to California (Xiphister atropurpureus and X. mucosus), we uncover two very different responses to historical glaciations. A variety of methods that include a simulation analysis, coestimates of migration and divergence times, and estimates of minimum ages of populations sampled up and down the North American Pacific coast all strongly revealed a history of range persistence in X. atropurpureus and extreme range contraction and expansion from a southern refugium in X. mucosus. Furthermore, these conclusions are not sensitive to the independent estimates of the DNA substitution rates we obtain. While gene flow and dispersal are low in both species, the widely different histories are rather likely to have arisen from ecological differences such as diet breadth, generation time, and habitat specificity. The periodic glacial epochs of the Quaternary were prolonged and occupied nearly 80% of the last 2 million years The community composition of temperate intertidal areas such as the northeastern Pacific is likely to be the result of both historical processes such as these cyclical glaciations as well as contemporary oceanographic processes and changes To this end, phylogeographic studies of this region have focused on testing two general hypotheses of Pleistocene range stability that are suggested from geological and ecological data: (1) the southern refugium hypothesis; and (2) the range persistence hypothesis. While incomplete coastal glaciation would suggest persistence (Pielou 1991), wintertime SST were much lower during these glacial period

    Detecting Concerted Demographic Response across Community Assemblages Using Hierarchical Approximate Bayesian Computation

    Full text link
    Methods that integrate population-level sampling from multiple taxa into a single community-level analysis are an essential addition to the comparative phylogeographic toolkit. Detecting how species within communities have demographically tracked each other in space and time is important for understanding the effects of future climate and landscape changes and the resulting acceleration of extinctions, biological invasions, and potential surges in adaptive evolution. Here, we present a statistical framework for such an analysis based on hierarchical approximate Bayesian computation (hABC) with the goal of detecting concerted demographic histories across an ecological assemblage. Our method combines population genetic data sets from multiple taxa into a single analysis to estimate: 1) the proportion of a community sample that demographically expanded in a temporally clustered pulse and 2) when the pulse occurred. To validate the accuracy and utility of this new approach, we use simulation cross-validation experiments and subsequently analyze an empirical data set of 32 avian populations from Australia that are hypothesized to have expanded from smaller refugia populations in the late Pleistocene. The method can accommodate data set heterogeneity such as variability in effective population size, mutation rates, and sample sizes across species and exploits the statistical strength from the simultaneous analysis of multiple species. This hABC framework used in a multitaxa demographic context can increase our understanding of the impact of historical climate change by determining what proportion of the community responded in concert or independently and can be used with a wide variety of comparative phylogeographic data sets as biota-wide DNA barcoding data sets accumulate

    Strategies for improving approximate Bayesian computation tests for synchronous diversification

    Full text link
    Background: Estimating the variability in isolation times across co-distributed taxon pairs that may have experienced the same allopatric isolating mechanism is a core goal of comparative phylogeography. The use of hierarchical Approximate Bayesian Computation (ABC) and coalescent models to infer temporal dynamics of lineage co-diversification has been a contentious topic in recent years. Key issues that remain unresolved include the choice of an appropriate prior on the number of co-divergence events (Ψ), as well as the optimal strategies for data summarization. Methods: Through simulation-based cross validation we explore the impact of the strategy for sorting summary statistics and the choice of prior on Ψ on the estimation of co-divergence variability. We also introduce a new setting (β) that can potentially improve estimation of Ψ by enforcing a minimal temporal difference between pulses of co-divergence. We apply this new method to three empirical datasets: one dataset each of co-distributed taxon pairs of Panamanian frogs and freshwater fishes, and a large set of Neotropical butterfly sister-taxon pairs. Results: We demonstrate that the choice of prior on Ψ has little impact on inference, but that sorting summary statistics yields substantially more reliable estimates of co-divergence variability despite violations of assumptions about exchangeability. We find the implementation of β improves estimation of Ψ, with improvement being most dramatic given larger numbers of taxon pairs. We find equivocal support for synchronous co-divergence for both of the Panamanian groups, but we find considerable support for asynchronous divergence among the Neotropical butterflies. Conclusions: Our simulation experiments demonstrate that using sorted summary statistics results in improved estimates of the variability in divergence times, whereas the choice of hyperprior on Ψ has negligible effect. Additionally, we demonstrate that estimating the number of pulses of co-divergence across co-distributed taxonpairs is improved by applying a flexible buffering regime over divergence times. This improves the correlation between Ψ and the true variability in isolation times and allows for more meaningful interpretation of this hyperparameter. This will allow for more accurate identification of the number of temporally distinct pulses of codivergence that generated the diversification pattern of a given regional assemblage of sister-taxon-pairs

    Behavioural response to song and genetic divergence in two subspecies of white-crowned sparrows (Zonotrichia leucophrys)

    Get PDF
    © 2016 John Wiley & Sons Ltd Divergence in sexual signals may drive reproductive isolation between lineages, but behavioural barriers can weaken in contact zones. Here, we investigate the role of song as a behavioural and genetic barrier in a contact zone between two subspecies of white-crowned sparrows (Zonotrichia leucophrys). We employed a reduced genomic data set to assess population structure and infer the history underlying divergence, gene flow and hybridization. We also measured divergence in song and tested behavioural responses to song using playback experiments within and outside the contact zone. We found that the subspecies form distinct genetic clusters, and demographic inference supported a model of secondary contact. Song phenotype, particularly length of the first note (a whistle), was a significant predictor of genetic subspecies identity and genetic distance along the hybrid zone, suggesting a close link between song and genetic divergence in this system. Individuals from both parental and admixed localities responded significantly more strongly to their own song than to the other subspecies song, supporting song as a behavioural barrier. Putative parental and admixed individuals were not significantly different in their strength of discrimination between own and other songs; however, individuals from admixed localities tended to discriminate less strongly, and this difference in discrimination strength was explained by song dissimilarity as well as genetic distance. Therefore, we find that song acts as a reproductive isolating mechanism that is potentially weakening in a contact zone between the subspecies. Our findings also support the hypothesis that intraspecific song variation can reduce gene flow between populations

    Comparative Population Genomics of African Montane Forest Mammals Support Population Persistence across a Climatic Gradient and Quaternary Climatic Cycles

    Full text link
    The Eastern Afromontane biodiversity hotspot (EABH) has the highest concentration of biodiversity in tropical Africa, yet few studies have investigated recent historical diversification processes in EABH lineages. Herein, we analyze restriction-site associated DNA-sequences (RAD-Seq) to study recent historical processes in co-distributed mouse (Hylomyscus) and shrew (Sylvisorex) species complexes, with an aim to better determine how historical paleoenvironmental processes might have contributed to the EABH’s high diversity. We analyzed complete SNP matrices of \u3e 50,000 RAD loci to delineate populations, reconstruct the history of isolation and admixture, and discover geographic patterns of genetic partitioning. These analyses demonstrate that persistently unsuitable habitat may have isolated multiple populations distributed across montane habitat islands in the Itombwe Massif and Albertine Rift to the west as well as Mt Elgon and Kenyan Highlands to the east. We detected low genetic diversity in Kenyan Highland populations of both genera, consistent with smaller historical population sizes in this region. We additionally tested predictions that Albertine Rift populations are older and more persistently isolated compared to the Kenyan Highlands. Phylogenetic analyses support greater historical isolation among Albertine Rift populations of both shrews and mice compared to the Kenyan Highlands and suggest that there are genetically isolated populations from both focal genera in the Itombwe Massif, Democratic Republic of Congo. The Albertine Rift ecoregion has the highest mammalian tropical forest species richness per unit area on earth. Our results clearly support accelerating efforts to conserve this diversity

    Sampling strategies for frequency spectrum-based population genomic inference

    Get PDF
    Background The allele frequency spectrum (AFS) consists of counts of the number of single nucleotide polymorphism (SNP) loci with derived variants present at each given frequency in a sample. Multiple approaches have recently been developed for parameter estimation and calculation of model likelihoods based on the joint AFS from two or more populations. We conducted a simulation study of one of these approaches, implemented in the Python module δaδi, to compare parameter estimation and model selection accuracy given different sample sizes under one- and two-population models. Results Our simulations included a variety of demographic models and two parameterizations that differed in the timing of events (divergence or size change). Using a number of SNPs reasonably obtained through next-generation sequencing approaches (10,000 - 50,000), accurate parameter estimates and model selection were possible for models with more ancient demographic events, even given relatively small numbers of sampled individuals. However, for recent events, larger numbers of individuals were required to achieve accuracy and precision in parameter estimates similar to that seen for models with older divergence or population size changes. We quantify i) the uncertainty in model selection, using tools from information theory, and ii) the accuracy and precision of parameter estimates, using the root mean squared error, as a function of the timing of demographic events, sample sizes used in the analysis, and complexity of the simulated models. Conclusions Here, we illustrate the utility of the genome-wide AFS for estimating demographic history and provide recommendations to guide sampling in population genomics studies that seek to draw inference from the AFS. Our results indicate that larger samples of individuals (and thus larger AFS) provide greater power for model selection and parameter estimation for more recent demographic events. Electronic supplementary material The online version of this article (doi:10.1186/s12862-014-0254-4) contains supplementary material, which is available to authorized users

    Population Genomics Reveals Seahorses (Hippocampus erectus) of the Western Mid-Atlantic Coast to Be Residents Rather than Vagrants

    Full text link
    Understanding population structure and areas of demographic persistence and transients is critical for effective species management. However, direct observational evidence to address the geographic scale and delineation of ephemeral or persistent populations for many marine fishes is limited. The Lined seahorse (Hippocampus erectus) can be commonly found in three western Atlantic zoogeographic provinces, though inhabitants of the temperate northern Virginia Province are often considered tropical vagrants that only arrive during warm seasons from the southern provinces and perish as temperatures decline. Although genetics can locate regions of historical population persistence and isolation, previous evidence of Virginia Province persistence is only provisional due to limited genetic sampling (i.e., mitochondrial DNA and five nuclear loci). To test alternative hypotheses of historical persistence versus the ephemerality of a northern Virginia Province population we used a RADseq generated dataset consisting of 11,708 single nucleotide polymorphisms (SNP) sampled from individuals collected from the eastern Gulf of Mexico to Long Island, NY. Concordant results from genomic analyses all infer three genetically divergent subpopulations, and strongly support Virginia Province inhabitants as a genetically diverged and a historically persistent ancestral gene pool. These results suggest that individuals that emerge in coastal areas during the warm season can be considered “local” and supports offshore migration during the colder months. This research demonstrates how a large number of genes sampled across a geographical range can capture the diversity of coalescent histories (across loci) while inferring population history. Moreover, these results clearly demonstrate the utility of population genomic data to infer peripheral subpopulation persistence in difficult-to-observe species
    corecore