1,967 research outputs found
DNA barcoding reveals the coral “laboratory-rat”, Stylophora pistillata encompasses multiple identities
Stylophora pistillata is a widely used coral “lab-rat” species with highly variable morphology and a broad biogeographic range (Red Sea to western central Pacific). Here we show, by analysing Cytochorme Oxidase I sequences, from 241 samples across this range, that this taxon in fact comprises four deeply divergent clades corresponding to the Pacific-Western Australia, Chagos-Madagascar-South Africa, Gulf of Aden-Zanzibar-Madagascar, and Red Sea-Persian/Arabian Gulf-Kenya. On the basis of the fossil record of Stylophora, these four clades diverged from one another 51.5-29.6 Mya, i.e., long before the closure of the Tethyan connection between the tropical Indo-West Pacific and Atlantic in the early Miocene (16–24 Mya) and should be recognised as four distinct species. These findings have implications for comparative ecological and/or physiological studies carried out using Stylophora pistillata as a model species, and highlight the fact that phenotypic plasticity, thought to be common in scleractinian corals, can mask significant genetic variation
Improving phylogeny reconstruction at the strain level using peptidome datasets
Typical bacterial strain differentiation methods are often challenged by high genetic similarity between strains. To address this problem, we introduce a novel in silico peptide fingerprinting method based on conventional wet-lab protocols that enables the identification of potential strain-specific peptides. These can be further investigated using in vitro approaches, laying a foundation for the development of biomarker detection and application-specific methods. This novel method aims at reducing large amounts of comparative peptide data to binary matrices while maintaining a high phylogenetic resolution. The underlying case study concerns the Bacillus cereus group, namely the differentiation of Bacillus thuringiensis, Bacillus anthracis and Bacillus cereus strains. Results show that trees based on cytoplasmic and extracellular peptidomes are only marginally in conflict with those based on whole proteomes, as inferred by the established Genome-BLAST Distance Phylogeny (GBDP) method. Hence, these results indicate that the two approaches can most likely be used complementarily even in other organismal groups. The obtained results confirm previous reports about the misclassification of many strains within the B. cereus group. Moreover, our method was able to separate the B. anthracis strains with high resolution, similarly to the GBDP results as benchmarked via Bayesian inference and both Maximum Likelihood and Maximum Parsimony. In addition to the presented phylogenomic applications, whole-peptide fingerprinting might also become a valuable complementary technique to digital DNA-DNA hybridization, notably for bacterial classification at the species and subspecies level in the future.This research was funded by Grant AGL2013-44039-R from the Spanish “Plan Estatal de I+D+I”, and by Grant EM2014/046 from the “Plan Galego de investigación, innovación e crecemento 2011-2015”. BS was recipient of a Ramón y Cajal postdoctoral contractfrom the Spanish Ministry of Economyand Competitiveness. This work was also partially funded by the [14VI05] Contract-Programme from the University of Vigo and the Agrupamento INBIOMED from DXPCTSUG-FEDER unha maneira de facer Europa (2012/273).The research leading to these results has also received funding from the European Union’s Seventh Framework Programme FP7/REGPOT-2012-2013.1 under grant agreement n˚ 316265, BIOCAPS. This document reflects only the authors’ views and the European Union is not liable for any use that may be made of the information contained herein. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript
BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics
© 2019 The Author(s). BEAGLE is a high-performance likelihood-calculation library for phylogenetic inference. The BEAGLE library defines a simple, but flexible, application programming interface (API), and includes a collection of efficient implementations for calculation under a variety of evolutionary models on different hardware devices. The library has been integrated into recent versions of popular phylogenetics software packages including BEAST and MrBayes and has been widely used across a diverse range of evolutionary studies. Here, we present BEAGLE 3 with new parallel implementations, increased performance for challenging data sets, improved scalability, and better usability. We have added new OpenCL and central processing unit-threaded implementations to the library, allowing the effective utilization of a wider range of modern hardware. Further, we have extended the API and library to support concurrent computation of independent partial likelihood arrays, for increased performance of nucleotide-model analyses with greater flexibility of data partitioning. For better scalability and usability, we have improved how phylogenetic software packages use BEAGLE in multi-GPU (graphics processing unit) and cluster environments, and introduced an automated method to select the fastest device given the data set, evolutionary model, and hardware. For application developers who wish to integrate the library, we also have developed an online tutorial. To evaluate the effect of the improvements, we ran a variety of benchmarks on state-of-the-art hardware. For a partitioned exemplar analysis, we observe run-time performance improvements as high as 5.9-fold over our previous GPU implementation. BEAGLE 3 is free, open-source software licensed under the Lesser GPL and available at https://beagle-dev.github.io
Long-Branch Attraction Bias and Inconsistency in Bayesian Phylogenetics
Bayesian inference (BI) of phylogenetic relationships uses the same probabilistic models of evolution as its precursor maximum likelihood (ML), so BI has generally been assumed to share ML's desirable statistical properties, such as largely unbiased inference of topology given an accurate model and increasingly reliable inferences as the amount of data increases. Here we show that BI, unlike ML, is biased in favor of topologies that group long branches together, even when the true model and prior distributions of evolutionary parameters over a group of phylogenies are known. Using experimental simulation studies and numerical and mathematical analyses, we show that this bias becomes more severe as more data are analyzed, causing BI to infer an incorrect tree as the maximum a posteriori phylogeny with asymptotically high support as sequence length approaches infinity. BI's long branch attraction bias is relatively weak when the true model is simple but becomes pronounced when sequence sites evolve heterogeneously, even when this complexity is incorporated in the model. This bias—which is apparent under both controlled simulation conditions and in analyses of empirical sequence data—also makes BI less efficient and less robust to the use of an incorrect evolutionary model than ML. Surprisingly, BI's bias is caused by one of the method's stated advantages—that it incorporates uncertainty about branch lengths by integrating over a distribution of possible values instead of estimating them from the data, as ML does. Our findings suggest that trees inferred using BI should be interpreted with caution and that ML may be a more reliable framework for modern phylogenetic analysis
Selective Constraints on Amino Acids Estimated by a Mechanistic Codon Substitution Model with Multiple Nucleotide Changes
Empirical substitution matrices represent the average tendencies of
substitutions over various protein families by sacrificing gene-level
resolution. We develop a codon-based model, in which mutational tendencies of
codon, a genetic code, and the strength of selective constraints against amino
acid replacements can be tailored to a given gene. First, selective constraints
averaged over proteins are estimated by maximizing the likelihood of each 1-PAM
matrix of empirical amino acid (JTT, WAG, and LG) and codon (KHG) substitution
matrices. Then, selective constraints specific to given proteins are
approximated as a linear function of those estimated from the empirical
substitution matrices.
Akaike information criterion (AIC) values indicate that a model allowing
multiple nucleotide changes fits the empirical substitution matrices
significantly better. Also, the ML estimates of transition-transversion bias
obtained from these empirical matrices are not so large as previously
estimated. The selective constraints are characteristic of proteins rather than
species. However, their relative strengths among amino acid pairs can be
approximated not to depend very much on protein families but amino acid pairs,
because the present model, in which selective constraints are approximated to
be a linear function of those estimated from the JTT/WAG/LG/KHG matrices, can
provide a good fit to other empirical substitution matrices including cpREV for
chloroplast proteins and mtREV for vertebrate mitochondrial proteins.
The present codon-based model with the ML estimates of selective constraints
and with adjustable mutation rates of nucleotide would be useful as a simple
substitution model in ML and Bayesian inferences of molecular phylogenetic
trees, and enables us to obtain biologically meaningful information at both
nucleotide and amino acid levels from codon and protein sequences.Comment: Table 9 in this article includes corrections for errata in the Table
9 published in 10.1371/journal.pone.0017244. Supporting information is
attached at the end of the article, and a computer-readable dataset of the ML
estimates of selective constraints is available from
10.1371/journal.pone.001724
A New Species of River Dolphin from Brazil or:How Little Do We Know Our Biodiversity
True river dolphins are some of the rarest and most endangered of all vertebrates. They comprise relict evolutionary lineages of high taxonomic distinctness and conservation value, but are afforded little protection. We report the discovery of a new species of a river dolphin from the Araguaia River basin of Brazil, the first such discovery in nearly 100 years. The species is diagnosable by a series of molecular and morphological characters and diverged from its Amazonian sister taxon 2.08 million years ago. The estimated time of divergence corresponds to the separation of the Araguaia-Tocantins basin from the Amazon basin. This discovery highlights the immensity of the deficit in our knowledge of Neotropical biodiversity, as well as vulnerability of biodiversity to anthropogenic actions in an increasingly threatened landscape. We anticipate that this study will provide an impetus for the taxonomic and conservation reanalysis of other taxa shared between the Araguaia and Amazon aquatic ecosystems, as well as stimulate historical biogeographical analyses of the two basins
Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple Sequence Alignment
Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique
in bioinformatics used to infer related residues among biological sequences.
Thus alignment accuracy is crucial to a vast range of analyses, often in ways
difficult to assess in those analyses. To compare the performance of different
aligners and help detect systematic errors in alignments, a number of
benchmarking strategies have been pursued. Here we present an overview of the
main strategies--based on simulation, consistency, protein structure, and
phylogeny--and discuss their different advantages and associated risks. We
outline a set of desirable characteristics for effective benchmarking, and
evaluate each strategy in light of them. We conclude that there is currently no
universally applicable means of benchmarking MSA, and that developers and users
of alignment tools should base their choice of benchmark depending on the
context of application--with a keen awareness of the assumptions underlying
each benchmarking strategy.Comment: Revie
Structural biology and phylogenetic estimation
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/62633/1/388527a0.pd
A broad distribution of the alternative oxidase in microsporidian parasites
Microsporidia are a group of obligate intracellular parasitic eukaryotes that were considered to be amitochondriate until the recent discovery of highly reduced mitochondrial organelles called mitosomes. Analysis of the complete genome of Encephalitozoon cuniculi revealed a highly reduced set of proteins in the organelle, mostly related to the assembly of ironsulphur clusters. Oxidative phosphorylation and the Krebs cycle proteins were absent, in keeping with the notion that the microsporidia and their mitosomes are anaerobic, as is the case for other mitosome bearing eukaryotes, such as Giardia. Here we provide evidence opening the possibility that mitosomes in a number of microsporidian lineages are not completely anaerobic. Specifically, we have identified and characterized a gene encoding the alternative oxidase (AOX), a typically mitochondrial terminal oxidase in eukaryotes, in the genomes of several distantly related microsporidian species, even though this gene is absent from the complete genome of E. cuniculi. In order to confirm that these genes encode functional proteins, AOX genes from both A. locustae and T. hominis were over-expressed in E. coli and AOX activity measured spectrophotometrically using ubiquinol-1 (UQ-1) as substrate. Both A. locustae and T. hominis AOX proteins reduced UQ-1 in a cyanide and antimycin-resistant manner that was sensitive to ascofuranone, a potent inhibitor of the trypanosomal AOX. The physiological role of AOX microsporidia may be to reoxidise reducing equivalents produced by glycolysis, in a manner comparable to that observed in trypanosome
- …