126 research outputs found
The strength and timing of the mitochondrial bottleneck in salmon suggests a conserved mechanism in vertebrates
In most species mitochondrial DNA (mtDNA) is inherited maternally in an apparently clonal fashion, although how this is achieved remains uncertain. Population genetic studies show not only that individuals can harbor more than one type of mtDNA (heteroplasmy) but that heteroplasmy is common and widespread across a diversity of taxa. Females harboring a mixture of mtDNAs may transmit varying proportions of each mtDNA type (haplotype) to their offspring. However, mtDNA variants are also observed to segregate rapidly between generations despite the high mtDNA copy number in the oocyte, which suggests a genetic bottleneck acts during mtDNA transmission. Understanding the size and timing of this bottleneck is important for interpreting population genetic relationships and for predicting the inheritance of mtDNA based disease, but despite its importance the underlying mechanisms remain unclear. Empirical studies, restricted to mice, have shown that the mtDNA bottleneck could act either at embryogenesis, oogenesis or both. To investigate whether the size and timing of the mitochondrial bottleneck is conserved between distant vertebrates, we measured the genetic variance in mtDNA heteroplasmy at three developmental stages (female, ova and fry) in chinook salmon and applied a new mathematical model to estimate the number of segregating units (N(e)) of the mitochondrial bottleneck between each stage. Using these data we estimate values for mtDNA Ne of 88.3 for oogenesis, and 80.3 for embryogenesis. Our results confirm the presence of a mitochondrial bottleneck in fish, and show that segregation of mtDNA variation is effectively complete by the end of oogenesis. Considering the extensive differences in reproductive physiology between fish and mammals, our results suggest the mechanism underlying the mtDNA bottleneck is conserved in these distant vertebrates both in terms of it magnitude and timing. This finding may lead to improvements in our understanding of mitochondrial disorders and population interpretations using mtDNA data
Novel Distances for Dollo Data
We investigate distances on binary (presence/absence) data in the context of
a Dollo process, where a trait can only arise once on a phylogenetic tree but
may be lost many times. We introduce a novel distance, the Additive Dollo
Distance (ADD), which is consistent for data generated under a Dollo model, and
show that it has some useful theoretical properties including an intriguing
link to the LogDet distance. Simulations of Dollo data are used to compare a
number of binary distances including ADD, LogDet, Nei Li and some simple, but
to our knowledge previously unstudied, variations on common binary distances.
The simulations suggest that ADD outperforms other distances on Dollo data.
Interestingly, we found that the LogDet distance performs poorly in the context
of a Dollo process, which may have implications for its use in connection with
conditioned genome reconstruction. We apply the ADD to two Diversity Arrays
Technology (DArT) datasets, one that broadly covers Eucalyptus species and one
that focuses on the Eucalyptus series Adnataria. We also reanalyse gene family
presence/absence data on bacteria from the COG database and compare the results
to previous phylogenies estimated using the conditioned genome reconstruction
approach
Lie Markov models with purine/pyrimidine symmetry
Continuous-time Markov chains are a standard tool in phylogenetic inference.
If homogeneity is assumed, the chain is formulated by specifying
time-independent rates of substitutions between states in the chain. In
applications, there are usually extra constraints on the rates, depending on
the situation. If a model is formulated in this way, it is possible to
generalise it and allow for an inhomogeneous process, with time-dependent rates
satisfying the same constraints. It is then useful to require that there exists
a homogeneous average of this inhomogeneous process within the same model. This
leads to the definition of "Lie Markov models", which are precisely the class
of models where such an average exists. These models form Lie algebras and
hence concepts from Lie group theory are central to their derivation. In this
paper, we concentrate on applications to phylogenetics and nucleotide
evolution, and derive the complete hierarchy of Lie Markov models that respect
the grouping of nucleotides into purines and pyrimidines -- that is, models
with purine/pyrimidine symmetry. We also discuss how to handle the subtleties
of applying Lie group methods, most naturally defined over the complex field,
to the stochastic case of a Markov process, where parameter values are
restricted to be real and positive. In particular, we explore the geometric
embedding of the cone of stochastic rate matrices within the ambient space of
the associated complex Lie algebra.
The whole list of Lie Markov models with purine/pyrimidine symmetry is
available at http://www.pagines.ma1.upc.edu/~jfernandez/LMNR.pdf.Comment: 32 page
Modelling mitochondrial site polymorphisms to infer the number of segregating units and mutation rate
We present a mathematical model of mitochondrial inheritance evolving under neutral evolution to interpret the heteroplasmies observed at some sites. A comparison of the levels of heteroplasmies transmitted from mother to her offspring allows us to estimate the number Nx of inherited mitochondrial genomes (segregating units). The model demonstrates the necessity of accounting for both the multiplicity of an unknown number Nx, and the threshold θ, below which heteroplasmy cannot be detected reliably, in order to estimate the mitochondrial mutation rate μm in the maternal line of descent. Our model is applicable to pedigree studies of any eukaryotic species where site heteroplasmies are observed in regions of the mitochondria, provided neutrality can be assumed. The model is illustrated with an analysis of site heteroplasmies in the first hypervariable region of mitochondrial sequence data sampled from Adélie penguin families, providing an estimate Nx and μm. This estimate of μm was found to be consistent with earlier estimates from ancient DNA analysis
RNase MRP and the RNA processing cascade in the eukaryotic ancestor
BACKGROUND: Within eukaryotes there is a complex cascade of RNA-based macromolecules that process other RNA molecules, especially mRNA, tRNA and rRNA. An example is RNase MRP processing ribosomal RNA (rRNA) in ribosome biogenesis. One hypothesis is that this complexity was present early in eukaryotic evolution; an alternative is that an initial simpler network later gained complexity by gene duplication in lineages that led to animals, fungi and plants. Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, herein called the Eukaryotic Ancestor. RESULTS: We present an overview of the RNA processing cascade in the Eukaryotic Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches, find previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of RNAs for RNase MRP, along with analysis of the target substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution. CONCLUSION: We conclude that RNase MRP can now be placed in the RNA-processing cascade of the Eukaryotic Ancestor, highlighting the complexity of RNA-processing in early eukaryotes. Promoter analyses of MRP-RNA suggest that regulation of the critical processes of rRNA cleavage can vary, showing that even these key cellular processes (for which we expect high conservation) show some species-specific variability. We present our consensus MRP-RNA secondary structure as a useful model for further searches
Using decision analysis to support proactive management of emerging infectious wildlife diseases
Despite calls for improved responses to emerging infectious diseases in wildlife, management is seldom considered until a disease has been detected in affected populations. Reactive approaches may limit the potential for control and increase total response costs. An alternative, proactive management framework can identify immediate actions that reduce future impacts even before a disease is detected, and plan subsequent actions that are conditional on disease emergence. We identify four main obstacles to developing proactive management strategies for the newly discovered salamander pathogen Batrachochytrium salamandrivorans (Bsal). Given that uncertainty is a hallmark of wildlife disease management and that associated decisions are often complicated by multiple competing objectives, we advocate using decision analysis to create and evaluate trade-offs between proactive (pre-emergence) and reactive (post-emergence) management options. Policy makers and natural resource agency personnel can apply principles from decision analysis to improve strategies for countering emerging infectious diseases
RNase MRP and the RNA processing cascade in the eukaryotic ancestor
Background
Within eukaryotes there is a complex cascade of RNA-based
macromolecules that process other RNA molecules, especially mRNA, tRNA and
rRNA. A simple example is the RNase MRP processing of ribosomal RNA (rRNA) in
ribosome biogenesis. One hypothesis is that this complexity was present early in
eukaryotic evolution; an alternative is that an initial simplified network later gained
complexity by gene duplication in lineages that led to animals, fungi and plants.
Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, named here as the Eukaryotic Ancestor.
Results
We present an overview of the RNA processing cascade in the Eukaryotic
Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches have uncovered previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of new and previously discovered RNase MRP RNAs along with analysis of the primary substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution.
Conclusions
We conclude that RNase MRP can now be placed in the RNA-processing
cascade present in the Eukaryotic Ancestor. This highlights the complexity of RNAprocessing in early eukaryotes
Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans
Genome-wide association studies (GWAS) have identified numerous common prostate cancer (PrCa) susceptibility loci. We have
fine-mapped 64 GWAS regions known at the conclusion of the iCOGS study using large-scale genotyping and imputation in
25 723 PrCa cases and 26 274 controls of European ancestry. We detected evidence for multiple independent signals at 16
regions, 12 of which contained additional newly identified significant associations. A single signal comprising a spectrum of
correlated variation was observed at 39 regions; 35 of which are now described by a novel more significantly associated lead SNP,
while the originally reported variant remained as the lead SNP only in 4 regions. We also confirmed two association signals in
Europeans that had been previously reported only in East-Asian GWAS. Based on statistical evidence and linkage disequilibrium
(LD) structure, we have curated and narrowed down the list of the most likely candidate causal variants for each region.
Functional annotation using data from ENCODE filtered for PrCa cell lines and eQTL analysis demonstrated significant
enrichment for overlap with bio-features within this set. By incorporating the novel risk variants identified here alongside the
refined data for existing association signals, we estimate that these loci now explain ∼38.9% of the familial relative risk of PrCa,
an 8.9% improvement over the previously reported GWAS tag SNPs. This suggests that a significant fraction of the heritability of
PrCa may have been hidden during the discovery phase of GWAS, in particular due to the presence of multiple independent
signals within the same regio
- …
