793 research outputs found
Molecular evolution of the sheep prion protein gene
Transmissible spongiform encephalopathies (TSEs) are infectious, fatal neurodegenerative diseases characterized by aggregates of modified forms of the prion protein (PrP) in the central nervous system. Well known examples include variant Creutzfeldt-Jakob Disease (vCJD) in humans, BSE in cattle, chronic wasting disease in deer and scrapie in sheep and goats. In humans, sheep and deer, disease susceptibility is determined by host genotype at the prion protein gene (PRNP). Here I examine the molecular evolution of PRNP in ruminants and show that variation in sheep appears to have been maintained by balancing selection, a profoundly different process from that seen in other ruminants. Scrapie eradication programs such as those recently implemented in the UK, USA and elsewhere are based on the assumption that PRNP is under positive selection in response to scrapie. If, as these data suggest, that assumption is wrong, eradication programs will disrupt this balancing selection, and may have a negative impact on the fitness or scrapie resistance of national flocks
Gene-history correlation and population structure
Correlation of gene histories in the human genome determines the patterns of
genetic variation (haplotype structure) and is crucial to understanding genetic
factors in common diseases. We derive closed analytical expressions for the
correlation of gene histories in established demographic models for genetic
evolution and show how to extend the analysis to more realistic (but more
complicated) models of demographic structure. We identify two contributions to
the correlation of gene histories in divergent populations: linkage
disequilibrium, and differences in the demographic history of individuals in
the sample. These two factors contribute to correlations at different length
scales: the former at small, and the latter at large scales. We show that
recent mixing events in divergent populations limit the range of correlations
and compare our findings to empirical results on the correlation of gene
histories in the human genome.Comment: Revised and extended version: 26 pages, 5 figures, 1 tabl
Genomic heterogeneity of historical gene flow between two species of newts inferred from transcriptome data
The role of gene flow in species formation is a major unresolved issue in speciation biology. Progress in this area requires information on the long‐term patterns of gene flow between diverging species. Here, we used thousands of single‐nucleotide polymorphisms derived from transcriptome resequencing and a method modeling the joint frequency spectrum of these polymorphisms to reconstruct patterns of historical gene flow between two Lissotriton newts: L. vulgaris (Lv) and L. montandoni (Lm). We tested several models of divergence including complete isolation and various scenarios of historical gene flow. The model of secondary contact received the highest support. According to this model, the species split from their common ancestor ca. 5.5 million years (MY) ago, evolved in isolation for ca. 2 MY, and have been exchanging genes for the last 3.5 MY Demographic changes have been inferred in both species, with the current effective population size of ca. 0.7 million in Lv and 0.2 million in Lm. The postdivergence gene flow resulted in two‐directional introgression which affected the genomes of both species, but was more pronounced from Lv to Lm. Interestingly, we found evidence for genomic heterogeneity of interspecific gene flow. This study demonstrates the complexity of long‐term gene flow between distinct but incompletely reproductively isolated taxa which divergence was initiated millions of years ago
The variant call format and VCFtools
Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API
A Genealogical Interpretation of Principal Components Analysis
Principal components analysis, PCA, is a statistical method commonly used in population genetics to identify structure in the distribution of genetic variation across geographical location and ethnic background. However, while the method is often used to inform about historical demographic processes, little is known about the relationship between fundamental demographic parameters and the projection of samples onto the primary axes. Here I show that for SNP data the projection of samples onto the principal components can be obtained directly from considering the average coalescent times between pairs of haploid genomes. The result provides a framework for interpreting PCA projections in terms of underlying processes, including migration, geographical isolation, and admixture. I also demonstrate a link between PCA and Wright's fst and show that SNP ascertainment has a largely simple and predictable effect on the projection of samples. Using examples from human genetics, I discuss the application of these results to empirical data and the implications for inference
Blood ties: ABO is a trans-species polymorphism in primates
The ABO histo-blood group, the critical determinant of transfusion
incompatibility, was the first genetic polymorphism discovered in humans.
Remarkably, ABO antigens are also polymorphic in many other primates, with the
same two amino acid changes responsible for A and B specificity in all species
sequenced to date. Whether this recurrence of A and B antigens is the result of
an ancient polymorphism maintained across species or due to numerous, more
recent instances of convergent evolution has been debated for decades, with a
current consensus in support of convergent evolution. We show instead that
genetic variation data in humans and gibbons as well as in Old World Monkeys
are inconsistent with a model of convergent evolution and support the
hypothesis of an ancient, multi-allelic polymorphism of which some alleles are
shared by descent among species. These results demonstrate that the ABO
polymorphism is a trans-species polymorphism among distantly related species
and has remained under balancing selection for tens of millions of years, to
date, the only such example in Hominoids and Old World Monkeys outside of the
Major Histocompatibility Complex.Comment: 45 pages, 4 Figures, 4 Supplementary Figures, 5 Supplementary Table
Localizing Recent Adaptive Evolution in the Human Genome
Identifying genomic locations that have experienced selective sweeps is an important first step toward understanding the molecular basis of adaptive evolution. Using statistical methods that account for the confounding effects of population demography, recombination rate variation, and single-nucleotide polymorphism ascertainment, while also providing fine-scale estimates of the position of the selected site, we analyzed a genomic dataset of 1.2 million human single-nucleotide polymorphisms genotyped in African-American, European-American, and Chinese samples. We identify 101 regions of the human genome with very strong evidence (p < 10−5) of a recent selective sweep and where our estimate of the position of the selective sweep falls within 100 kb of a known gene. Within these regions, genes of biological interest include genes in pigmentation pathways, components of the dystrophin protein complex, clusters of olfactory receptors, genes involved in nervous system development and function, immune system genes, and heat shock genes. We also observe consistent evidence of selective sweeps in centromeric regions. In general, we find that recent adaptation is strikingly pervasive in the human genome, with as much as 10% of the genome affected by linkage to a selective sweep
Transplanting the leafy liverwort Herbertus hutchinsiae : A suitable conservation tool to maintain oceanic-montane liverwort-rich heath?
Thanks to the relevant landowners and managers for permission to carry out the experiments, Chris Preston for helping to obtain the liverwort distribution records and the distribution map, Gordon Rothero and Dave Horsfield for advice on choosing experimental sites and Alex Douglas for statistical advice. Juliane Geyer’s help with fieldwork was greatly appreciated. This study was made possible by a NERC PhD studentship and financial support from the Royal Botanic Garden Edinburgh and Scottish Natural Heritage.Peer reviewedPostprin
Genome-wide fine-scale recombination rate variation in Drosophila melanogaster
Estimating fine-scale recombination maps of Drosophila from population genomic data is a challenging problem, in particular because of the high background recombination rate. In this paper, a new computational method is developed to address this challenge. Through an extensive simulation study, it is demonstrated that the method allows more accurate inference, and exhibits greater robustness to the effects of natural selection and noise, compared to a well-used previous method developed for studying fine-scale recombination rate variation in the human genome. As an application, a genome-wide analysis of genetic variation data is performed for two Drosophila melanogaster populations, one from North America (Raleigh, USA) and the other from Africa (Gikongoro, Rwanda). It is shown that fine-scale recombination rate variation is widespread throughout the D. melanogaster genome, across all chromosomes and in both populations. At the fine-scale, a conservative, systematic search for evidence of recombination hotspots suggests the existence of a handful of putative hotspots each with at least a tenfold increase in intensity over the background rate. A wavelet analysis is carried out to compare the estimated recombination maps in the two populations and to quantify the extent to which recombination rates are conserved. In general, similarity is observed at very broad scales, but substantial differences are seen at fine scales. The average recombination rate of the X chromosome appears to be higher than that of the autosomes in both populations, and this pattern is much more pronounced in the African population than the North American population. The correlation between various genomic features—including recombination rates, diversity, divergence, GC content, gene content, and sequence quality—is examined using the wavelet analysis, and it is shown that the most notable difference between D. melanogaster and humans is in the correlation between recombination and diversity
Genomic signatures of population decline in the malaria mosquito Anopheles gambiae
Population genomic features such as nucleotide diversity and linkage disequilibrium are expected to be strongly shaped by changes in population size, and might therefore be useful for monitoring the success of a control campaign. In the Kilifi district of Kenya, there has been a marked decline in the abundance of the malaria vector Anopheles gambiae subsequent to the rollout of insecticide-treated bed nets. To investigate whether this decline left a detectable population genomic signature, simulations were performed to compare the effect of population crashes on nucleotide diversity, Tajima's D, and linkage disequilibrium (as measured by the population recombination parameter ρ). Linkage disequilibrium and ρ were estimated for An. gambiae from Kilifi, and compared them to values for Anopheles arabiensis and Anopheles merus at the same location, and for An. gambiae in a location 200 km from Kilifi. In the first simulations ρ changed more rapidly after a population crash than the other statistics, and therefore is a more sensitive indicator of recent population decline. In the empirical data, linkage disequilibrium extends 100-1000 times further, and ρ is 100-1000 times smaller, for the Kilifi population of An. gambiae than for any of the other populations. There were also significant runs of homozygosity in many of the individual An. gambiae mosquitoes from Kilifi. These results support the hypothesis that the recent decline in An. gambiae was driven by the rollout of bed nets. Measuring population genomic parameters in a small sample of individuals before, during and after vector or pest control may be a valuable method of tracking the effectiveness of interventions
- …