77 research outputs found

    Detecting epistasis via Markov bases

    Full text link
    Rapid research progress in genotyping techniques have allowed large genome-wide association studies. Existing methods often focus on determining associations between single loci and a specific phenotype. However, a particular phenotype is usually the result of complex relationships between multiple loci and the environment. In this paper, we describe a two-stage method for detecting epistasis by combining the traditionally used single-locus search with a search for multiway interactions. Our method is based on an extended version of Fisher's exact test. To perform this test, a Markov chain is constructed on the space of multidimensional contingency tables using the elements of a Markov basis as moves. We test our method on simulated data and compare it to a two-stage logistic regression method and to a fully Bayesian method, showing that we are able to detect the interacting loci when other methods fail to do so. Finally, we apply our method to a genome-wide data set consisting of 685 dogs and identify epistasis associated with canine hair length for four pairs of SNPs

    A likelihood method for estimating present-day human contamination in ancient DNA samples using low-depth haploid chromosome data

    Get PDF
    Motivation The presence of present-day human contaminating DNA fragments is one of the challenges defining ancient DNA (aDNA) research. This is especially relevant to the ancient human DNA field where it is difficult to distinguish endogenous molecules from human contaminants due to their genetic similarity. Recently, with the advent of high-throughput sequencing and new aDNA protocols, hundreds of ancient human genomes have become available. Contamination in those genomes has been measured with computational methods often developed specifically for these empirical studies. Consequently, some of these methods have not been implemented and tested while few are aimed at low-depth data, a common feature in aDNA datasets. Results We develop a new X-chromosome-based maximum likelihood method for estimating present-day human contamination in low-depth sequencing data. We implement our method for general use, assess its performance under conditions typical of ancient human DNA research, and compare it to previous nuclear data-based methods through extensive simulations. For low-depth data, we show that existing methods can produce unusable estimates or substantially underestimate contamination. In contrast, our method provides accurate estimates for a depth of coverage as low as 0.5× on the X-chromosome when contamination is below 25%. Moreover, our method still yields meaningful estimates in very challenging situations, i.e., when the contaminant and the target come from closely related populations or with increased error rates. With a running time below five minutes, our method is applicable to large scale aDNA genomic studies. Availability and implementation The method is implemented in C++ and R and is freely available in https://github.com/sapfo/contaminationX. Contact morenomayar{at}gmail.com, annasapfo.malaspinas{at}unil.ch

    Human bony labyrinth is an indicator of population history and dispersal from Africa.

    Get PDF
    The dispersal of modern humans from Africa is now well documented with genetic data that track population history, as well as gene flow between populations. Phenetic skeletal data, such as cranial and pelvic morphologies, also exhibit a dispersal-from-Africa signal, which, however, tends to be blurred by the effects of local adaptation and in vivo phenotypic plasticity, and that is often deteriorated by postmortem damage to skeletal remains. These complexities raise the question of which skeletal structures most effectively track neutral population history. The cavity system of the inner ear (the so-called bony labyrinth) is a good candidate structure for such analyses. It is already fully formed by birth, which minimizes postnatal phenotypic plasticity, and it is generally well preserved in archaeological samples. Here we use morphometric data of the bony labyrinth to show that it is a surprisingly good marker of the global dispersal of modern humans from Africa. Labyrinthine morphology tracks genetic distances and geography in accordance with an isolation-by-distance model with dispersal from Africa. Our data further indicate that the neutral-like pattern of variation is compatible with stabilizing selection on labyrinth morphology. Given the increasingly important role of the petrous bone for ancient DNA recovery from archaeological specimens, we encourage researchers to acquire 3D morphological data of the inner ear structures before any invasive sampling. Such data will constitute an important archive of phenotypic variation in present and past populations, and will permit individual-based genotype-phenotype comparisons

    <i>bammds</i>:a tool for assessing the ancestry of low-depth whole-genome data using multidimensional scaling (MDS)

    Get PDF
    Summary: We present bammds, a practical tool that allows visualization of samples sequenced by second-generation sequencing when compared with a reference panel of individuals (usually genotypes) using a multidimensional scaling algorithm. Our tool is aimed at determining the ancestry of unknown samples—typical of ancient DNA data—particularly when only low amounts of data are available for those samples. Availability and implementation: The software package is available under GNU General Public License v3 and is freely available together with test datasets https://savannah.nongnu.org/projects/bammds/ . It is using R ( http://www.r-project.org/ ), parallel ( http://www.gnu.org/software/parallel/ ), samtools ( https://github.com/samtools/samtools ). Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.Full Tex

    Genomic evidence for the Pleistocene and recent population history of Native Americans

    Get PDF
    This is the author’s version of the work. It is posted here by permission of the AAAS for personal use, not for redistribution. The definitive version was published in Science on 2015 August 21; 349(6250), DOI: 10.1126/science.aab3884.How and when the Americas were populated remains contentious. Using ancient and modern genome-wide data, we find that the ancestors of all present-day Native Americans, including Athabascans and Amerindians, entered the Americas as a single migration wave from Siberia no earlier than 23 thousand years ago (KYA), and after no more than 8,000-year isolation period in Beringia. Following their arrival to the Americas, ancestral Native Americans diversified into two basal genetic branches around 13 KYA, one that is now dispersed across North and South America and the other is restricted to North America. Subsequent gene flow resulted in some Native Americans sharing ancestry with present-day East Asians (including Siberians) and, more distantly, Australo-Melanesians. Putative ‘Paleoamerican’ relict populations, including the historical Mexican Pericúes and South American Fuego-Patagonians, are not directly related to modern Australo-Melanesians as suggested by the Paleoamerican Model

    Mitochondrial Genomes Reveal an Explosive Radiation of Extinct and Extant Bears near the Miocene-Pliocene Boundary

    Get PDF
    Background: Despite being one of the most studied families within the Carnivora, the phylogenetic relationships among the members of the bear family (Ursidae) have long remained unclear. Widely divergent topologies have been suggested based on various data sets and methods. Results: We present a fully resolved phylogeny for ursids based on ten complete mitochondrial genome sequences from all eight living and two recently extinct bear species, the European cave bear (Ursus spelaeus) and the American giant short-faced bear (Arctodus simus). The mitogenomic data yield a well-resolved topology for ursids, with the sloth bear at the basal position within the genus Ursus. The sun bear is the sister taxon to both the American and Asian black bears, and this clade is the sister clade of cave bear, brown bear and polar bear confirming a recent study on bear mitochondrial genomes. Conclusion: Sequences from extinct bears represent the third and fourth Pleistocene species for which complete mitochondrial genomes have been sequenced. Moreover, the cave bear specimen demonstrates that mitogenomic studies can be applied to Pleistocene fossils that have not been preserved in permafrost, and therefore have a broad application within ancient DNA research. Molecular dating of the mtDNA divergence times suggests a rapid radiation of bears in both the Old and New Worlds around 5 million years ago, at the Miocene-Pliocene boundary. This coincides with major global changes, such as the Messinian crisis and the first opening of the Bering Strait, and suggests a global influence of such events on species radiations

    Paleogenomics. Genomic structure in Europeans dating back at least 36,200 years.

    Get PDF
    The origin of contemporary Europeans remains contentious. We obtained a genome sequence from Kostenki 14 in European Russia dating from 38,700 to 36,200 years ago, one of the oldest fossils of anatomically modern humans from Europe. We find that Kostenki 14 shares a close ancestry with the 24,000-year-old Mal'ta boy from central Siberia, European Mesolithic hunter-gatherers, some contemporary western Siberians, and many Europeans, but not eastern Asians. Additionally, the Kostenki 14 genome shows evidence of shared ancestry with a population basal to all Eurasians that also relates to later European Neolithic farmers. We find that Kostenki 14 contains more Neandertal DNA that is contained in longer tracts than present Europeans. Our findings reveal the timing of divergence of western Eurasians and East Asians to be more than 36,200 years ago and that European genomic structure today dates back to the Upper Paleolithic and derives from a metapopulation that at times stretched from Europe to central Asia.GeoGenetics members were supported by the Lundbeck Foundation and the Danish National Research Foundation (DNRF94). ASM was supported by the Swiss National Science Foundation (PBSKP3_143529). Research on the archaeological background by PRN was supported by a MC Career Integration Grant (322261).This is the accepted manuscript. The final version is available from Science at http://www.sciencemag.org/content/346/6213/1113.short

    Mitochondrial genomes reveal an explosive radiation of extinct and extant bears near the Miocene-Pliocene boundary

    Get PDF
    Background. Despite being one of the most studied families within the Carnivora, the phylogenetic relationships among the members of the bear family (Ursidae) have long remained unclear. Widely divergent topologies have been suggested based on various data sets and methods. Results. We present a fully resolved phylogeny for ursids based on ten complete mitochondrial genome sequences from all eight living and two recently extinct bear species, the European cave bear (Ursus spelaeus) and the American giant short-faced bear (Arctodus simus). The mitogenomic data yield a well-resolved topology for ursids, with the sloth bear at the basal position within the genus Ursus. The sun bear is the sister taxon to both the American and Asian black bears, and this clade is the sister clade of cave bear, brown bear and polar bear confirming a recent study on bear mitochondrial genomes. Conclusion. Sequences from extinct bears represent the third and fourth Pleistocene species for which complete mitochondrial genomes have been sequenced. Moreover, the cave bear specimen demonstrates that mitogenomic studies can be applied to Pleistocene fossils that have not been preserved in permafrost, and therefore have a broad application within ancient DNA research. Molecular dating of the mtDNA divergence times suggests a rapid radiation of bears in both the Old and New Worlds around 5 million years ago, at the Miocene-Pliocene boundary. This coincides with major global changes, such as the Messinian crisis and the first opening of the Bering Strait, and suggests a global influence of such events on species radiations.Facultad de Ciencias Naturales y Muse

    Proboscidean Mitogenomics: Chronology and Mode of Elephant Evolution Using Mastodon as Outgroup

    Get PDF
    We have sequenced the complete mitochondrial genome of the extinct American mastodon (Mammut americanum) from an Alaskan fossil that is between 50,000 and 130,000 y old, extending the age range of genomic analyses by almost a complete glacial cycle. The sequence we obtained is substantially different from previously reported partial mastodon mitochondrial DNA sequences. By comparing those partial sequences to other proboscidean sequences, we conclude that we have obtained the first sequence of mastodon DNA ever reported. Using the sequence of the mastodon, which diverged 24–28 million years ago (mya) from the Elephantidae lineage, as an outgroup, we infer that the ancestors of African elephants diverged from the lineage leading to mammoths and Asian elephants approximately 7.6 mya and that mammoths and Asian elephants diverged approximately 6.7 mya. We also conclude that the nuclear genomes of the African savannah and forest elephants diverged approximately 4.0 mya, supporting the view that these two groups represent different species. Finally, we found the mitochondrial mutation rate of proboscideans to be roughly half of the rate in primates during at least the last 24 million years
    corecore