12 research outputs found

    A universal probe set for targeted sequencing of 353 nuclear genes from any flowering plant designed using k-medoids clustering

    Get PDF
    Sequencing of target-enriched libraries is an efficient and cost-effective method for obtaining DNA sequence data from hundreds of nuclear loci for phylogeny reconstruction. Much of the cost of developing targeted sequencing approaches is associated with the generation of preliminary data needed for the identification of orthologous loci for probe design. In plants, identifying orthologous loci has proven difficult due to a large number of whole-genome duplication events, especially in the angiosperms (flowering plants).We used multiple sequence alignments from over 600 angiosperms for 353 putatively single-copy protein-coding genes identified by the One Thousand Plant Transcriptomes Initiative to design a set of targeted sequencing probes for phylogenetic studies of any angiosperm group. To maximize the phylogenetic potential of the probes, while minimizing the cost of production, we introduce a k-medoids clustering approach to identify the minimum number of sequences necessary to represent each coding sequence in the final probe set. Using this method, 5–15 representative sequences were selected per orthologous locus, representing the sequence diversity of angiosperms more efficiently than if probes were designed using available sequenced genomes alone. To test our approximately 80,000 probes, we hybridized libraries from 42 species spanning all higher-order groups of angiosperms, with a focus on taxa not present in the sequence alignments used to design the probes. Out of a possible 353 coding sequences, we recovered an average of 283 per species and at least 100 in all species. Differences among taxa in sequence recovery could not be explained by relatedness to the representative taxa selected for probe design, suggesting that there is no phylogenetic bias in the probe set. Our probe set, which targeted 260 kbp of coding sequence, achieved a median recovery of 137 kbp per taxon in coding regions, a maximum recovery of 250 kbp, and an additional median of 212 kbp per taxon in flanking non-coding regions across all species. These results suggest that the Angiosperms353 probe set described here is effective for any group of flowering plants and would be useful for phylogenetic studies from the species level to higher-order groups, including the entire angiosperm clade itself

    A roadmap for global synthesis of the plant tree of life

    Get PDF
    Providing science and society with an integrated, up-to-date, high quality, open, reproducible and sustainable plant tree of life would be a huge service that is now coming within reach. However, synthesizing the growing body of DNA sequence data in the public domain and disseminating the trees to a diverse audience are often not straightforward due to numerous informatics barriers. While big synthetic plant phylogenies are being built, they remain static and become quickly outdated as new data are published and tree-building methods improve. Moreover, the body of existing phylogenetic evidence is hard to navigate and access for non-experts. We propose that our community of botanists, tree builders, and informaticians should converge on a modular framework for data integration and phylogenetic analysis, allowing easy collaboration, updating, data sourcing and flexible analyses. With support from major institutions, this pipeline should be re-run at regular intervals, storing trees and their metadata long-term. Providing the trees to a diverse global audience through user-friendly front ends and application development interfaces should also be a priority. Interactive interfaces could be used to solicit user feedback and thus improve data quality and to coordinate the generation of new data. We conclude by outlining a number of steps that we suggest the scientific community should take to achieve global phylogenetic synthesis

    Latin Americans show wide-spread Converso ancestry and imprint of local Native ancestry on physical appearance

    Get PDF
    Historical records and genetic analyses indicate that Latin Americans trace their ancestry mainly to the intermixing (admixture) of Native Americans, Europeans and Sub-Saharan Africans. Using novel haplotype-based methods, here we infer sub-continental ancestry in over 6,500 Latin Americans and evaluate the impact of regional ancestry variation on physical appearance. We find that Native American ancestry components in Latin Americans correspond geographically to the present-day genetic structure of Native groups, and that sources of non-Native ancestry, and admixture timings, match documented migratory flows. We also detect South/East Mediterranean ancestry across Latin America, probably stemming mostly from the clandestine colonial migration of Christian converts of non-European origin (Conversos). Furthermore, we find that ancestry related to highland (Central Andean) versus lowland (Mapuche) Natives is associated with variation in facial features, particularly nose morphology, and detect significant differences in allele frequencies between these groups at loci previously associated with nose morphology in this sample

    Persistent Place-Making in Prehistory: the Creation, Maintenance, and Transformation of an Epipalaeolithic Landscape

    Get PDF
    Most archaeological projects today integrate, at least to some degree, how past people engaged with their surroundings, including both how they strategized resource use, organized technological production, or scheduled movements within a physical environment, as well as how they constructed cosmologies around or created symbolic connections to places in the landscape. However, there are a multitude of ways in which archaeologists approach the creation, maintenance, and transformation of human-landscape interrelationships. This paper explores some of these approaches for reconstructing the Epipalaeolithic (ca. 23,000–11,500 years BP) landscape of Southwest Asia, using macro- and microscale geoarchaeological approaches to examine how everyday practices leave traces of human-landscape interactions in northern and eastern Jordan. The case studies presented here demonstrate that these Epipalaeolithic groups engaged in complex and far-reaching social landscapes. Examination of the Early and Middle Epipalaeolithic (EP) highlights that the notion of “Neolithization” is somewhat misleading as many of the features we use to define this transition were already well-established patterns of behavior by the Neolithic. Instead, these features and practices were enacted within a hunter-gatherer world and worldview

    Ancient DNA of Phoenician remains indicates discontinuity in the settlement history of Ibiza

    No full text
    Ibiza was permanently settled around the 7th century BCE by founders arriving from west Phoenicia. The founding population grew significantly and reached its height during the 4th century BCE. We obtained nine complete mitochondrial genomes from skeletal remains from two Punic necropoli in Ibiza and a Bronze Age site from Formentara. We also obtained low coverage (0.47X average depth) of the genome of one individual, directly dated to 361-178 cal BCE, from the Cas Molí site on Ibiza. We analysed and compared ancient DNA results with 18 new mitochondrial genomes from modern Ibizans to determine the ancestry of the founders of Ibiza. The mitochondrial results indicate a predominantly recent European maternal ancestry for the current Ibizan population while the whole genome data suggest a significant Eastern Mediterranean component. Our mitochondrial results suggest a genetic discontinuity between the early Phoenician settlers and the island's modern inhabitants. Our data, while limited, suggest that the Eastern or North African influence in the Punic population of Ibiza was primarily male dominated

    Persistent Place-Making in Prehistory: the Creation, Maintenance, and Transformation of an Epipalaeolithic Landscape

    No full text
    corecore