18 research outputs found

    Linked read technology for assembling large complex and polyploid genomes

    Get PDF
    Background: Short read DNA sequencing technologies have revolutionized genome assembly by providing high accuracy and throughput data at low cost. But it remains challenging to assemble short read data, particularly for large, complex and polyploid genomes. The linked read strategy has the potential to enhance the value of short reads for genome assembly because all reads originating from a single long molecule of DNA share a common barcode. However, the majority of studies to date that have employed linked reads were focused on human haplotype phasing and genome assembly. Results: Here we describe a de novo maize B73 genome assembly generated via linked read technology which contains ~ 172,000 scaffolds with an N50 of 89 kb that cover 50% of the genome. Based on comparisons to the B73 reference genome, 91% of linked read contigs are accurately assembled. Because it was possible to identify errors with \u3e 76% accuracy using machine learning, it may be possible to identify and potentially correct systematic errors. Complex polyploids represent one of the last grand challenges in genome assembly. Linked read technology was able to successfully resolve the two subgenomes of the recent allopolyploid, proso millet (Panicum miliaceum). Our assembly covers ~ 83% of the 1 Gb genome and consists of 30,819 scaffolds with an N50 of 912 kb. Conclusions: Our analysis provides a framework for future de novo genome assemblies using linked reads, and we suggest computational strategies that if implemented have the potential to further improve linked read assemblies, particularly for repetitive genomes

    Recent advances in understanding the roles of whole genome duplications in evolution

    Get PDF
    Ancient whole-genome duplications (WGDs)—paleopolyploidy events—are key to solving Darwin’s ‘abominable mystery’ of how flowering plants evolved and radiated into a rich variety of species. The vertebrates also emerged from their invertebrate ancestors via two WGDs, and genomes of diverse gymnosperm trees, unicellular eukaryotes, invertebrates, fishes, amphibians and even a rodent carry evidence of lineage-specific WGDs. Modern polyploidy is common in eukaryotes, and it can be induced, enabling mechanisms and short-term cost-benefit assessments of polyploidy to be studied experimentally. However, the ancient WGDs can be reconstructed only by comparative genomics: these studies are difficult because the DNA duplicates have been through tens or hundreds of millions of years of gene losses, mutations, and chromosomal rearrangements that culminate in resolution of the polyploid genomes back into diploid ones (rediploidisation). Intriguing asymmetries in patterns of post-WGD gene loss and retention between duplicated sets of chromosomes have been discovered recently, and elaborations of signal transduction systems are lasting legacies from several WGDs. The data imply that simpler signalling pathways in the pre-WGD ancestors were converted via WGDs into multi-stranded parallelised networks. Genetic and biochemical studies in plants, yeasts and vertebrates suggest a paradigm in which different combinations of sister paralogues in the post-WGD regulatory networks are co-regulated under different conditions. In principle, such networks can respond to a wide array of environmental, sensory and hormonal stimuli and integrate them to generate phenotypic variety in cell types and behaviours. Patterns are also being discerned in how the post-WGD signalling networks are reconfigured in human cancers and neurological conditions. It is fascinating to unpick how ancient genomic events impact on complexity, variety and disease in modern life

    The evolutionary significance of polyploidy

    Get PDF
    Polyploidy, or the duplication of entire genomes, has been observed in prokaryotic and eukaryotic organisms, and in somatic and germ cells. The consequences of polyploidization are complex and variable, and they differ greatly between systems (clonal or non-clonal) and species, but the process has often been considered to be an evolutionary 'dead end'. Here, we review the accumulating evidence that correlates polyploidization with environmental change or stress, and that has led to an increased recognition of its short-term adaptive potential. In addition, we discuss how, once polyploidy has been established, the unique retention profile of duplicated genes following whole-genome duplication might explain key longer-term evolutionary transitions and a general increase in biological complexity

    Phylogenetic patterns and phenotypic profiles of the species of plants and mammals farmed for food

    Get PDF
    The origins of agriculture were key events in human history, during which people came to depend for their food on small numbers of animal and plant species. However, the biological traits determining which species were domesticated for food provision, and which were not, are unclear. Here, we investigate the phylogenetic distribution of livestock and crops, and compare their phenotypic traits with those of wild species. Our results indicate that phylogenetic clustering is modest for crop species but more intense for livestock. Domesticated species explore a reduced portion of the phenotypic space occupied by their wild counterparts and have particular traits in common. For example, herbaceous crops are globally characterized by traits including high leaf nitrogen concentration and tall canopies, which make them fast-growing species and proficient competitors. Livestock species are relatively large mammals with low basal metabolic rates, which indicate moderate to slow life histories. Our study therefore reveals ecological differences in domestication potential between plants and mammals. Domesticated plants belong to clades with traits that are advantageous in intensively managed high-resource habitats, whereas domesticated mammals are from clades adapted to moderately productive environments. Combining comparative phylogenetic methods with ecologically relevant traits has proven useful to unravel the causes and consequences of domestication

    Genomic islands of differentiation in a rapid avian radiation have been driven by recent selective sweeps

    Get PDF
    Numerous studies of emerging species have identified genomic "islands" of elevated differentiation against a background of relative homogeneity. The causes of these islands remain unclear, however, with some signs pointing toward "speciation genes" that locally restrict gene flow and others suggesting selective sweeps that have occurred within nascent species after speciation. Here, we examine this question through the lens of genome sequence data for five species of southern capuchino seedeaters, finch-like birds from South America that have undergone a species radiation during the last ∼50,000 generations. By applying newly developed statistical methods for ancestral recombination graph inference and machine-learning methods for the prediction of selective sweeps, we show that previously identified islands of differentiation in these birds appear to be generally associated with relatively recent, species-specific selective sweeps, most of which are predicted to be soft sweeps acting on standing genetic variation. Many of these sweeps coincide with genes associated with melanin-based variation in plumage, suggesting a prominent role for sexual selection. At the same time, a few loci also exhibit indications of possible selection against gene flow. These observations shed light on the complex manner in which natural selection shapes genome sequences during speciation

    Phylogenomics of the olive tree (Olea europaea) reveals the relative contribution of ancient allo- and autopolyploidization events

    Get PDF
    Background: Polyploidization is one of the major evolutionary processes that shape eukaryotic genomes, being particularly common in plants. Polyploids can arise through direct genome doubling within a species (autopolyploidization) or through the merging of genomes from distinct species after hybridization (allopolyploidization). The relative contribution of both mechanisms in plant evolution is debated. Here we used phylogenomics to dissect the tempo and mode of duplications in the genome of the olive tree (Olea europaea), one of the first domesticated Mediterranean fruit trees. Results: Our results depict a complex scenario involving at least three past polyploidization events, of which two—at the bases of the family Oleaceae and the tribe Oleeae, respectively—are likely to be the result of ancient allopolyploidization. A more recent polyploidization involves specifically the olive tree and relatives. Conclusion: Our results show the power of phylogenomics to distinguish between allo- and auto polyploidization events and clarify the contributions of duplications in the evolutionary history of the olive tree.TG’s group acknowledges support from the Spanish Ministry of Economy and Competitiveness through grants “Centro de Excelencia Severo Ochoa 2013-2017” SEV-2012-0208 and BFU2015-67107, cofounded by the European Regional Development Fund; from the Catalan Research Agency (AGAUR) SGR857, from the CERCA programme/ Generalitat de Catalunya; and from the European Union’s Horizon 2020 research and innovation program under Marie Sklodowska-Curie grant agreement H2020-MSCA-ITN-2014-642095 and European Research Council grant agreement ERC-2016-CoG-724173. TG and PV acknowledge support from Banco Santander for the olive genome sequencing project. IJ was supported in part by a grant from the Peruvian Ministry of Education, “Beca Presidente de la República” (2013-III)
    corecore