12 research outputs found

    The European Reference Genome Atlas: piloting a decentralised approach to equitable biodiversity genomics.

    Get PDF
    ABSTRACT: A global genome database of all of Earth’s species diversity could be a treasure trove of scientific discoveries. However, regardless of the major advances in genome sequencing technologies, only a tiny fraction of species have genomic information available. To contribute to a more complete planetary genomic database, scientists and institutions across the world have united under the Earth BioGenome Project (EBP), which plans to sequence and assemble high-quality reference genomes for all ∼1.5 million recognized eukaryotic species through a stepwise phased approach. As the initiative transitions into Phase II, where 150,000 species are to be sequenced in just four years, worldwide participation in the project will be fundamental to success. As the European node of the EBP, the European Reference Genome Atlas (ERGA) seeks to implement a new decentralised, accessible, equitable and inclusive model for producing high-quality reference genomes, which will inform EBP as it scales. To embark on this mission, ERGA launched a Pilot Project to establish a network across Europe to develop and test the first infrastructure of its kind for the coordinated and distributed reference genome production on 98 European eukaryotic species from sample providers across 33 European countries. Here we outline the process and challenges faced during the development of a pilot infrastructure for the production of reference genome resources, and explore the effectiveness of this approach in terms of high-quality reference genome production, considering also equity and inclusion. The outcomes and lessons learned during this pilot provide a solid foundation for ERGA while offering key learnings to other transnational and national genomic resource projects.info:eu-repo/semantics/publishedVersio

    A high-density linkage map enables a second-generation collared flycatcher genome assembly and reveals the patterns of avian recombination rate variation and chromosomal evolution

    No full text
    Detailed linkage and recombination rate maps are necessary to use the full potential of genome sequencing and population genomic analyses. We used a custom collared flycatcher 50K SNP array to develop a high-density linkage map with 37262 markers assigned to 34 linkage groups in 33 autosomes and the Z chromosome. The best-order map contained 4215 markers, with a total distance of 3132cM and a mean genetic distance between markers of 0.12cM. Facilitated by the array being designed to include markers from most scaffolds, we obtained a second-generation assembly of the flycatcher genome that approaches full chromosome sequences (N50 super-scaffold size 20.2Mb and with 1.042Gb (of 1.116Gb) anchored to and mostly ordered and oriented along chromosomes). We found that flycatcher and zebra finch chromosomes are entirely syntenic but that inversions at mean rates of 1.5-2.0 event (6.6-7.5Mb) per My have changed the organization within chromosomes, rates high enough for inversions to potentially have been involved with many speciation events during avian evolution. The mean recombination rate was 3.1cM/Mb and correlated closely with chromosome size, from 2cM/Mb for chromosomes &gt;100Mb to &gt;10cM/Mb for chromosomes &lt;10Mb. This size dependence seemed entirely due to an obligate recombination event per chromosome; if 50cM was subtracted from the genetic lengths of chromosomes, the rate per physical unit DNA was constant across chromosomes. Flycatcher recombination rate showed similar variation along chromosomes as chicken but lacked the large interior recombination deserts characteristic of zebra finch chromosomes.Kawakami and Smeds contributed equally.</p

    Data from: Estimation of linkage disequilibrium and interspecific gene flow in Ficedula flycatchers by a newly developed 50k SNP array

    No full text
    With the access to draft genome sequence assemblies and whole-genome re-sequencing data from population samples, molecular ecology studies will be able to take truly genome-wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million SNPs in population re-sequencing of this species and its sister species, pied flycatcher. Here we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. By using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. LD declined quickly to the background level at an average distance of 17 kb but the extent of LD varied markedly within the genome and was more than 10-fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later-generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 million years ago, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system

    Data from: Association mapping of morphological traits in wild and captive zebra finches: reliable within but not between populations

    No full text
    Identifying causal genetic variants underlying heritable phenotypic variation is a longstanding goal in evolutionary genetics. We previously identified several quantitative trait loci (QTL) for five morphological traits in a captive population of zebra finches (Taeniopygia guttata) by whole-genome linkage mapping. We here follow up on these studies with the aim to narrow down on the quantitative trait variants (QTN) in one wild and three captive populations. First, we performed an association study using 672 single nucleotide polymorphisms (SNPs) within candidate genes located in the previously identified QTL regions in a sample of 939 wild-caught zebra finches. Then, we validated the most promising SNP-phenotype associations (n = 25 SNPs) in 5,228 birds from four populations. Genotype-phenotype associations were generally weak in the wild population, where linkage disequilibrium (LD) spans only short genomic distances. In contrast, in captive populations, where LD blocks are large, apparent SNP-effects on morphological traits (i.e. associations) were highly repeatable with independent data from the same population. Most of those SNPs also showed significant associations with the same trait in other captive populations, but the direction and magnitude of these effects varied among populations. This suggests that the tested SNPs are not the causal QTN but rather physically linked to them, and that LD between SNPs and causal variants differs between populations due to founder effects. While the identification of QTN remains challenging in non-model organisms, we illustrate that it is indeed possible to confirm the location and magnitude of QTL in a population with stable linkage between markers and causal variants

    data_genotypes_Sequenom

    No full text
    Genotypic data for all individuals from the wild and multiple captive populations used for the validation study. A detailed description of the file is in the README
    corecore