21 research outputs found

    Genomic tools for durum wheat breeding: de novo assembly of Svevo transcriptome and SNP discovery in elite germplasm

    BACKGROUND: The tetraploid durum wheat (Triticum turgidum L. ssp. durum Desf. Husnot) is an important crop which provides the raw material for pasta production and a valuable source of genetic diversity for breeding hexaploid wheat (Triticum aestivum L.). Future breeding efforts to enhance yield potential and climate resilience will increasingly rely on genomics-based approaches to identify and select beneficial alleles. A deeper characterisation of the molecular and functional diversity of the durum wheat transcriptome will be instrumental to more effectively harness its genetic diversity. RESULTS: We report on the de novo transcriptome assembly of durum wheat cultivar 'Svevo'. The transcriptome of four tissues/organs (shoots and roots at the seedling stage, reproductive organs and developing grains) was assembled de novo, yielding 180,108 contigs, with an N50 length of 1121 bp and mean contig length of 883 bp. Alignment against the transcriptomes of nine plant species identified 43% of transcripts with homology to at least one reference transcriptome. The functional annotation was completed by means of a combination of complementary software. The presence of differential expression between the A- and B-homoeolog copies of the durum wheat tetraploid genome was ascertained by phase reconstruction of polymorphic sites based on the T. urartu transcripts and inferring homoeolog-specific sequences. We observed greater expression divergence between A and B homoeologs in grains than in leaves and roots. The transcriptomes of 13 durum wheat cultivars spanning the breeding period from 1969 to 2005 were analysed for SNP diversity, leading to 95,358 non-rare, hemi-SNPs shared among two or more cultivars and 33,747 locus-specific (diploid inheritance) SNPs. CONCLUSIONS: Our study updates and expands the de novo transcriptome reference assembly available for durum wheat. Out of 180,108 assembled transcripts, 13,636 were specific to the Svevo cultivar as compared to the only other reference transcriptome available for durum, thus contributing to the identification of the tetraploid wheat pan-transcriptome. Additionally, the analysis of 13 historically relevant hallmark varieties produced a SNP dataset that could successfully validate the genotyping in tetraploid wheat and provide a valuable resource for genomics-assisted breeding of both tetraploid and hexaploid wheats
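
The N50 and mean contig length quoted in the abstract above are standard assembly summary statistics. As a minimal sketch of how such values are usually computed, the Python function below takes a list of contig lengths and returns the N50. The function name `n50` and the toy lengths are illustrative assumptions; they are not code or data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    contain at least half of all assembled bases.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths in bp (not taken from the Svevo assembly).
toy_lengths = [200, 300, 500, 800, 1200, 2000]
toy_n50 = n50(toy_lengths)
toy_mean = sum(toy_lengths) / len(toy_lengths)
print(f"N50 = {toy_n50} bp, mean = {toy_mean:.1f} bp")
```

Because N50 weights contigs by the bases they contain, it typically exceeds the mean contig length when many short contigs are present, which is the pattern reported above (1121 bp against 883 bp).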

    A single polyploidization event at the origin of the tetraploid genome of Coffea arabica is responsible for the extremely low genetic variation in wild and cultivated germplasm

    The genome of the allotetraploid species Coffea arabica L. was sequenced to assemble independently the two component subgenomes (putatively deriving from C. canephora and C. eugenioides) and to perform a genome-wide analysis of the genetic diversity in cultivated coffee germplasm and in wild populations growing in the center of origin of the species. We assembled a total length of 1.536 Gbp, 444 Mb and 527 Mb of which were assigned to the canephora and eugenioides subgenomes, respectively, and predicted 46,562 gene models, 21,254 and 22,888 of which were assigned to the canephora and to the eugenioides subgenome, respectively. Through a genome-wide SNP genotyping of 736 C. arabica accessions, we analyzed the genetic diversity in the species and its relationship with geographic distribution and historical records. We observed a weak population structure due to low-frequency derived alleles and highly negative values of Tajima's D, suggesting a recent and severe bottleneck, most likely resulting from a single event of polyploidization, not only for the cultivated germplasm but also for the entire species. This conclusion is strongly supported by forward simulations of mutation accumulation. However, PCA revealed a cline of genetic diversity reflecting a west-to-east geographical distribution from the center of origin in East Africa to the Arabian Peninsula. The extremely low levels of variation observed in the species, as a consequence of the polyploidization event, make the exploitation of diversity within the species for breeding purposes less interesting than in most crop species and stress the need for introgression of new variability from the diploid progenitors

    Physical Mapping of Bread Wheat Chromosome 5A: An Integrated Approach

    The huge size, redundancy, and highly repetitive nature of the bread wheat [Triticum aestivum (L.)] genome make it among the most difficult species to be sequenced. To overcome these limitations, a strategy based on the separation of individual chromosomes or chromosome arms and the subsequent production of physical maps was established within the frame of the International Wheat Genome Sequencing Consortium (IWGSC). A total of 95,812 bacterial artificial chromosome (BAC) clones of short-arm chromosome 5A (5AS) and long-arm chromosome 5A (5AL) arm-specific BAC libraries were fingerprinted and assembled into contigs by complementary analytical approaches based on the FingerPrinted Contig (FPC) and Linear Topological Contig (LTC) tools. Combined anchoring approaches based on polymerase chain reaction (PCR) marker screening, microarray, and sequence homology searches applied to several genomic tools (i.e., genetic maps, deletion bin map, neighbor maps, BAC end sequences (BESs), genome zipper, and chromosome survey sequences) allowed the development of a high-quality physical map with an anchored physical coverage of 75% for 5AS and 53% for 5AL with high portions (64 and 48%, respectively) of contigs ordered along the chromosome. In the genome of grasses, Brachypodium [Brachypodium distachyon (L.) Beauv.], rice (Oryza sativa L.), and sorghum [Sorghum bicolor (L.) Moench] homologs of genes on wheat chromosome 5A were separated into syntenic blocks on different chromosomes as a result of translocations and inversions during evolution. The physical map presented represents an essential resource for fine genetic mapping and map-based cloning of agronomically relevant traits and a reference for the 5A sequencing projects

    Characterization and evolution of satellite DNAs and their use for the identification of soybean chromosomes by in situ hybridization

    Doctoral research programme in productivity of cultivated plants, 8th cycle, academic years 1992-95. Supervisor: A. Olivieri. Co-supervisor: M. Morgante. Consiglio Nazionale delle Ricerche - Biblioteca Centrale - P.le Aldo Moro, 7, Rome; Biblioteca Nazionale Centrale - P.za Cavalleggeri, 1, Florence / CNR - Consiglio Nazionale delle Ricerche. SIGLE, IT, Ital

    Extent of wild-to-crop interspecific introgression in grapevine (Vitis vinifera) as a consequence of resistance breeding and implications for the crop species definition

    Over the past two centuries, introgression through repeated backcrossing has introduced disease resistance from wild grape species into the domesticated lineage Vitis vinifera subsp. sativa. Introgression lines are being cultivated over increasing vineyard surface areas, as their wines now rival in quality those obtained from preexisting varieties. There is, however, considerable debate about whether and how wine laws defining commercial product categories, which are based on the classification of V. vinifera and interspecific hybrid grapes, should be revised to accommodate novel varieties that do not fit either category. Here, we developed a method of multilocus genotype analysis using short-read resequencing to identify haplotypic blocks of wild ancestry in introgression lines and quantify the physical length of chromosome segments free of introgression or with monoallelic and biallelic introgression. We used these genomic data to characterize species, hybrids and introgression lines and show that newly released resistant varieties contain 76.5–94.8% of V. vinifera DNA. We found that varietal wine ratings are not always commensurate with the percentage of V. vinifera ancestry and that linkage drag of wild alleles around known resistance genes persists over at least 7.1–11.5 Mb, slowing down the recovery of the recurrent parental genome. This method also allowed us to identify the donor species of resistance haplotypes, define the ancestry of wild genetic background in introgression lines with complex pedigrees, validate the ancestry of the historic varieties Concord and Norton, and unravel sample curation errors in public databases
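
The abstract above reports ancestry percentages derived from the physical length of segments that are free of introgression, monoallelic, or biallelic. One plausible way to turn such lengths into a percentage is sketched below in Python. The weighting is an assumption made for illustration (a monoallelic segment in a diploid is counted as half V. vinifera, a biallelic segment as none); the abstract does not state the authors' exact formula, and the function name and segment lengths are hypothetical.

```python
def vinifera_ancestry_percent(free_bp, monoallelic_bp, biallelic_bp):
    """Percentage of V. vinifera DNA in a diploid genome.

    Assumed weighting (not confirmed by the source):
      - segments free of introgression: both haplotypes are vinifera
      - monoallelic introgression: one of two haplotypes is vinifera
      - biallelic introgression: neither haplotype is vinifera
    """
    total_bp = free_bp + monoallelic_bp + biallelic_bp
    if total_bp <= 0:
        raise ValueError("total segment length must be positive")
    vinifera_bp = free_bp + 0.5 * monoallelic_bp
    return 100.0 * vinifera_bp / total_bp


# Hypothetical segment lengths in bp, chosen only to illustrate the arithmetic.
example = vinifera_ancestry_percent(
    free_bp=400_000_000,
    monoallelic_bp=80_000_000,
    biallelic_bp=20_000_000,
)
print(f"{example:.1f}% V. vinifera ancestry")
```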

    Multilocus Patterns of Nucleotide Diversity, Linkage Disequilibrium and Demographic History of Norway Spruce [Picea abies (L.) Karst]

    DNA polymorphism at 22 loci was studied in an average of 47 Norway spruce [Picea abies (L.) Karst.] haplotypes sampled in seven populations representative of the natural range. The overall nucleotide variation was limited, being lower than that observed in most plant species so far studied. Linkage disequilibrium was also restricted and did not extend beyond a few hundred base pairs. All populations, with the exception of the Romanian population, could be divided into two main domains, a Baltico–Nordic and an Alpine one. Mean Tajima's D and Fay and Wu's H across loci were both negative, indicating the presence of an excess of both rare and high-frequency-derived variants compared to the expected frequency spectrum in a standard neutral model. Multilocus neutrality tests based on D and H led to the rejection of the standard neutral model and exponential growth in the whole population as well as in the two main domains. On the other hand, in all three cases the data are compatible with a severe bottleneck occurring some hundreds of thousands of years ago. Hence, demographic departures from equilibrium expectations and population structure will have to be accounted for when detecting selection at candidate genes and in association mapping studies, respectively
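
The abstract above interprets a negative mean Tajima's D as an excess of rare variants. As a sketch of where that sign comes from, the Python function below computes Tajima's D for one locus from aligned haplotypes, using the standard formulation that compares mean pairwise differences with Watterson's estimator. The function name and the toy haplotypes are assumptions made for illustration; they are not the study's data or analysis pipeline.

```python
from itertools import combinations
from math import sqrt


def tajimas_d(haplotypes):
    """Tajima's D for equal-length aligned haplotype strings (n >= 4)."""
    n = len(haplotypes)
    if n < 4:
        raise ValueError("need at least 4 haplotypes for a non-zero variance")
    if len({len(h) for h in haplotypes}) != 1:
        raise ValueError("haplotypes must be aligned to the same length")

    # S: number of segregating (polymorphic) sites.
    seg_sites = sum(1 for column in zip(*haplotypes) if len(set(column)) > 1)
    if seg_sites == 0:
        raise ValueError("no segregating sites; D is undefined")

    # pi: mean number of pairwise differences.
    pairs = list(combinations(haplotypes, 2))
    pi = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs) / len(pairs)

    # Constants of the standard Tajima's D formulation.
    a1 = sum(1 / i for i in range(1, n))
    a2 = sum(1 / i**2 for i in range(1, n))
    b1 = (n + 1) / (3 * (n - 1))
    b2 = 2 * (n**2 + n + 3) / (9 * n * (n - 1))
    c1 = b1 - 1 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1**2
    e1 = c1 / a1
    e2 = c2 / (a1**2 + a2)

    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - seg_sites / a1) / sqrt(variance)


# Hypothetical toy samples: only singletons versus balanced alleles.
rare_variants = ["AAAA", "TAAA", "ATAA", "AATA"]
balanced_variants = ["AA", "AA", "TT", "TT"]
print(tajimas_d(rare_variants), tajimas_d(balanced_variants))
```

In the first toy sample every variant is a singleton, so pairwise diversity falls below Watterson's estimate and D is negative; in the second, intermediate-frequency alleles push D above zero.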

    Large-scale detection of rare variants via pooled multiplexed next-generation sequencing: towards next-generation ecotilling

    Common variants, such as those identified by genome-wide association scans, explain only a small proportion of trait variation. Growing evidence suggests that rare functional variants, which are usually missed by genome-wide association scans, play an important role in determining the phenotype. We used pooled multiplexed next-generation sequencing and a customized analysis workflow to detect mutations in five candidate genes for lignin biosynthesis in 768 pooled Populus nigra accessions. We identified a total of 36 non-synonymous single nucleotide polymorphisms, one of which causes a premature stop codon. The most common variant was estimated to be present in 672 of the 1536 tested chromosomes, while the rarest was estimated to occur only once in 1536 chromosomes. Comparison with individual Sanger sequencing in a selected sub-sample confirmed that variants are identified with high sensitivity and specificity, and that the variant frequency was estimated accurately. This proposed method for identification of rare polymorphisms allows accurate detection of variation in many individuals, and is cost-effective compared to individual sequencing
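
The chromosome counts in the abstract above follow from pooling 768 accessions of a diploid species, which gives 1536 chromosomes. A small Python sketch of the corresponding allele-frequency arithmetic is given below; the constant and function names are illustrative assumptions, and only the counts 768, 1536, 672 and 1 come from the abstract.

```python
from fractions import Fraction

ACCESSIONS = 768
PLOIDY = 2  # Populus nigra is diploid, consistent with 1536 chromosomes
CHROMOSOMES = ACCESSIONS * PLOIDY


def allele_frequency(carrier_chromosomes, total=CHROMOSOMES):
    """Exact allele frequency as a fraction of the tested chromosomes."""
    if not 0 <= carrier_chromosomes <= total:
        raise ValueError("carrier count must lie between 0 and the total")
    return Fraction(carrier_chromosomes, total)


most_common = allele_frequency(672)  # most common variant in the abstract
rarest = allele_frequency(1)         # variant seen once in 1536 chromosomes
print(f"most common: {float(most_common):.4f}, rarest: {float(rarest):.6f}")
```

The rarest variant corresponds to a frequency of 1/1536, which illustrates why pooled detection at this depth requires high sensitivity.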