12 research outputs found
Population Genomic Analysis Reveals Differential Evolutionary Histories and Patterns of Diversity across Subgenomes and Subpopulations of Brassica napus L.
The allotetraploid species Brassica napus L. is a global crop of major economic importance, providing canola oil (seed) and vegetables for human consumption and fodder and meal for livestock feed. Characterizing the genetic diversity present in the extant germplasm pool of B. napus is fundamental to better conserve, manage and utilize the genetic resources of this species. We used sequence-based genotyping to identify and genotype 30,881 SNPs in a diversity panel of 782 B. napus accessions, representing samples of winter and spring growth habits originating from 33 countries across Europe, Asia and America. We detected strong population structure broadly concordant with growth habit and geography, and identified three major genetic groups: spring (SP), winter Europe (WE), and winter Asia (WA). Subpopulation-specific polymorphism patterns suggest enriched genetic diversity within the WA group and a smaller effective breeding population for the SP group compared to WE. Interestingly, the two subgenomes of B. napus appear to have different geographic origins, with phylogenetic analysis placing WE and WA as basal clades for the other subpopulations in the C and A subgenomes, respectively. Finally, we identified 16 genomic regions where the patterns of diversity differed markedly from the genome-wide average, several of which are suggestive of genomic inversions. The results obtained in this study constitute a valuable resource for worldwide breeding efforts and the genetic dissection and prediction of complex B. napus traits
Deleterious Mutation Burden and Its Association with Complex Traits in Sorghum (Sorghum bicolor)
Sorghum (Sorghum bicolor L.) is a major food cereal for millions of people worldwide. The sorghum genome, like other species, accumulates deleterious mutations, likely impacting its fitness. The lack of recombination, drift, and the coupling with favorable loci impede the removal of deleterious mutations from the genome by selection. To study how deleterious variants impact phenotypes, we identified putative deleterious mutations among ∼5.5 M segregating variants of 229 diverse biomass sorghum lines. We provide the whole-genome estimate of the deleterious burden in sorghum, showing that ∼33% of nonsynonymous substitutions are putatively deleterious. The pattern of mutation burden varies appreciably among racial groups. Across racial groups, the mutation burden correlated negatively with biomass, plant height, specific leaf area (SLA), and tissue starch content (TSC), suggesting that deleterious burden decreases trait fitness. Putatively deleterious variants explain roughly one-half of the genetic variance. However, there is only moderate improvement in total heritable variance explained for biomass (7.6%) and plant height (average of 3.1% across all stages). There is no advantage in total heritable variance for SLA and TSC. The contribution of putatively deleterious variants to phenotypic diversity therefore appears to be dependent on the genetic architecture of traits. Overall, these results suggest that incorporating putatively deleterious variants into genomic models slightly improves prediction accuracy because of extensive linkage. Knowledge of deleterious variants could be leveraged for sorghum breeding through either genome editing and/or conventional breeding that focuses on the selection of progeny with fewer deleterious alleles
Selection upon Genome Architecture: Conservation of Functional Neighborhoods with Changing Genes
A growing body of evidence shows that genes are not distributed randomly across eukaryotic chromosomes, but rather in functional neighborhoods. Nevertheless, the driving force that originated and maintains such neighborhoods is still a matter of controversy. We present the first detailed multispecies cartography of genome regions enriched in genes with related functions and study the evolutionary implications of such clustering. Our results indicate that the chromosomes of higher eukaryotic genomes contain up to 12% of genes arranged in functional neighborhoods, with a high level of gene co-expression, which are consistently distributed in phylogenies. Unexpectedly, neighborhoods with homologous functions are formed by different (non-orthologous) genes in different species. In fact, instead of being conserved, functional neighborhoods present a higher degree of synteny breaks than the genome average. This scenario is compatible with the existence of selective pressures optimizing the coordinated transcription of blocks of functionally related genes. If these neighborhoods were broken by chromosomal rearrangements, selection would favor further rearrangements reconstructing other neighborhoods of similar function. The picture arising from this study is a dynamic genomic landscape with a high level of functional organization
Data from: Genetic diversity of the two commercial tetraploid cotton species in the Gossypium Diversity Reference Set
A diversity reference set has been constructed for the Gossypium accessions in the U.S. National Cotton Germplasm Collection to facilitate more extensive evaluation and utilization of accessions held in the Collection. A set of 105 mapped simple sequence repeat markers were used to study the allelic diversity of 1,933 tetraploid Gossypium accessions representative of the range of diversity of the improved and wild accessions of G. hirsutum and G. barbadense. The reference set contained 410 G. barbadense accessions and 1,523 G. hirsutum accessions. Observed numbers of polymorphic and private bands indicated a greater diversity in G. hirsutum as compared to G. barbadense as well as in wild type accessions as compared to improved accessions in both species. The markers clearly differentiated the two species. Patterns of diversity within species were observed but not clearly delineated, with much overlap occurring between races and regions of origin for wild accessions and between historical and geographic breeding pools for cultivated accessions. Although the percentage of accessions showing introgression was higher among wild accessions than cultivars in both species, the average level of introgression within individual accessions, as indicated by species-specific bands, was much higher in wild accessions of G. hirsutum than in wild accessions of G. barbadense. The average level of introgression within individual accessions was higher in improved G. barbadense cultivars than in G. hirsutum cultivars. This molecular characterization reveals the levels and distributions of genetic diversity that will allow for better exploration and utilization of cotton genetic resources
Copy number variation analysis in the great apes reveals species-specific patterns of structural variation
Copy number variants (CNVs) are increasingly acknowledged as an important source of evolutionary novelties in the human lineage. However, our understanding of their significance is still hindered by the lack of primate CNV data. We performed intraspecific comparative genomic hybridizations to identify loci harboring copy number variants in each of the four great apes: bonobos, chimpanzees, gorillas, and orangutans. For the first time, we could analyze differences in CNV location and frequency in these four species, and compare them with human CNVs and primate segmental duplication (SD) maps. In addition, for bonobo and gorilla, patterns of CNV and nucleotide diversity were studied in the same individuals. We show that CNVs have been subject to different selective pressures in different lineages. Evidence for purifying selection is stronger in gorilla CNVs overlapping genes, while positive selection appears to have driven the fixation of structural variants in the orangutan lineage. In contrast, chimpanzees and bonobos present high levels of common structural polymorphism, which is indicative of relaxed purifying selection together with the higher mutation rates induced by the known burst of segmental duplication in the ancestor of the African apes. Indeed, the impact of the duplication burst is noticeable by the fact that bonobo and chimpanzee share more CNVs with gorilla than expected. Finally, we identified a number of interesting genomic regions that present high-frequency CNVs in all great apes, while containing only very rare or even pathogenic structural variants in humans
Molecular Data Scores for G. hirsutum and G. barbadense
SSR fragment presence/absence matrix of molecular data originally collected from 1971 G. hirsutum and G. barbadense accessions obtained from the US National Cotton Germplasm Collection (NCGC) using 105 SSR markers. Please see ReadMe file for further information
Copy number variation analysis in the great apes reveals species-specific patterns of structural variation
Financial support was provided by a Beatriu de Pinos postdoctoral Grant to E.G., the Spanish Ministry of Science and Innovation (Grant BFU2009-13409-C02-02 to A.N.), and the Spanish National Institute for Bioinformatics (INB, www.inab.org). E.E.E. is an investigator of the Howard Hughes Medical Institute. M.R. is grateful to CEGBA (Centro di Eccellenza Geni in campo Biosanitario e Agroalimentare) and MIUR (Ministero Italiano della Università e della Ricerca; Cluster CO3, Prog. L.488/92)
Genome-wide association study identifies acyl-lipid metabolism candidate genes involved in the genetic control of natural variation for seed fatty acid traits in Brassica napus L.
Brassica napus L. represents a potential plant feedstock for the sustainable production of hydrotreated renewable fuels needed to support carbon-based energy production. However, to increase the use of plant-derived oils for energy needs, breeding efforts are required to optimize the amount and profile of fatty acids (FAs) contained in the oil extracted from B. napus seed to meet demands of the various market categories. To this end, we analyzed the genetic basis of FA content and composition of seed from a diverse panel of spring-type B. napus accessions evaluated at four US locations across multiple years. Phenotypic variation for total oil content, nine FA compounds, and 14 derivative traits was found, in general, to be highly heritable. A genome-wide association study (GWAS) detected 53 SNPs significantly associated with one or more of the 24 FA seed traits, implicating 12 candidate genes from the acyl-lipid pathway, four of which had two homologs each. To our knowledge, the two detected homologs of 3-Ketoacyl-CoA thiolase (KAT) have never been associated with seed oil traits in B. napus. Through the application of whole-genome prediction, the 24 FA seed traits were generally found to have moderately high predictive abilities (70% of traits with abilities > 0.70), suggesting that these traits are highly amenable to genomic selection. Overall, our results contribute to the expanding body of knowledge regarding key enzymes in the acyl-lipid pathway at the quantitative genetic level and illustrate how genomics-assisted breeding could be leveraged to genetically improve FA seed traits in B. napus. 24 month embargo; published online: 6 January 2020. This item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries.