119 research outputs found

    Sequencing the USDA core soybean collection reveals gene loss during domestication and breeding

    Get PDF
    The gene content of plants varies between individuals of the same species due to gene presence/absence variation, and selection can alter the frequency of specific genes in a population. Selection during domestication and breeding will modify the genomic landscape, though the nature of these modifications is only understood for specific genes or on a more general level (e.g., by a loss of genetic diversity). Here we have assembled and analyzed a soybean (Glycine spp.) pangenome representing more than 1,000 soybean accessions derived from the USDA Soybean Germplasm Collection, including both wild and cultivated lineages, to assess genomewide changes in gene and allele frequency during domestication and breeding. We identified 3,765 genes that are absent from the Lee reference genome assembly and assessed the presence/absence of all genes across this population. In addition to a loss of genetic diversity, we found a significant reduction in the average number of protein-coding genes per individual during domestication and subsequent breeding, though with some genes and allelic variants increasing in frequency associated with selection for agronomic traits. This analysis provides a genomic perspective of domestication and breeding in this important oilseed crop

    Evaluation of genetic variation among Brazilian soybean cultivars through genome resequencing.

    Get PDF
    Soybean [Glycine max (L.) Merrill] is one of the most important legumes cultivated worldwide, and Brazil is one of the main producers of this crop. Since the sequencing of its reference genome, interest in structural and allelic variations of cultivated and wild soybean germplasm has grown. To investigate the genetics of the Brazilian soybean germplasm, we selected soybean cultivars based on the year of commercialization, geographical region and maturity group and resequenced their genomes. We resequenced the genomes of 28 Brazilian soybean cultivars with an average genome coverage of 14.8X. A total of 5,835,185 single nucleotide polymorphisms (SNPs) and 1,329,844 InDels were identified across the 20 soybean chromosomes, with 541,762 SNPs, 98,922 InDels and 1,093 CNVs that were exclusive to the 28 Brazilian cultivars. In addition, 668 allelic variations of 327 genes were shared among all of the Brazilian cultivars, including genes related to DNA-dependent transcription-elongation, photosynthesis, ATP synthesis-coupled electron transport, cellular respiration, and precursors of metabolite generation and energy. A very homogeneous structure was also observed for the Brazilian soybean germplasm, and we observed 41 regions putatively influenced by positive selection. Finally, we detected 3,880 regions with copy-number variations (CNVs) that could help to explain the divergence among the accessions evaluated. The large number of allelic and structural variations identified in this study can be used in marker-assisted selection programs to detect unique SNPs for cultivar fingerprinting. The results presented here suggest that despite the diversification of modern Brazilian cultivars, the soybean germplasm remains very narrow because of the large number of genome regions that exhibit low diversity. These results emphasize the need to introduce new alleles to increase the genetic diversity of the Brazilian germplasm

    Identification and Comparative Analysis of Differential Gene Expression in Soybean Leaf Tissue under Drought and Flooding Stress Revealed by RNA-Seq

    Get PDF
    Drought and flooding are two major causes of severe yield loss in soybean worldwide. A lack of knowledge of the molecular mechanisms involved in drought and flood stress has been a limiting factor for the effective management of soybeans; therefore, it is imperative to assess the expression of genes involved in response to flood and drought stress. In this study, differentially expressed genes (DEGs) under drought and flooding conditions were investigated using Illumina RNA-Seq transcriptome profiling. A total of 2724 and 3498 DEGs were identified under drought and flooding treatments, respectively. These genes comprise 289 Transcription Factors (TFs) representing Basic Helix-loop Helix (bHLH), Ethylene Response Factors (ERFs), myeloblastosis (MYB), No apical meristem (NAC), and WRKY amino acid motif (WRKY) type major families known to be involved in the mechanism of stress tolerance. The expression of photosynthesis and chlorophyll synthesis related genes were significantly reduced under both types of stresses, which limit the metabolic processes and thus help prolong survival under extreme conditions. However, cell wall synthesis related genes were up-regulated under drought stress and down-regulated under flooding stress. Transcript profiles involved in the starch and sugar metabolism pathways were also affected under both stress conditions. The changes in expression of genes involved in regulating the flux of cell wall precursors and starch/sugar content can serve as an adaptive mechanism for soybean survival under stress conditions. This study has revealed the involvement of TFs, transporters, and photosynthetic genes, and has also given a glimpse of hormonal cross talk under the extreme water regimes, which will aid as an important resource for soybean crop improvement

    Genomic strategies for soybean oil improvement and biodiesel production

    Get PDF
    Track II: Transportation and BiofuelsIncludes audio file (21 min.)Soybean oil, a promising renewable energy resource, comprises 73% of biodiesel in addition to other industrial applications. Missouri is the fifth largest state in the US for soybean plantation. With the target to produce 225 million gallons of biodiesel by 2015 from the current 75 million gallons produced in 2005, efforts should not only focus on expanding the number of oil crops to meet the demand but also to increase the amount of oil per hectare for each crop. Considering the ever increasing need for biodiesel and the potential for Missouri to play a major role in national and international demand, We, at the National Center for Soybean Biotechnology focus on discovering the genetic factors that are responsible for oil content in soybean using genetic and genomic strategies. The long term goal is to apply discoveries in breeding programs and biotechnology for the development of improved soybean cultivars with increased oil content that will make this crop more competitive in end-uses. Our multidisciplinary approaches include traditional Quantitative Trait Loci (QTL) mapping, association mapping, bioinformatics and transgenics by developing new resources and utilizing already available resources such as mapping populations, diverse germplasm collections, genome sequence information and transgenes. In addition to total oil content, we are focusing on improving quality traits such as oleic acid which has direct human health benefits and application in biodiesel production. With the use of advanced genomic technologies, genetic materials, and synergistic efforts involving intra- and inter institutional collaborations, we believe that our current and future research will contribute substantially to biodiesel production. Increased production using high oil soybean cultivars will not only increase the economic gains to farmers/growers but also facilitate the US to emerge as the global leader in biodiesel production

    Global agricultural intensification during climate change: A role for genomics

    Get PDF
    Summary: Agriculture is now facing the 'perfect storm' of climate change, increasing costs of fertilizer and rising food demands from a larger and wealthier human population. These factors point to a global food deficit unless the efficiency and resilience of crop production is increased. The intensification of agriculture has focused on improving production under optimized conditions, with significant agronomic inputs. Furthermore, the intensive cultivation of a limited number of crops has drastically narrowed the number of plant species humans rely on. A new agricultural paradigm is required, reducing dependence on high inputs and increasing crop diversity, yield stability and environmental resilience. Genomics offers unprecedented opportunities to increase crop yield, quality and stability of production through advanced breeding strategies, enhancing the resilience of major crops to climate variability, and increasing the productivity and range of minor crops to diversify the food supply. Here we review the state of the art of genomic-assisted breeding for the most important staples that feed the world, and how to use and adapt such genomic tools to accelerate development of both major and minor crops with desired traits that enhance adaptation to, or mitigate the effects of climate change. &gt

    Construction and comparison of three reference‐quality genome assemblies for soybean

    Get PDF
    We report reference-quality genome assemblies and annotations for two accessions of soybean (Glycine max) and for one accession of Glycine soja, the closest wild relative of G. max. The G. max assemblies provided are for widely used US cultivars: the northern line Williams 82 (Wm82) and the southern line Lee. The Wm82 assembly improves the prior published assembly, and the Lee and G. soja assemblies are new for these accessions. Comparisons among the three accessions show generally high structural conservation, but nucleotide difference of 1.7 single-nucleotide polymorphisms (snps) per kb between Wm82 and Lee, and 4.7 snps per kb between these lines and G. soja. snp distributions and comparisons with genotypes of the Lee and Wm82 parents highlight patterns of introgression and haplotype structure. Comparisons against the US germplasm collection show placement of the sequenced accessions relative to global soybean diversity. Analysis of a pan-gene collection shows generally high conservation, with variation occurring primarily in genomically clustered gene families. We found approximately 40–42 inversions per chromosome between either Lee or Wm82v4 and G. soja, and approximately 32 inversions per chromosome between Wm82 and Lee. We also investigated five domestication loci. For each locus, we found two different alleles with functional differences between G. soja and the two domesticated accessions. The genome assemblies for multiple cultivated accessions and for the closest wild ancestor of soybean provides a valuable set of resources for identifying causal variants that underlie traits for the domestication and improvement of soybean, serving as a basis for future research and crop improvement efforts for this important crop species

    Prediction of Drought-Resistant Genes in Arabidopsis thaliana Using SVM-RFE

    Get PDF
    Background: Identifying genes with essential roles in resisting environmental stress rates high in agronomic importance. Although massive DNA microarray gene expression data have been generated for plants, current computational approaches underutilize these data for studying genotype-trait relationships. Some advanced gene identification methods have been explored for human diseases, but typically these methods have not been converted into publicly available software tools and cannot be applied to plants for identifying genes with agronomic traits. Methodology: In this study, we used 22 sets of Arabidopsis thaliana gene expression data from GEO to predict the key genes involved in water tolerance. We applied an SVM-RFE (Support Vector Machine-Recursive Feature Elimination) feature selection method for the prediction. To address small sample sizes, we developed a modified approach for SVM-RFE by using bootstrapping and leave-one-out cross-validation. We also expanded our study to predict genes involved in water susceptibility. Conclusions: We analyzed the top 10 genes predicted to be involved in water tolerance. Seven of them are connected to known biological processes in drought resistance. We also analyzed the top 100 genes in terms of their biological functions. Our study shows that the SVM-RFE method is a highly promising method in analyzing plant microarray data for studyin

    Sequencing the USDA core soybean collection reveals gene loss during domestication and breeding

    Get PDF
    The gene content of plants varies between individuals of the same species due to gene presence/absence variation, and selection can alter the frequency of specific genes in a population. Selection during domestication and breeding will modify the genomic landscape, though the nature of these modifications is only understood for specific genes or on a more general level (e.g., by a loss of genetic diversity). Here we have assembled and analyzed a soybean (Glycine spp.) pangenome representingmore than 1,000 soybean accessions derived from the USDA Soybean Germplasm Collection, including both wild and cultivated lineages, to assess genomewide changes in gene and allele frequency during domestication and breeding. We identified 3,765 genes that are absent from the Lee reference genome assembly and assessed the presence/absence of all genes across this population. In addition to a loss of genetic diversity, we found a significant reduction in the average number of protein-coding genes per individual during domestication and subsequent breeding, though with some genes and allelic variants increasing in frequency associated with selection for agronomic traits. This analysis provides a genomic perspective of domestication and breeding in this important oilseed crop
    corecore