36 research outputs found

    Functional genomic effects of indels using Bayesian genome-phenome wide association studies in sorghum

    Get PDF
    High-throughput genomic and phenomic data have enhanced the ability to detect genotype-to-phenotype associations that can resolve broad pleiotropic effects of mutations on plant phenotypes. As the scale of genotyping and phenotyping has advanced, rigorous methodologies have been developed to accommodate larger datasets and maintain statistical precision. However, determining the functional effects of associated genes/loci is expensive and limited due to the complexity associated with cloning and subsequent characterization. Here, we utilized phenomic imputation of a multi-year, multi-environment dataset using PHENIX which imputes missing data using kinship and correlated traits, and we screened insertions and deletions (InDels) from the recently whole-genome sequenced Sorghum Association Panel for putative loss-of-function effects. Candidate loci from genome-wide association results were screened for potential loss of function using a Bayesian Genome-Phenome Wide Association Study (BGPWAS) model across both functionally characterized and uncharacterized loci. Our approach is designed to facilitate in silico validation of associations beyond traditional candidate gene and literature-search approaches and to facilitate the identification of putative variants for functional analysis and reduce the incidence of false-positive candidates in current functional validation methods. Using this Bayesian GPWAS model, we identified associations for previously characterized genes with known loss-of-function alleles, specific genes falling within known quantitative trait loci, and genes without any previous genome-wide associations while additionally detecting putative pleiotropic effects. In particular, we were able to identify the major tannin haplotypes at the Tan1 locus and effects of InDels on the protein folding. Depending on the haplotype present, heterodimer formation with Tan2 was significantly affected. We also identified major effect InDels in Dw2 and Ma1, where proteins were truncated due to frameshift mutations that resulted in early stop codons. These truncated proteins also lost most of their functional domains, suggesting that these indels likely result in loss of function. Here, we show that the Bayesian GPWAS model is able to identify loss-of-function alleles that can have significant effects upon protein structure and folding as well as multimer formation. Our approach to characterize loss-of-function mutations and their functional repercussions will facilitate precision genomics and breeding by identifying key targets for gene editing and trait integration

    Sorghum Association Panel whole-genome sequencing establishes cornerstone resource for dissecting genomic diversity

    Get PDF
    Association mapping panels represent foundational resources for understanding the genetic basis of phenotypic diversity and serve to advance plant breeding by exploring genetic variation across diverse accessions. We report the whole-genome sequencing (WGS) of 400 sorghum (Sorghum bicolor (L.) Moench) accessions from the Sorghum Association Panel (SAP) at an average coverage of 38× (25–72×), enabling the development of a high-density genomic marker set of 43 983 694 variants including single-nucleotide polymorphisms (approximately 38 million), insertions/deletions (indels) (approximately 5 million), and copy number variants (CNVs) (approximately 170 000). We observe slightly more deletions among indels and a much higher prevalence of deletions among CNVs compared to insertions. This new marker set enabled the identification of several novel putative genomic associations for plant height and tannin content, which were not identified when using previous lower-density marker sets. WGS identified and scored variants in 5-kb bins where available genotyping-by-sequencing (GBS) data captured no variants, with half of all bins in the genome falling into this category. The predictive ability of genomic best unbiased linear predictor (GBLUP) models was increased by an average of 30% by using WGS markers rather than GBS markers. We identified 18 selection peaks across subpopulations that formed due to evolutionary divergence during domestication, and we found six Fst peaks resulting from comparisons between converted lines and breeding lines within the SAP that were distinct from the peaks associated with historic selection. This population has served and continues to serve as a significant public resource for sorghum research and demonstrates the value of improving upon existing genomic resources

    Deep expression analysis reveals distinct cold-response strategies in rubber tree (hevea brasiliensis)

    Get PDF
    Natural rubber, an indispensable commodity used in approximately 40,000 products, is fundamental to the tire industry. The rubber tree species Hevea brasiliensis (Willd. ex Adr. de Juss.) Muell-Arg., which is native the Amazon rainforest, is the major producer of latex worldwide. Rubber tree breeding is time consuming, expensive and requires large field areas. Thus, genetic studies could optimize field evaluations, thereby reducing the time and area required for these experiments. In this work, transcriptome sequencing was used to identify a full set of transcripts and to evaluate the gene expression involved in the different cold-response strategies of the RRIM600 (cold-resistant) and GT1 (cold-tolerant) genotypes.ResultsWe built a comprehensive transcriptome using multiple database sources, which resulted in 104,738 transcripts clustered in 49,304 genes. The RNA-seq data from the leaf tissues sampled at four different times for each genotype were used to perform a gene-level expression analysis. Differentially expressed genes (DEGs) were identified through pairwise comparisons between the two genotypes for each time series of cold treatments.DEG annotation revealed that RRIM600 and GT1 exhibit different chilling tolerance strategies. To cope with cold stress, the RRIM600 clone upregulates genes promoting stomata closure, photosynthesis inhibition and a more efficient reactive oxygen species (ROS) scavenging system. The transcriptome was also searched for putative molecular markers (single nucleotide polymorphisms (SNPs) and microsatellites) in each genotype. and a total of 27,111 microsatellites and 202,949 (GT1) and 156,395 (RRIM600) SNPs were identified in GT1 and RRIM600. Furthermore, a search for alternative splicing (AS) events identified a total of 20,279 events.ConclusionsThe elucidation of genes involved in different chilling tolerance strategies associated with molecular markers and information regarding AS events provides a powerful tool for further genetic and genomic analyses of rubber tree breeding20CONSELHO NACIONAL DE DESENVOLVIMENTO CIENTÍFICO E TECNOLÓGICO - CNPQCOORDENAÇÃO DE APERFEIÇOAMENTO DE PESSOAL DE NÍVEL SUPERIOR - CAPESFUNDAÇÃO DE AMPARO À PESQUISA DO ESTADO DE SÃO PAULO - FAPESP478701/2012–8; 402954/2012Sem informação2007/50392–1; 2012/50491–8; 2014/18755–0; 2015/24346–

    Discovering useful genetic variation in the seed parent gene pool for sorghum improvement

    Get PDF
    Multi-parent populations contain valuable genetic material for dissecting complex, quantitative traits and provide a unique opportunity to capture multi-allelic variation compared to the biparental populations. A multi-parent advanced generation inter-cross (MAGIC) B-line (MBL) population composed of 708 F6 recombinant inbred lines (RILs), was recently developed from four diverse founders. These selected founders strategically represented the four most prevalent botanical races (kafir, guinea, durra, and caudatum) to capture a significant source of genetic variation to study the quantitative traits in grain sorghum [Sorghum bicolor (L.) Moench]. MBL was phenotyped at two field locations for seven yield-influencing traits: panicle type (PT), days to anthesis (DTA), plant height (PH), grain yield (GY), 1000-grain weight (TGW), tiller number per meter (TN) and yield per panicle (YPP). High phenotypic variation was observed for all the quantitative traits, with broad-sense heritabilities ranging from 0.34 (TN) to 0.84 (PH). The entire population was genotyped using Diversity Arrays Technology (DArTseq), and 8,800 single nucleotide polymorphisms (SNPs) were generated. A set of polymorphic, quality-filtered markers (3,751 SNPs) and phenotypic data were used for genome-wide association studies (GWAS). We identified 52 marker-trait associations (MTAs) for the seven traits using BLUPs generated from replicated plots in two locations. We also identified desirable allelic combinations based on the plant height loci (Dw1, Dw2, and Dw3), which influences yield related traits. Additionally, two novel MTAs were identified each on Chr1 and Chr7 for yield traits independent of dwarfing genes. We further performed a multi-variate adaptive shrinkage analysis and 15 MTAs with pleiotropic effect were identified. The five best performing MBL progenies were selected carrying desirable allelic combinations. Since the MBL population was designed to capture significant diversity for maintainer line (B-line) accessions, these progenies can serve as valuable resources to develop superior sorghum hybrids after validation of their general combining abilities via crossing with elite pollinators. Further, newly identified desirable allelic combinations can be used to enrich the maintainer germplasm lines through marker-assisted backcross breeding

    Ten new high-quality genome assemblies for diverse bioenergy sorghum genotypes

    Get PDF
    INTRODUCTION: Sorghum (Sorghum bicolor (L.) Moench) is an agriculturally and economically important staple crop that has immense potential as a bioenergy feedstock due to its relatively high productivity on marginal lands. To capitalize on and further improve sorghum as a potential source of sustainable biofuel, it is essential to understand the genomic mechanisms underlying complex traits related to yield, composition, and environmental adaptations. METHODS: Expanding on a recently developed mapping population, we generated de novo genome assemblies for 10 parental genotypes from this population and identified a comprehensive set of over 24 thousand large structural variants (SVs) and over 10.5 million single nucleotide polymorphisms (SNPs). RESULTS: We show that SVs and nonsynonymous SNPs are enriched in different gene categories, emphasizing the need for long read sequencing in crop species to identify novel variation. Furthermore, we highlight SVs and SNPs occurring in genes and pathways with known associations to critical bioenergy-related phenotypes and characterize the landscape of genetic differences between sweet and cellulosic genotypes. DISCUSSION: These resources can be integrated into both ongoing and future mapping and trait discovery for sorghum and its myriad uses including food, feed, bioenergy, and increasingly as a carbon dioxide removal mechanism

    Trajectories of Homoeolog-Specific Expression in Allotetraploid Tragopogon castellanus Populations of Independent Origins

    Get PDF
    Polyploidization can have a significant ecological and evolutionary impact by providing substantially more genetic material that may result in novel phenotypes upon which selection may act. While the effects of polyploidization are broadly reviewed across the plant tree of life, the reproducibility of these effects within naturally occurring, independently formed polyploids is poorly characterized. The flowering plant genus Tragopogon (Asteraceae) offers a rare glimpse into the intricacies of repeated allopolyploid formation with both nascent (< 90 years old) and more ancient (mesopolyploids) formations. Neo- and mesopolyploids in Tragopogon have formed repeatedly and have extant diploid progenitors that facilitate the comparison of genome evolution after polyploidization across a broad span of evolutionary time. Here, we examine four independently formed lineages of the mesopolyploid Tragopogon castellanus for homoeolog expression changes and fractionation after polyploidization. We show that expression changes are remarkably similar among these independently formed polyploid populations with large convergence among expressed loci, moderate convergence among loci lost, and stochastic silencing. We further compare and contrast these results for T. castellanus with two nascent Tragopogon allopolyploids. While homoeolog expression bias was balanced in both nascent polyploids and T. castellanus, the degree of additive expression was significantly different, with the mesopolyploid populations demonstrating more non-additive expression. We suggest that gene dosage and expression noise minimization may play a prominent role in regulating gene expression patterns immediately after allopolyploidization as well as deeper into time, and these patterns are conserved across independent polyploid lineages

    IRE1/bZIP60-Mediated Unfolded Protein Response Plays Distinct Roles in Plant Immunity and Abiotic Stress Responses

    Get PDF
    Endoplasmic reticulum (ER)-mediated protein secretion and quality control have been shown to play an important role in immune responses in both animals and plants. In mammals, the ER membrane-located IRE1 kinase/endoribonuclease, a key regulator of unfolded protein response (UPR), is required for plasma cell development to accommodate massive secretion of immunoglobulins. Plant cells can secrete the so-called pathogenesis-related (PR) proteins with antimicrobial activities upon pathogen challenge. However, whether IRE1 plays any role in plant immunity is not known. Arabidopsis thaliana has two copies of IRE1, IRE1a and IRE1b. Here, we show that both IRE1a and IRE1b are transcriptionally induced during chemically-induced ER stress, bacterial pathogen infection and treatment with the immune signal salicylic acid (SA). However, we found that IRE1a plays a predominant role in the secretion of PR proteins upon SA treatment. Consequently, the ire1a mutant plants show enhanced susceptibility to a bacterial pathogen and are deficient in establishing systemic acquired resistance (SAR), whereas ire1b is unaffected in these responses. We further demonstrate that the immune deficiency in ire1a is due to a defect in SA- and pathogen-triggered, IRE1-mediated cytoplasmic splicing of the bZIP60 mRNA, which encodes a transcription factor involved in the expression of UPR-responsive genes. Consistently, IRE1a is preferentially required for bZIP60 splicing upon pathogen infection, while IRE1b plays a major role in bZIP60 processing upon Tunicamycin (Tm)-induced stress. We also show that SA-dependent induction of UPR-responsive genes is altered in the bzip60 mutant resulting in a moderate susceptibility to a bacterial pathogen. These results indicate that the IRE1/bZIP60 branch of UPR is a part of the plant response to pathogens for which the two Arabidopsis IRE1 isoforms play only partially overlapping roles and that IRE1 has both bZIP60-dependent and bZIP60-independent functions in plant immunity

    Identification of pleiotropic loci mediating structural and non-structural carbohydrate accumulation within the sorghum bioenergy association panel using high-throughput markers

    Get PDF
    Molecular characterization of diverse germplasm can contribute to breeding programs by increasing genetic gain for sorghum [Sorghum bicolor (L.) Moench] improvement. Identifying novel marker-trait associations and candidate genes enriches the existing genomic resources and can improve bioenergy-related traits using genomic-assisted breeding. In the current scenario, identifying the genetic loci underlying biomass and carbon partitioning is vital for ongoing efforts to maximize each carbon sink’s yield for bioenergy production. Here, we have processed a high-density genomic marker (22 466 550) data based on whole-genome sequencing (WGS) using a set of 365 accessions from the bioenergy association panel (BAP), which includes ~19.7 million (19 744 726) single nucleotide polymorphism (SNPs) and 2.7 million (~2 721 824) insertion deletions (indels). A set of high-quality filtered SNP (~5.48 million) derived markers facilitated the assessment of population structure, genetic diversity, and genome-wide association studies (GWAS) for various traits related to biomass and its composition using the BAP. The phenotypic traits for GWAS included seed color (SC), plant height (PH), days to harvest (DTH), fresh weight (FW), dry weight (DW), brix content % (BRX), neutral detergent fiber (NDF), acid detergent fiber (ADF), non-fibrous carbohydrate (NFC), and lignin content. Several novel loci and candidate genes were identified for bioenergy-related traits, and some well-characterized genes for plant height (Dw1 and Dw2) and the YELLOW SEED1 locus (Y1) were validated. We further performed a multi-variate adaptive shrinkage analysis to identify pleiotropic QTL, which resulted in several shared marker-trait associations among bioenergy and compositional traits. Significant marker-trait associations with pleiotropic effects can be used to develop molecular markers for trait improvement using a marker-assisted breeding approach. Significant nucleotide diversity and heterozygosity were observed between photoperiod-sensitive and insensitive individuals of the panel. This diverse bioenergy panel with genomic resources will provide an excellent opportunity for further genetic studies, including selecting parental lines for superior hybrid development to improve biomass-related traits in sorghum

    Comparative transcriptomic analysis of dermal wound healing reveals de novo skeletal muscle regeneration in Acomys cahirinus.

    No full text
    The African spiny mouse, Acomys spp., is capable of scar-free dermal wound healing. Here, we have performed a comprehensive analysis of gene expression throughout wound healing following full-thickness excisional dermal wounds in both Acomys cahirinus and Mus musculus. Additionally, we provide an annotated, de novo transcriptome assembly of A. cahirinus skin and skin wounds. Using a novel computational comparative RNA-Seq approach along with pathway and co-expression analyses, we identify enrichment of regeneration associated genes as well as upregulation of genes directly related to muscle development or function. Our RT-qPCR data reveals induction of the myogenic regulatory factors, as well as upregulation of embryonic myosin, starting between days 14 and 18 post-wounding in A. cahirinus. In contrast, the myogenic regulatory factors remain downregulated, embryonic myosin is only modestly upregulated, and no new muscle fibers of the panniculus carnosus are generated in M. musculus wounds. Additionally, we show that Col6a1, a key component of the satellite cell niche, is upregulated in A. cahirinus compared to M. musculus. Our data also demonstrate that the macrophage profile and inflammatory response is different between species, with A. cahirinus expressing significantly higher levels of Il10. We also demonstrate differential expression of the upstream regulators Wnt7a, Wnt2 and Wnt6 during wound healing. Our analyses demonstrate that A. cahirinus is capable of de novo skeletal muscle regeneration of the panniculus carnosus following removal of the extracellular matrix. We believe this study represents the first detailed analysis of de novo skeletal muscle regeneration observed in an adult mammal

    Fatty acid composition and genome-wide associations of a chickpea (Cicer arietinum L.) diversity panel for biofortification efforts

    No full text
    Abstract Chickpea is a nutritionally dense pulse crop with high levels of protein, carbohydrates, micronutrients and low levels of fats. Chickpea fatty acids are associated with a reduced risk of obesity, blood cholesterol, and cardiovascular diseases in humans. We measured four primary chickpea fatty acids; palmitic acid (PA), linoleic acid (LA), alpha-linolenic acid (ALA), and oleic acid (OA), which are crucial for human health and plant stress responses in a chickpea diversity panel with 256 accessions (Kabuli and desi types). A wide concentration range was found for PA (450.7–912.6 mg/100 g), LA (1605.7–3459.9 mg/100 g), ALA (416.4–864.5 mg/100 g), and OA (1035.5–1907.2 mg/100 g). The percent recommended daily allowances also varied for PA (3.3–6.8%), LA (21.4–46.1%), ALA (34.7–72%), and OA (4.3–7.9%). Weak correlations were found among fatty acids. Genome-wide association studies (GWAS) were conducted using genotyping-by-sequencing data. Five significant single nucleotide polymorphisms (SNPs) were identified for PA. Admixture population structure analysis revealed seven subpopulations based on ancestral diversity in this panel. This is the first reported study to characterize fatty acid profiles across a chickpea diversity panel and perform GWAS to detect associations between genetic markers and concentrations of selected fatty acids. These findings demonstrate biofortification of chickpea fatty acids is possible using conventional and genomic breeding techniques, to develop superior cultivars with better fatty acid profiles for improved human health and plant stress responses
    corecore