60 research outputs found
Genetic Analysis of Central Carbon Metabolism Unveils an Amino Acid Substitution That Alters Maize NAD-Dependent Isocitrate Dehydrogenase Activity
Background: Central carbon metabolism (CCM) is a fundamental component of life. The participating genes and enzymes are thought to be structurally and functionally conserved across and within species. Association mapping utilizes a rich history of mutation and recombination to achieve high resolution mapping. Therefore, applying association mapping in maize (Zea mays ssp. mays), the most diverse model crop species, to study the genetics of CCM is a particularly attractive system. Methodology/Principal Findings: We used a maize diversity panel to test the CCM functional conservation. We found heritable variation in enzyme activity for every enzyme tested. One of these enzymes was the NAD-dependent isocitrate dehydrogenase (IDH, E.C. 1.1.1.41), in which we identified a novel amino-acid substitution in a phylogenetically conserved site. Using candidate gene association mapping, we identified that this non-synonymous polymorphism was associated with IDH activity variation. The proposed mechanism for the IDH activity variation includes additional components regulating protein level. With the comparison of sequences from maize and teosinte (Zea mays ssp. Parviglumis), the maize wild ancestor, we found that some CCM genes had also been targeted for selection during maize domestication. Conclusions/Significance: Our results demonstrate the efficacy of association mapping for dissecting natural variation in primary metabolic pathways. The considerable genetic diversity observed in maize CCM genes underlies heritable phenotypic variation in enzyme activities and can be useful to identify putative functional sites
Metabolic Profiling of a Mapping Population Exposes New Insights in the Regulation of Seed Metabolism and Seed, Fruit, and Plant Relations
To investigate the regulation of seed metabolism and to estimate the degree of metabolic natural variability, metabolite profiling and network analysis were applied to a collection of 76 different homozygous tomato introgression lines (ILs) grown in the field in two consecutive harvest seasons. Factorial ANOVA confirmed the presence of 30 metabolite quantitative trait loci (mQTL). Amino acid contents displayed a high degree of variability across the population, with similar patterns across the two seasons, while sugars exhibited significant seasonal fluctuations. Upon integration of data for tomato pericarp metabolite profiling, factorial ANOVA identified the main factor for metabolic polymorphism to be the genotypic background rather than the environment or the tissue. Analysis of the coefficient of variance indicated greater phenotypic plasticity in the ILs than in the M82 tomato cultivar. Broad-sense estimate of heritability suggested that the mode of inheritance of metabolite traits in the seed differed from that in the fruit. Correlation-based metabolic network analysis comparing metabolite data for the seed with that for the pericarp showed that the seed network displayed tighter interdependence of metabolic processes than the fruit. Amino acids in the seed metabolic network were shown to play a central hub-like role in the topology of the network, maintaining high interactions with other metabolite categories, i.e., sugars and organic acids. Network analysis identified six exceptionally highly co-regulated amino acids, Gly, Ser, Thr, Ile, Val, and Pro. The strong interdependence of this group was confirmed by the mQTL mapping. Taken together these results (i) reflect the extensive redundancy of the regulation underlying seed metabolism, (ii) demonstrate the tight co-ordination of seed metabolism with respect to fruit metabolism, and (iii) emphasize the centrality of the amino acid module in the seed metabolic network. Finally, the study highlights the added value of integrating metabolic network analysis with mQTL mapping
Efficient QTL detection for nonhost resistance in wild lettuce: backcross inbred lines versus F2 population
In plants, several population types [F2, recombinant inbred lines, backcross inbred lines (BILs), etc.] are used for quantitative trait locus (QTL) analyses. However, dissection of the trait of interest and subsequent confirmation by introgression of QTLs for breeding purposes has not been as successful as that predicted from theoretical calculations. More practical knowledge of different QTL mapping approaches is needed. In this recent study, we describe the detection and mapping of quantitative resistances to downy mildew in a set of 29 BILs of cultivated lettuce (L. sativa) containing genome segments introgressed from wild lettuce (L. saligna). Introgression regions that are associated with quantitative resistance are considered to harbor a QTL. Furthermore, we compare this with results from an already existing F2 population derived from the same parents. We identified six QTLs in our BIL approach compared to only three in the F2 approach, while there were two QTLs in common. We performed a simulation study based on our actual data to help us interpret them. This revealed that two newly detected QTLs in the BILs had gone unnoticed in the F2, due to a combination of recessiveness of the trait and skewed segregation, causing a deficit of the wild species alleles. This study clearly illustrates the added value of extended genetic studies on two different population types (BILs and F2) to dissect complex genetic traits
The Complex Genetic Architecture of the Metabolome
Discovering links between the genotype of an organism and its metabolite levels can increase our understanding of metabolism, its controls, and the indirect effects of metabolism on other quantitative traits. Recent technological advances in both DNA sequencing and metabolite profiling allow the use of broad-spectrum, untargeted metabolite profiling to generate phenotypic data for genome-wide association studies that investigate quantitative genetic control of metabolism within species. We conducted a genome-wide association study of natural variation in plant metabolism using the results of untargeted metabolite analyses performed on a collection of wild Arabidopsis thaliana accessions. Testing 327 metabolites against >200,000 single nucleotide polymorphisms identified numerous genotype–metabolite associations distributed non-randomly within the genome. These clusters of genotype–metabolite associations (hotspots) included regions of the A. thaliana genome previously identified as subject to recent strong positive selection (selective sweeps) and regions showing trans-linkage to these putative sweeps, suggesting that these selective forces have impacted genome-wide control of A. thaliana metabolism. Comparing the metabolic variation detected within this collection of wild accessions to a laboratory-derived population of recombinant inbred lines (derived from two of the accessions used in this study) showed that the higher level of genetic variation present within the wild accessions did not correspond to higher variance in metabolic phenotypes, suggesting that evolutionary constraints limit metabolic variation. While a major goal of genome-wide association studies is to develop catalogues of intraspecific variation, the results of multiple independent experiments performed for this study showed that the genotype–metabolite associations identified are sensitive to environmental fluctuations. Thus, studies of intraspecific variation conducted via genome-wide association will require analyses of genotype by environment interaction. Interestingly, the network structure of metabolite linkages was also sensitive to environmental differences, suggesting that key aspects of network architecture are malleable
Genomic Analysis of QTLs and Genes Altering Natural Variation in Stochastic Noise
Quantitative genetic analysis has long been used to study how natural variation of genotype can influence an organism's phenotype. While most studies have focused on genetic determinants of phenotypic average, it is rapidly becoming understood that stochastic noise is genetically determined. However, it is not known how many traits display genetic control of stochastic noise nor how broadly these stochastic loci are distributed within the genome. Understanding these questions is critical to our understanding of quantitative traits and how they relate to the underlying causal loci, especially since stochastic noise may be directly influenced by underlying changes in the wiring of regulatory networks. We identified QTLs controlling natural variation in stochastic noise of glucosinolates, plant defense metabolites, as well as QTLs for stochastic noise of related transcripts. These loci included stochastic noise QTLs unique for either transcript or metabolite variation. Validation of these loci showed that genetic polymorphism within the regulatory network alters stochastic noise independent of effects on corresponding average levels. We examined this phenomenon more globally, using transcriptomic datasets, and found that the Arabidopsis transcriptome exhibits significant, heritable differences in stochastic noise. Further analysis allowed us to identify QTLs that control genomic stochastic noise. Some genomic QTL were in common with those altering average transcript abundance, while others were unique to stochastic noise. Using a single isogenic population, we confirmed that natural variation at ELF3 alters stochastic noise in the circadian clock and metabolism. Since polymorphisms controlling stochastic noise in genomic phenotypes exist within wild germplasm for naturally selected phenotypes, this suggests that analysis of Arabidopsis evolution should account for genetic control of stochastic variance and average phenotypes. It remains to be determined if natural genetic variation controlling stochasticity is equally distributed across the genomes of other multi-cellular eukaryotes
Combining genetical genomics and bulked segregant analysis-based differential expression: an approach to gene localization
Positional gene isolation in unsequenced species generally requires either a reference genome sequence or an inference of gene content based on conservation of synteny with a genomic model. In the large unsequenced genomes of the Triticeae cereals the latter, i.e. conservation of synteny with the rice and Brachypodium genomes, provides a powerful proxy for establishing local gene content and order. However, efficient exploitation of conservation of synteny requires ‘homology bridges’ between the model genome and the target region that contains a gene of interest. As effective homology bridges are generally the sequences of genetically mapped genes, increasing the density of these genes around a target locus is an important step in the process. We used bulked segregant analysis (BSA) of transcript abundance data to identify genes located in a specific region of the barley genome. The approach is valuable because only a relatively small proportion of barley genes are currently placed on a genetic map. We analyzed eQTL datasets from the reference Steptoe × Morex doubled haploid population and showed a strong association between differential gene expression and cis-regulation, with 83% of differentially expressed genes co-locating with their eQTL. We then performed BSA by assembling allele-specific pools based on the genotypes of individuals at the partial resistance QTL Rphq11. BSA identified a total of 411 genes as differentially expressed, including HvPHGPx, a gene previously identified as a promising candidate for Rphq11. The genetic location of 276 of these genes could be determined from both eQTL datasets and conservation of synteny, and 254 (92%) of these were located on the target chromosome. We conclude that the identification of differential expression by BSA constitutes a novel method to identify genes located in specific regions of interest. The datasets obtained from such studies provide a robust set of candidate genes for the analysis and serve as valuable resources for targeted marker development and comparative mapping with other grass species
Construction and application for QTL analysis of a Restriction Site Associated DNA (RAD) linkage map in barley
<p>Abstract</p> <p>Background</p> <p>Linkage maps are an integral resource for dissection of complex genetic traits in plant and animal species. Canonical map construction follows a well-established workflow: an initial discovery phase where genetic markers are mined from a small pool of individuals, followed by genotyping of selected mapping populations using sets of marker panels. A newly developed sequence-based marker technology, Restriction site Associated DNA (RAD), enables synchronous single nucleotide polymorphism (SNP) marker discovery and genotyping using massively parallel sequencing. The objective of this research was to assess the utility of RAD markers for linkage map construction, employing barley as a model system. Using the published high density EST-based SNP map in the Oregon Wolfe Barley (OWB) mapping population as a reference, we created a RAD map using a limited set of prior markers to establish linakge group identity, integrated the RAD and prior data, and used both maps for detection of quantitative trait loci (QTL).</p> <p>Results</p> <p>Using the RAD protocol in tandem with the Illumina sequence by synthesis platform, a total of 530 SNP markers were identified from initial scans of the OWB parental inbred lines - the "dominant" and "recessive" marker stocks - and scored in a 93 member doubled haploid (DH) mapping population. RAD sequence data from the structured population was converted into allele genotypes from which a genetic map was constructed. The assembled RAD-only map consists of 445 markers with an average interval length of 5 cM, while an integrated map includes 463 RAD loci and 2383 prior markers. Sequenced RAD markers are distributed across all seven chromosomes, with polymorphic loci emanating from both coding and noncoding regions in the <it>Hordeum </it>genome. Total map lengths are comparable and the order of common markers is identical in both maps. The same large-effect QTL for reproductive fitness traits were detected with both maps and the majority of these QTL were coincident with a dwarfing gene (<it>ZEO) </it>and the <it>VRS1 </it>gene, which determines the two-row and six-row germplasm groups of barley.</p> <p>Conclusions</p> <p>We demonstrate how sequenced RAD markers can be leveraged to produce high quality linkage maps for detection of single gene loci and QTLs. By combining SNP discovery and genotyping into parallel sequencing events, RAD markers should be a useful molecular breeding tool for a range of crop species. Expected improvements in cost and throughput of second and third-generation sequencing technologies will enable more powerful applications of the sequenced RAD marker system, including improvements in <it>de novo </it>genome assembly, development of ultra-high density genetic maps and association mapping.</p
Differential Detection of Genetic Loci Underlying Stem and Root Lignin Content in Populus
In this study, we established a comprehensive genetic map with a large number of progeny from a three-generation hybrid Populus intercross, and phenotyped the lignin content, S/G ratio and 28 cell wall subcomponents both in stems and roots for the mapping individuals. Phenotypic analysis revealed that lignin content and syringyl-to-guaiacyl (S/G) ratio using pyrolysis molecular beam mass spectroscopy (pyMBMS) varied among mapping individuals. Phenotypic analysis revealed that stem lignin content is significantly higher than that in root and the quantified traits can be classified into four distinct groups, with strong correlations observed among components within organs. Altogether, 179 coordinating QTLs were detected, and they were co-localized into 49 genetic loci, 27 of which appear to be pleiotropic. Many of the detected genetic loci were detected differentially in stem and root. This is the first report of separate genetic loci controlling cell wall phenotypes above and below ground. These results suggest that it may be possible to modify lignin content and composition via breed and/or engineer as a means of simultaneously improving Populus for cellulosic ethanol production and carbon sequestration
Dissection of QTL effects for root traits using a chromosome arm-specific mapping population in bread wheat
A high-resolution chromosome arm-specific mapping population was used in an attempt to locate/detect gene(s)/QTL for different root traits on the short arm of rye chromosome 1 (1RS) in bread wheat. This population consisted of induced homoeologous recombinants of 1RS with 1BS, each originating from a different crossover event and distinct from all other recombinants in the proportions of rye and wheat chromatin present. It provides a simple and powerful approach to detect even small QTL effects using fewer progeny. A promising empirical Bayes method was applied to estimate additive and epistatic effects for all possible marker pairs simultaneously in a single model. This method has an advantage for QTL analysis in minimizing the error variance and detecting interaction effects between loci with no main effect. A total of 15 QTL effects, 6 additive and 9 epistatic, were detected for different traits of root length and root weight in 1RS wheat. Epistatic interactions were further partitioned into inter-genomic (wheat and rye alleles) and intra-genomic (rye–rye or wheat–wheat alleles) interactions affecting various root traits. Four common regions were identified involving all the QTL for root traits. Two regions carried QTL for almost all the root traits and were responsible for all the epistatic interactions. Evidence for inter-genomic interactions is provided. Comparison of mean values supported the QTL detection
Characterization of the natural variation in Arabidopsis thaliana metabolome by the analysis of metabolic distance
Metabolite fingerprinting is widely used to unravel the chemical characteristics of biological samples. Multivariate data analysis and other statistical tools are subsequently used to analyze and visualize the plasticity of the metabolome and/or the relationship between those samples. However, there are limitations to these approaches for example because of the multi-dimensionality of the data that makes interpretation of the data obtained from untargeted analysis almost impossible for an average human being. These limitations make the biological information that is of prime importance in untargeted studies be partially exploited. Even in the case of full exploitation, current methods for relationship elucidation focus mainly on between groups variation and differences. Therefore, a measure that is capable of exploiting both between- and within-group biological variation would be of great value. Here, we examined the natural variation in the metabolome of nine Arabidopsis thaliana accessions grown under various environmental conditions and established a measure for the metabolic distance between accessions and across environments. This data analysis approach shows that there is just a minor correlation between genetic and metabolic diversity of the nine accessions. On the other hand, it delivers so far in Arabidopsis unexplored chemical information and is shown to be biologically relevant for resistance studies
- …