121 research outputs found

    In silico genotyping of the maize nested association mapping population

    Get PDF
    Nested Association Mapping (NAM) has been proposed as a means to combine the power of linkage mapping with the resolution of association mapping. It is enabled through sequencing or array genotyping of parental inbred lines while using low-cost, low-density genotyping technologies for their segregating progenies. For purposes of data analyses of NAM populations, parental genotypes at a large number of Single Nucleotide Polymorphic (SNP) loci need to be projected to their segregating progeny. Herein we demonstrate how approximately 0.5 million SNPs that have been genotyped in 26 parental lines of the publicly available maize NAM population can be projected onto their segregating progeny using only 1,106 SNP loci that have been genotyped in both the parents and their 5,000 progeny. The challenge is to estimate both the genotype and genetic location of the parental SNP genotypes in segregating progeny. Both challenges were met by estimating their expected genotypic values conditional on observed flanking markers through the use of both physical and linkage maps. About 90%, of 500,000 genotyped SNPs from the maize HapMap project, were assigned linkage map positions using linear interpolation between the maize Accessioned Gold Path (AGP) and NAM linkage maps. Of these, almost 70% provided high probability estimates of genotypes in almost 5,000 recombinant inbred lines

    Strategies for implementing genomic selection in family-based aquaculture breeding schemes: double haploid sib test populations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Simulation studies have shown that accuracy and genetic gain are increased in genomic selection schemes compared to traditional aquaculture sib-based schemes. In genomic selection, accuracy of selection can be maximized by increasing the precision of the estimation of SNP effects and by maximizing the relationships between test sibs and candidate sibs. Another means of increasing the accuracy of the estimation of SNP effects is to create individuals in the test population with extreme genotypes. The latter approach was studied here with creation of double haploids and use of non-random mating designs.</p> <p>Methods</p> <p>Six alternative breeding schemes were simulated in which the design of the test population was varied: test sibs inherited maternal (<it>Mat</it>), paternal (<it>Pat</it>) or a mixture of maternal and paternal (<it>MatPat</it>) double haploid genomes or test sibs were obtained by maximum coancestry mating (<it>MaxC</it>), minimum coancestry mating (<it>MinC</it>), or random (<it>RAND</it>) mating. Three thousand test sibs and 3000 candidate sibs were genotyped. The test sibs were recorded for a trait that could not be measured on the candidates and were used to estimate SNP effects. Selection was done by truncation on genome-wide estimated breeding values and 100 individuals were selected as parents each generation, equally divided between both sexes.</p> <p>Results</p> <p>Results showed a 7 to 19% increase in selection accuracy and a 6 to 22% increase in genetic gain in the <it>MatPat</it> scheme compared to the <it>RAND</it> scheme. These increases were greater with lower heritabilities. Among all other scenarios, i.e. <it>Mat, Pat, MaxC</it>, and <it>MinC</it>, no substantial differences in selection accuracy and genetic gain were observed.</p> <p>Conclusions</p> <p>In conclusion, a test population designed with a mixture of paternal and maternal double haploids, i.e. the <it>MatPat</it> scheme, increases substantially the accuracy of selection and genetic gain. This will be particularly interesting for traits that cannot be recorded on the selection candidates and require the use of sib tests, such as disease resistance and meat quality.</p

    Extension of the bayesian alphabet for genomic selection

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Two Bayesian methods, BayesC<it>π </it>and BayesD<it>π</it>, were developed for genomic prediction to address the drawback of BayesA and BayesB regarding the impact of prior hyperparameters and treat the prior probability <it>π </it>that a SNP has zero effect as unknown. The methods were compared in terms of inference of the number of QTL and accuracy of genomic estimated breeding values (GEBVs), using simulated scenarios and real data from North American Holstein bulls.</p> <p>Results</p> <p>Estimates of <it>π </it>from BayesC<it>π</it>, in contrast to BayesD<it>π</it>, were sensitive to the number of simulated QTL and training data size, and provide information about genetic architecture. Milk yield and fat yield have QTL with larger effects than protein yield and somatic cell score. The drawback of BayesA and BayesB did not impair the accuracy of GEBVs. Accuracies of alternative Bayesian methods were similar. BayesA was a good choice for GEBV with the real data. Computing time was shorter for BayesC<it>π </it>than for BayesD<it>π</it>, and longest for our implementation of BayesA.</p> <p>Conclusions</p> <p>Collectively, accounting for computing effort, uncertainty as to the number of QTL (which affects the GEBV accuracy of alternative methods), and fundamental interest in the number of QTL underlying quantitative traits, we believe that BayesC<it>π </it>has merit for routine applications.</p

    Epistasis: Obstacle or Advantage for Mapping Complex Traits?

    Get PDF
    Identification of genetic loci in complex traits has focused largely on one-dimensional genome scans to search for associations between single markers and the phenotype. There is mounting evidence that locus interactions, or epistasis, are a crucial component of the genetic architecture of biologically relevant traits. However, epistasis is often viewed as a nuisance factor that reduces power for locus detection. Counter to expectations, recent work shows that fitting full models, instead of testing marker main effect and interaction components separately, in exhaustive multi-locus genome scans can have higher power to detect loci when epistasis is present than single-locus scans, and improvement that comes despite a much larger multiple testing alpha-adjustment in such searches. We demonstrate, both theoretically and via simulation, that the expected power to detect loci when fitting full models is often larger when these loci act epistatically than when they act additively. Additionally, we show that the power for single locus detection may be improved in cases of epistasis compared to the additive model. Our exploration of a two step model selection procedure shows that identifying the true model is difficult. However, this difficulty is certainly not exacerbated by the presence of epistasis, on the contrary, in some cases the presence of epistasis can aid in model selection. The impact of allele frequencies on both power and model selection is dramatic

    QTL linkage analysis of connected populations using ancestral marker and pedigree information

    Get PDF
    The common assumption in quantitative trait locus (QTL) linkage mapping studies that parents of multiple connected populations are unrelated is unrealistic for many plant breeding programs. We remove this assumption and propose a Bayesian approach that clusters the alleles of the parents of the current mapping populations from locus-specific identity by descent (IBD) matrices that capture ancestral marker and pedigree information. Moreover, we demonstrate how the parental IBD data can be incorporated into a QTL linkage analysis framework by using two approaches: a Threshold IBD model (TIBD) and a Latent Ancestral Allele Model (LAAM). The TIBD and LAAM models are empirically tested via numerical simulation based on the structure of a commercial maize breeding program. The simulations included a pilot dataset with closely linked QTL on a single linkage group and 100 replicated datasets with five linkage groups harboring four unlinked QTL. The simulation results show that including parental IBD data (similarly for TIBD and LAAM) significantly improves the power and particularly accuracy of QTL mapping, e.g., position, effect size and individuals’ genotype probability without significantly increasing computational demand

    Best Linear Unbiased Prediction of Genomic Breeding Values Using a Trait-Specific Marker-Derived Relationship Matrix

    Get PDF
    With the availability of high density whole-genome single nucleotide polymorphism chips, genomic selection has become a promising method to estimate genetic merit with potentially high accuracy for animal, plant and aquaculture species of economic importance. With markers covering the entire genome, genetic merit of genotyped individuals can be predicted directly within the framework of mixed model equations, by using a matrix of relationships among individuals that is derived from the markers. Here we extend that approach by deriving a marker-based relationship matrix specifically for the trait of interest

    Association mapping for yield and grain quality traits in rice (Oryza sativa L.)

    Get PDF
    Association analysis was applied to a panel of accessions of Embrapa Rice Core Collection (ERiCC) with 86 SSR and field data from two experiments. A clear subdivision between lowland and upland accessions was apparent, thereby indicating the presence of population structure. Thirty-two accessions with admixed ancestry were identified through structure analysis, these being discarded from association analysis, thus leaving 210 accessions subdivided into two panels. The association of yield and grain-quality traits with SSR was undertaken with a mixed linear model, with markers and subpopulation as fixed factors, and kinship matrix as a random factor. Eight markers from the two appraised panels showed significant association with four different traits, although only one (RM190) maintained the marker-trait association across years and cultivation. The significant association detected between amylose content and RM190 was in agreement with previous QTL analyses in the literature. Herein, the feasibility of undertaking association analysis in conjunction with germplasm characterization was demonstrated, even when considering low marker density. The high linkage disequilibrium expected in rice lines and cultivars facilitates the detection of marker-trait associations for implementing marker assisted selection, and the mining of alleles related to important traits in germplasm

    Mixed model approaches for the identification of QTLs within a maize hybrid breeding program

    Get PDF
    Two outlines for mixed model based approaches to quantitative trait locus (QTL) mapping in existing maize hybrid selection programs are presented: a restricted maximum likelihood (REML) and a Bayesian Markov Chain Monte Carlo (MCMC) approach. The methods use the in-silico-mapping procedure developed by Parisseaux and Bernardo (2004) as a starting point. The original single-point approach is extended to a multi-point approach that facilitates interval mapping procedures. For computational and conceptual reasons, we partition the full set of relationships from founders to parents of hybrids into two types of relations by defining so-called intermediate founders. QTL effects are defined in terms of those intermediate founders. Marker based identity by descent relationships between intermediate founders define structuring matrices for the QTL effects that change along the genome. The dimension of the vector of QTL effects is reduced by the fact that there are fewer intermediate founders than parents. Furthermore, additional reduction in the number of QTL effects follows from the identification of founder groups by various algorithms. As a result, we obtain a powerful mixed model based statistical framework to identify QTLs in genetic backgrounds relevant to the elite germplasm of a commercial breeding program. The identification of such QTLs will provide the foundation for effective marker assisted and genome wide selection strategies. Analyses of an example data set show that QTLs are primarily identified in different heterotic groups and point to complementation of additive QTL effects as an important factor in hybrid performance

    Genetic and Physiological Analysis of Iron Biofortification in Maize Kernels

    Get PDF
    BACKGROUND: Maize is a major cereal crop widely consumed in developing countries, which have a high prevalence of iron (Fe) deficiency anemia. The major cause of Fe deficiency in these countries is inadequate intake of bioavailable Fe, where poverty is a major factor. Therefore, biofortification of maize by increasing Fe concentration and or bioavailability has great potential to alleviate this deficiency. Maize is also a model system for genomic research and thus allows the opportunity for gene discovery. Here we describe an integrated genetic and physiological analysis of Fe nutrition in maize kernels, to identify loci that influence grain Fe concentration and bioavailability. METHODOLOGY: Quantitative trait locus (QTL) analysis was used to dissect grain Fe concentration (FeGC) and Fe bioavailability (FeGB) from the Intermated B73 × Mo17 (IBM) recombinant inbred (RI) population. FeGC was determined by ion coupled argon plasma emission spectroscopy (ICP). FeGB was determined by an in vitro digestion/Caco-2 cell line bioassay. CONCLUSIONS: Three modest QTL for FeGC were detected, in spite of high heritability. This suggests that FeGC is controlled by many small QTL, which may make it a challenging trait to improve by marker assisted breeding. Ten QTL for FeGB were identified and explained 54% of the variance observed in samples from a single year/location. Three of the largest FeGB QTL were isolated in sister derived lines and their effect was observed in three subsequent seasons in New York. Single season evaluations were also made at six other sites around North America, suggesting the enhancement of FeGB was not specific to our farm site. FeGB was not correlated with FeGC or phytic acid, suggesting that novel regulators of Fe nutrition are responsible for the differences observed. Our results indicate that iron biofortification of maize grain is achievable using specialized phenotyping tools and conventional plant breeding techniques
    corecore