52 research outputs found

    Prediction of a deletion copy number variant by a dense SNP panel

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>A newly recognized type of genetic variation, Copy Number Variation (CNV), is detected in mammalian genomes, e.g. the cattle genome. This form of variation can potentially cause phenotypic variation. Our objective was to determine whether dense SNP (single nucleotide polymorphisms) panels can capture the genetic variation due to a simple bi-allelic CNV, with the prospect of including the effect of such structural variations into genomic predictions.</p> <p>Methods</p> <p>A deletion type CNV on bovine chromosome 6 was predicted from its neighboring SNP with a multiple regression model. Our dataset consisted of CNV genotypes of 1,682 cows, along with 100 surrounding SNP genotypes. A prediction model was fitted considering 10 to 100 surrounding SNP and the accuracy obtained directly from the model was confirmed by cross-validation.</p> <p>Results and conclusions</p> <p>The accuracy of prediction increased with an increasing number of SNP in the model and the predicted accuracies were similar to those obtained by cross-validation. A substantial increase in accuracy was observed when the number of SNP increased from 10 to 50 but thereafter the increase was smaller, reaching the highest accuracy (0.94) with 100 surrounding SNP. Thus, we conclude that the genotype of a deletion type CNV and its putative QTL effect can be predicted with a maximum accuracy of 0.94 from surrounding SNP. This high prediction accuracy suggests that genetic variation due to simple deletion CNV is well captured by dense SNP panels. Since genomic selection relies on the availability of a dense marker panel with markers in close linkage disequilibrium to the QTL in order to predict their genetic values, we also discuss opportunities for genomic selection to predict the effects of CNV by dense SNP panels, when CNV cause variation in quantitative traits.</p

    Genomic dissection of maternal, additive and non-additive genetic effects for growth and carcass traits in Nile tilapia

    Get PDF
    International audienceAbstractBackgroundThe availability of both pedigree and genomic sources of information for animal breeding and genetics has created new challenges in understanding how they can be best used and interpreted. This study estimated genetic variance components based on genomic information and compared these to the variance components estimated from pedigree alone in a population generated to estimate non-additive genetic variance. Furthermore, the study examined the impact of the assumptions of Hardy–Weinberg equilibrium (HWE) on estimates of genetic variance components. For the first time, the magnitude of inbreeding depression for important commercial traits in Nile tilapia was estimated by using genomic data.ResultsThe study estimated the non-additive genetic variance in a Nile tilapia population of full-sib families and, when present, it was almost entirely represented by additive-by-additive epistatic variance, although in pedigree studies this non-additive variance is commonly assumed to arise from dominance. For body depth (BD) and body weight at harvest (BWH), the proportion of additive-by-additive epistatic to phenotypic variance was estimated to be 0.15 and 0.17 using genomic data (P < 0.05). In addition, with genomic data, the maternal variance (P < 0.05) for BD, BWH, body length (BL) and fillet weight (FW) explained approximately 10% of the phenotypic variances, which was comparable to pedigree-based estimates. The study also showed the detrimental effects of inbreeding on commercial traits of tilapia, which was estimated to reduce trait values by 1.1, 0.9, 0.4 and 0.3% per 1% increase in the individual homozygosity for FW, BWH, BD and BL, respectively. The presence of inbreeding depression but lack of dominance variance was consistent with an infinitesimal dominance model for the traits.ConclusionsThe benefit of including non-additive genetic effects for genetic evaluations in tilapia breeding schemes is not evident from these findings, but the observed inbreeding depression points to a role for reciprocal recurrent selection. Commercially, this conclusion will depend on the scheme’s operational costs and resources. The creation of maternal lines in Tilapia breeding schemes may be a possibility if the variation associated with maternal effects is heritable

    Genomic prediction in an admixed population of Atlantic salmon (Salmo salar)

    Get PDF
    Reliability of genomic selection (GS) models was tested in an admixed population of Atlantic salmon, originating from crossing of several wild subpopulations. The models included ordinary genomic BLUP models (GBLUP), using genome-wide SNP markers of varying densities (1 to 220k), a genomic identity-by-descent model (IBD-GS), using linkage analysis of sparse genome-wide markers, as well as a classical pedigree-based model. Reliabilities of the models were compared through 5-fold cross-validation. The traits studied were salmon lice (Lepeophtheirus salmonis) resistance (LR), measured as (log) density on the skin and fillet color (FC), with respective estimated heritabilities of 0.14 and 0.43. All genomic models outperformed the classical pedigree-based model, for both traits and at all marker densities. However, the relative improvement differed considerably between traits, models and marker densities. For the highly heritable FC, the IBD-GS had similar reliability as GBLUP at high marker densities (>22k). In contrast, for the lowly heritable LR, IBD-GS was clearly inferior to GBLUP, irrespective of marker density. Hence, GBLUP was robust to marker density for the lowly heritable LR, but sensitive to marker density for the highly heritable FC. We hypothesize that this phenomenon may be explained by historical admixture of different founder populations, expected to reduce short-range LD and induce long-range LD. The relative importance of LD/relationship information is expected to decrease/increase with increasing heritability of the trait. Still, using the ordinary GBLUP, the typical long-range LD of an admixed population may be effectively captured by sparse markers, while efficient utilization of relationship information may require denser markers (e.g., 22k or more)

    Simultaneous fitting of genomic-BLUP and Bayes-C components in a genomic prediction model

    Get PDF
    International audienceAbstractBackgroundThe rapid adoption of genomic selection is due to two key factors: availability of both high-throughput dense genotyping and statistical methods to estimate and predict breeding values. The development of such methods is still ongoing and, so far, there is no consensus on the best approach. Currently, the linear and non-linear methods for genomic prediction (GP) are treated as distinct approaches. The aim of this study was to evaluate the implementation of an iterative method (called GBC) that incorporates aspects of both linear [genomic-best linear unbiased prediction (G-BLUP)] and non-linear (Bayes-C) methods for GP. The iterative nature of GBC makes it less computationally demanding similar to other non-Markov chain Monte Carlo (MCMC) approaches. However, as a Bayesian method, GBC differs from both MCMC- and non-MCMC-based methods by combining some aspects of G-BLUP and Bayes-C methods for GP. Its relative performance was compared to those of G-BLUP and Bayes-C.MethodsWe used an imputed 50 K single-nucleotide polymorphism (SNP) dataset based on the Illumina Bovine50K BeadChip, which included 48,249 SNPs and 3244 records. Daughter yield deviations for somatic cell count, fat yield, milk yield, and protein yield were used as response variables.ResultsGBC was frequently (marginally) superior to G-BLUP and Bayes-C in terms of prediction accuracy and was significantly better than G-BLUP only for fat yield. On average across the four traits, GBC yielded a 0.009 and 0.006 increase in prediction accuracy over G-BLUP and Bayes-C, respectively. Computationally, GBC was very much faster than Bayes-C and similar to G-BLUP.ConclusionsOur results show that incorporating some aspects of G-BLUP and Bayes-C in a single model can improve accuracy of GP over the commonly used method: G-BLUP. Generally, GBC did not statistically perform better than G-BLUP and Bayes-C, probably due to the close relationships between reference and validation individuals. Nevertheless, it is a flexible tool, in the sense, that it simultaneously incorporates some aspects of linear and non-linear models for GP, thereby exploiting family relationships while also accounting for linkage disequilibrium between SNPs and genes with large effects. The application of GBC in GP merits further exploration

    Within- and across-breed genomic prediction using whole-genome sequence and single nucleotide polymorphism panels

    Get PDF
    International audienceBackground Currently, genomic prediction in cattle is largely based on panels of about 54k single nucleotide polymorphisms (SNPs). However with the decreasing costs of and current advances in next-generation sequencing technologies, whole-genome sequence (WGS) data on large numbers of individuals is within reach. Availability of such data provides new opportunities for genomic selection, which need to be explored.MethodsThis simulation study investigated how much predictive ability is gained by using WGS data under scenarios with QTL (quantitative trait loci) densities ranging from 45 to 132 QTL/Morgan and heritabilities ranging from 0.07 to 0.30, compared to different SNP densities, with emphasis on divergent dairy cattle breeds with small populations. The relative performances of best linear unbiased prediction (SNP-BLUP) and of a variable selection method with a mixture of two normal distributions (MixP) were also evaluated. Genomic predictions were based on within-population, across-population, and multi-breed reference populations.ResultsThe use of WGS data for within-population predictions resulted in small to large increases in accuracy for low to moderately heritable traits. Depending on heritability of the trait, and on SNP and QTL densities, accuracy increased by up to 31 %. The advantage of WGS data was more pronounced (7 to 92 % increase in accuracy depending on trait heritability, SNP and QTL densities, and time of divergence between populations) with a combined reference population and when using MixP. While MixP outperformed SNP-BLUP at 45 QTL/Morgan, SNP-BLUP was as good as MixP when QTL density increased to 132 QTL/Morgan.ConclusionsOur results show that, genomic predictions in numerically small cattle populations would benefit from a combination of WGS data, a multi-breed reference population, and a variable selection method

    Use and optimization of different sources of information for genomic prediction

    Get PDF
    Abstract Background Molecular data is now commonly used to predict breeding values (BV). Various methods to calculate genomic relationship matrices (GRM) have been developed, with some studies proposing regression of coefficients back to the reference matrix of pedigree-based relationship coefficients (A). The objective was to compare the utility of two GRM: a matrix based on linkage analysis (LA) and anchored to the pedigree, i.e. GLA,{\mathbf{G}}_{{{\mathbf{LA}}}} , G LA , and a matrix based on linkage disequilibrium (LD), i.e. GLD{\mathbf{G}}_{{{\mathbf{LD}}}} G LD , using genomic and phenotypic data collected on 5416 broiler chickens. Furthermore, the effects of regressing the coefficients of GLD{\mathbf{G}}_{{{\mathbf{LD}}}} G LD back to A (LDA) and to GLA{\mathbf{G}}_{{{\mathbf{LA}}}} G LA (LDLA) were evaluated, using a range of weighting factors. The performance of the matrices and their composite products was assessed by the fit of the models to the data, and the empirical accuracy and bias of the BV that they predicted. The sensitivity to marker choice was examined by using two chips of equal density but including different single nucleotide polymorphisms (SNPs). Results The likelihood of models using GRM and composite matrices exceeded the likelihood of models based on pedigree alone and was highest with intermediate weighting factors for both the LDA and LDLA approaches. For these data, empirical accuracies were not strongly affected by the weighting factors, although they were highest when different sources of information were combined. The optimum weighting factors depended on the type of matrices used, as well as on the choice of SNPs from which the GRM were constructed. Prediction bias was strongly affected by the chip used and less by the form of the GRM. Conclusions Our findings provide an empirical comparison of the efficacy of pedigree and genomic predictions in broiler chickens and examine the effects of fitting GRM with coefficients regressed back to a reference anchored to the pedigree, either A or GLA{\mathbf{G}}_{{{\mathbf{LA}}}} G LA . For the analysed dataset, the best results were obtained when GLD{\mathbf{G}}_{{{\mathbf{LD}}}} G LD was combined with relationships in A or GLA{\mathbf{G}}_{{{\mathbf{LA}}}} G LA , with optimum weighting factors that depended on the choice of SNPs used. The optimum weighting factor for broiler body weight differed from weighting factors that were based on the density of SNPs and theoretically derived using generalised assumptions

    Large scale genome-wide association and LDLA mapping study identifies QTLs for boar taint and related sex steroids

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Boar taint is observed in a high proportion of uncastrated male pigs and is characterized by an unpleasant odor/flavor in cooked meat, primarily caused by elevated levels of androstenone and skatole. Androstenone is a steroid produced in the testis in parallel with biosynthesis of other sex steroids like testosterone and estrogens. This represents a challenge when performing selection against androstenone in breeding programs, without simultaneously decreasing levels of other steroids. The aim of this study was to use high-density genome wide association (GWA) in combination with linkage disequilibrium-linkage analysis (LDLA) to identify quantitative trait loci (QTL) associated with boar taint compounds and related sex steroids in commercial Landrace (n = 1,251) and Duroc (n = 918) breeds.</p> <p>Results</p> <p>Altogether, 14 genome wide significant (GWS) QTL regions for androstenone in subcutaneous fat were obtained from the LDLA study in Landrace and 14 GWS QTL regions in Duroc. LDLA analysis revealed that 7 of these QTL regions, located on SSC 1, 2, 3, 7 and 15, were obtained in both breeds. All 14 GWS androstenone QTLs in Landrace are also affecting the estrogens at chromosome wise significance (CWS) or GWS levels, while in Duroc, 3 of the 14 QTLs affect androstenone without affecting any of the estrogens. For skatole, 10 and 4 QTLs were GWS in the LDLA analysis for Landrace and Duroc respectively, with 4 of these detected in both breeds. The GWS QTLs for skatole obtained by LDLA are located at SSC 1, 5, 6, 7, 10, 11, 13 and 14.</p> <p>Conclusion</p> <p>This is the first report applying the Porcine 60 K SNP array for simultaneous analysis of boar taint compounds and related sex hormones, using both GWA and LDLA approaches. Several QTLs are involved in regulation of androstenone and skatole, and most of the QTLs for androstenone are also affecting the levels of estrogens. Seven QTLs for androstenone were detected in one breed and confirmed in the other, i.e. in an independent sample, although the majority of QTLs are breed specific. Most QTLs for skatole do not negatively affect other sex hormones and should be easier to implement into the breeding scheme.</p
    corecore