Search CORE

199 research outputs found

Large-scale genomic prediction using singular value decomposition of the genotype matrix

Author: A Legarra
CR Henderson
DC Lay
G Campos de los
I Misztal
I Misztal
Ismo Strandén
Jørgen Ødegård
L Tusell
M Kimura
OF Christensen
P VanRaden
PM VanRaden
PM VanRaden
RL Fernando
T Hastie
T Meuwissen
T Meuwissen
THE Meuwissen
THE Meuwissen
Theo H. E. Meuwissen
Ulf Indahl
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

The importance of identity-by-state information for the accuracy of genomic selection

Author: Alessandro Bagnato
AR Gilmour
BJ Hayes
BL Harris
D Berry
D Habier
D Habier
DJ Garrick
HD Daetwyler
John A Woolliams
Jørgen Ødegård
M Goddard
Marlies Dolezal
ME Goddard
MS Lund
PM VanRaden
R Makowsky
RL Fernando
Sergio I Roman-Ponce
T Luan
T Meuwissen
THE Meuwissen
THE Meuwissen
Theo HE Meuwissen
Tu Luan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Abstract Background It is commonly assumed that prediction of genome-wide breeding values in genomic selection is achieved by capitalizing on linkage disequilibrium between markers and QTL but also on genetic relationships. Here, we investigated the reliability of predicting genome-wide breeding values based on population-wide linkage disequilibrium information, based on identity-by-descent relationships within the known pedigree, and to what extent linkage disequilibrium information improves predictions based on identity-by-descent genomic relationship information. Methods The study was performed on milk, fat, and protein yield, using genotype data on 35 706 SNP and deregressed proofs of 1086 Italian Brown Swiss bulls. Genome-wide breeding values were predicted using a genomic identity-by-state relationship matrix and a genomic identity-by-descent relationship matrix (averaged over all marker loci). The identity-by-descent matrix was calculated by linkage analysis using one to five generations of pedigree data. Results We showed that genome-wide breeding values prediction based only on identity-by-descent genomic relationships within the known pedigree was as or more reliable than that based on identity-by-state, which implicitly also accounts for genomic relationships that occurred before the known pedigree. Furthermore, combining the two matrices did not improve the prediction compared to using identity-by-descent alone. Including different numbers of generations in the pedigree showed that most of the information in genome-wide breeding values prediction comes from animals with known common ancestors less than four generations back in the pedigree. Conclusions Our results show that, in pedigreed breeding populations, the accuracy of genome-wide breeding values obtained by identity-by-descent relationships was not improved by identity-by-state information. Although, in principle, genomic selection based on identity-by-state does not require pedigree data, it does use the available pedigree structure. Our findings may explain why the prediction equations derived for one breed may not predict accurate genome-wide breeding values when applied to other breeds, since family structures differ among breeds.</p

Crossref

AIR Universita degli studi di Milano

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Genetic prediction of complex traits: integrating infinitesimal and marked genetic effects

Author: A Legarra
Clément Carré
CR Henderson
CR Henderson
D Gianola
David Cros
Eduardo Manfredi
Fabrice Gamboa
G Los campos De
GR Abecasis
Gregor Gorjanc
I Aguilar
John Michael Hickey
JT Yang
LN Hazel
ME Goddard
ME Goddard
MS Lund
P Vanraden
RL Fernando
RL Quaas
RV Rohlfs
SI Duchemin
TH Meuwissen
TH Meuwissen
TH Meuwissen
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Genetic prediction for complex traits is usually based on models including individual (infinitesimal) or marker effects. Here, we concentrate on models including both the individual and the marker effects. In particular, we develop a ''Mendelian segregation'' model combining infinitesimal effects for base individuals and realized Mendelian sampling in descendants described by the available DNA data. The model is illustrated with an example and the analyses of a public simulated data file. Further, the potential contribution of such models is assessed by simulation. Accuracy, measured as the correlation between true (simulated) and predicted genetic values, was similar for all models compared under different genetic backgrounds. As expected, the segregation model is worthwhile when markers capture a low fraction of total genetic variance. (Résumé d'auteur

Crossref

Springer - Publisher Connector

HAL Descartes

PubMed Central

Edinburgh Research Explorer

Agritrop

Repository of the University of Ljubljana

HAL-CIRAD

Estimating genetic diversity across the neutral genome with the use of dense marker maps

Author: APW De Roos
D Habier
DH Reed
DS Falconer
FAO
JA Woolliams
Jack J Windig
JCM Dekkers
JJ Windig
JK Oldenbroek
Krista A Engelsma
L Grapes
M Toro
MA Toro
MA Toro
Mario PL Calus
MPL Calus
MS Khatkar
Piter Bijma
RL Fernando
THE Meuwissen
THE Meuwissen
THE Meuwissen
WG Hill
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background With the advent of high throughput DNA typing, dense marker maps have become available to investigate genetic diversity on specific regions of the genome. The aim of this paper was to compare two marker based estimates of the genetic diversity in specific genomic regions lying in between markers: IBD-based genetic diversity and heterozygosity. Methods A computer simulated population was set up with individuals containing a single 1-Morgan chromosome and 1665 SNP markers and from this one, an additional population was produced with a lower marker density i.e. 166 SNP markers. For each marker interval based on adjacent markers, the genetic diversity was estimated either by IBD probabilities or heterozygosity. Estimates were compared to each other and to the true genetic diversity. The latter was calculated for a marker in the middle of each marker interval that was not used to estimate genetic diversity. Results The simulated population had an average minor allele frequency of 0.28 and an LD (r2) of 0.26, comparable to those of real livestock populations. Genetic diversities estimated by IBD probabilities and by heterozygosity were positively correlated, and correlations with the true genetic diversity were quite similar for the simulated population with a high marker density, both for specific regions (r = 0.19-0.20) and large regions (r = 0.61-0.64) over the genome. For the population with a lower marker density, the correlation with the true genetic diversity turned out to be higher for the IBD-based genetic diversity. Conclusions Genetic diversities of ungenotyped regions of the genome (i.e. between markers) estimated by IBD-based methods and heterozygosity give similar results for the simulated population with a high marker density. However, for a population with a lower marker density, the IBD-based method gives a better prediction, since variation and recombination between markers are missed with heterozygosity.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Effects of the number of markers per haplotype and clustering of haplotypes on the accuracy of QTL mapping and prediction of genomic breeding values

Author: A Legarra
Addie LJ Vereijken
B Hayes
BJ Hayes
Chris Schrooten
D Habier
DS Falconer
E Uleberg
Egbert F Knol
EJ Pollak
HG deVries
Jack J Windig
K Yu
L Grapes
L Grapes
L Ronnegard
M Lynch
Mario PL Calus
MPL Calus
RL Fernando
Roel F Veerkamp
T Druet
THE Meuwissen
THE Meuwissen
THE Meuwissen
Theo HE Meuwissen
TR Solberg
WG Hill
WM Muir
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

The aim of this paper was to compare the effect of haplotype definition on the precision of QTL-mapping and on the accuracy of predicted genomic breeding values. In a multiple QTL model using identity-by-descent (IBD) probabilities between haplotypes, various haplotype definitions were tested i.e. including 2, 6, 12 or 20 marker alleles and clustering base haplotypes related with an IBD probability of > 0.55, 0.75 or 0.95. Simulated data contained 1100 animals with known genotypes and phenotypes and 1000 animals with known genotypes and unknown phenotypes. Genomes comprising 3 Morgan were simulated and contained 74 polymorphic QTL and 383 polymorphic SNP markers with an average r2 value of 0.14 between adjacent markers. The total number of haplotypes decreased up to 50% when the window size was increased from two to 20 markers and decreased by at least 50% when haplotypes related with an IBD probability of > 0.55 instead of > 0.95 were clustered. An intermediate window size led to more precise QTL mapping. Window size and clustering had a limited effect on the accuracy of predicted total breeding values, ranging from 0.79 to 0.81. Our conclusion is that different optimal window sizes should be used in QTL-mapping versus genome-wide breeding value prediction

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

Genomic predictions based on animal models using genotype imputation on a national scale in Norwegian Red cattle

Author: A Legarra
A Legarra
E Mantysaari
I Aguilar
JE Powell
JH Steiger
JM Hickey
Jørgen Ødegård
LK Matukumalli
M Sargolzaei
Morten Svendsen
OF Christensen
P VanRaden
PM VanRaden
R Mrode
RA Westell
RJ Kerr
RL Fernando
RL Fernando
SR Browning
T Luan
T Meuwissen
THE Meuwissen
THE Meuwissen
Theo H. E. Meuwissen
Trygve Solberg
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Inclusion of genetically identical animals to a numerator relationship matrix and modification of its inverse

Author: B Tier
BW Brown
BW Kennedy
CEG Bruder
CR Henderson
CR Henderson
I Misztal
K Takeda
Kazuhiro Yasuda
RL Quaas
Takuro Oikawa
THE Meuwissen
TR Famula
WM Rideout III
X Yang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

In the field of animal breeding, estimation of genetic parameters and prediction of breeding values are routinely conducted by analyzing quantitative traits. Using an animal model and including the direct inverse of a numerator relationship matrix (NRM) into a mixed model has made these analyses possible. However, a method including a genetically identical animal (GIA) in NRM if genetic relationships between pairs of GIAs are not perfect, is still lacking. Here, we describe a method to incorporate GIAs into NRM using a K matrix in which diagonal elements are set to 1.0, off-diagonal elements between pairs of GIAs to (1-x) and the other elements to 0, where x is a constant less than 0.05. The inverse of the K matrix is then calculated directly by a simple formula. Thus, the inverse of the NRM is calculated by the products of the lower triangular matrix that identifies the parents of each individual, its transpose matrix, the inverse of the K matrix and the inverse of diagonal matrix D, in which the diagonal elements comprise a number of known parents and their inbreeding coefficients. The computing method is adaptable to the analysis of a data set including pairs of GIAs with imperfect relationships

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Genomic prediction based on runs of homozygosity

Author: AK Sonesson
Alessandro Bagnato
AR Gilmour
B Hayes
BJ Hayes
D Habier
IM MacLeod
JA Sved
JA Yang
JL Jannink
LR Rabiner
Marlies Dolezal
ME Goddard
PM VanRaden
RL Fernando
T Luan
THE Meuwissen
THE Meuwissen
THE Meuwissen
Theo HE Meuwissen
Tu Luan
WG Hill
Xijiang Yu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A gene frequency model for QTL mapping using Bayesian inference

Author: A Legarra
AP de Roos
AR Gilmour
BJ Hayes
BJ Hayes
C Andreescu
CM Devlin
CR Henderson
CS Haley
D Habier
D Habier
D Sorensen
DJ de Koning
DL Johnson
DW Fulker
EA Rosenthal
EM Wijsman
ES Lander
EW Daw
F Farnir
F Gagnon
FE Gringola
H Zhao
Helene Gilbert
I Hoeschele
J Weller
Jack CM Dekkers
JCM Dekkers
L Grapes
L Grapes
L Terney
LP Zhao
M Perez-Enciso
M Xiong
ME Goddard
MPL Calus
N Metropolis
R Fan
R Fan
RC Elston
RL Fernando
RL Fernando
RL Fernando
RM Cantor
RM Thallman
Rohan L Fernando
S Heath
S Xu
SA Knott
SC Heath
T Meuwissen
T Wang
T Wang
THE Meuwissen
THE Meuwissen
Wei He
XY Lou
ZB Zeng
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Information for mapping of quantitative trait loci (QTL) comes from two sources: linkage disequilibrium (non-random association of allele states) and cosegregation (non-random association of allele origin). Information from LD can be captured by modeling conditional means and variances at the QTL given marker information. Similarly, information from cosegregation can be captured by modeling conditional covariances. Here, we consider a Bayesian model based on gene frequency (BGF) where both conditional means and variances are modeled as a function of the conditional gene frequencies at the QTL. The parameters in this model include these gene frequencies, additive effect of the QTL, its location, and the residual variance. Bayesian methodology was used to estimate these parameters. The priors used were: logit-normal for gene frequencies, normal for the additive effect, uniform for location, and inverse chi-square for the residual variance. Computer simulation was used to compare the power to detect and accuracy to map QTL by this method with those from least squares analysis using a regression model (LSR). Results To simplify the analysis, data from unrelated individuals in a purebred population were simulated, where only LD information contributes to map the QTL. LD was simulated in a chromosomal segment of 1 cM with one QTL by random mating in a population of size 500 for 1000 generations and in a population of size 100 for 50 generations. The comparison was studied under a range of conditions, which included SNP density of 0.1, 0.05 or 0.02 cM, sample size of 500 or 1000, and phenotypic variance explained by QTL of 2 or 5%. Both 1 and 2-SNP models were considered. Power to detect the QTL for the BGF, ranged from 0.4 to 0.99, and close or equal to the power of the regression using least squares (LSR). Precision to map QTL position of BGF, quantified by the mean absolute error, ranged from 0.11 to 0.21 cM for BGF, and was better than the precision of LSR, which ranged from 0.12 to 0.25 cM. Conclusions In conclusion given a high SNP density, the gene frequency model can be used to map QTL with considerable accuracy even within a 1 cM region.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ProdInra

Precision of genetic parameters and breeding values estimated in marker assisted BLUP genetic evaluation

Author: AR Gilmour
AW George
B Villanueva
Christian Edel
F Guillaume
FR Rijsdijk
G Thaller
Georg Thaller
Interbull
J Bennewitz
J Ruane
JBS Haldane
JCM Dekkers
K Vuori
Kay-Uwe Götz
L Rönnegård
PM VanRaden
PM VanRaden
Reiner Emmerling
RJ Spelman
RL Fernando
S Neuner
S Viitala
SC Heath
Stefan Neuner
T Druet
T Druet
THE Meuwissen
THE Meuwissen
WF Fikse
Z Liu
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

In practical implementations of marker-assisted selection economic and logistic restrictions frequently lead to incomplete genotypic data for the animals of interest. This may result in bias and larger standard errors of the estimated parameters and, as a consequence, reduce the benefits of applying marker-assisted selection. Our study examines the impact of the following factors: phenotypic information, depth of pedigree, and missing genotypes in the application of marker-assisted selection. Stochastic simulations were conducted to generate a typical dairy cattle population. Genetic parameters and breeding values were estimated using a two-step approach. First, pre-corrected phenotypes (daughter yield deviations (DYD) for bulls, yield deviations (YD) for cows) were calculated in polygenic animal models for the entire population. These estimated phenotypes were then used in marker assisted BLUP (MA-BLUP) evaluations where only the genotyped animals and their close relatives were included

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central