Location of Repository

Extent of genome-wide linkage disequilibrium in Australian Holstein-Friesian cattle based on a high-density SNP panel

By Mehar S. Khatkar, Mehar S. Nicholas, Andrew R. Collins, Kyall R. Zenger, Julie A.L. Cavanagh, Wes Barris, Robert D. Schnabel, Jeremy F. Taylor and Herman W. Raadsma


BACKGROUND: The extent of linkage disequilibrium (LD) within a population determines the number of markers that will be required for successful association mapping and marker-assisted selection. Most studies on LD in cattle reported to date are based on microsatellite markers or small numbers of single nucleotide polymorphisms (SNPs) covering one or only a few chromosomes. This is the first comprehensive study on the extent of LD in cattle by analyzing data on 1,546 Holstein-Friesian bulls genotyped for 15,036 SNP markers covering all regions of all autosomes. Furthermore, most studies in cattle have used relatively small sample sizes and, consequently, may have had biased estimates of measures commonly used to describe LD. We examine minimum sample sizes required to estimate LD without bias and loss in accuracy. Finally, relatively little information is available on comparative LD structures including other mammalian species such as human and mouse, and we compare LD structure in cattle with public-domain data from both human and mouse. RESULTS: We computed three LD estimates, D', Dvol and r2, for 1,566,890 syntenic SNP pairs and a sample of 365,400 non-syntenic pairs. Mean D' is 0.189 among syntenic SNPs, and 0.105 among non-syntenic SNPs; mean r2 is 0.024 among syntenic SNPs and 0.0032 among non-syntenic SNPs. All three measures of LD for syntenic pairs decline with distance; the decline is much steeper for r2 than for D' and Dvol. The value of D' and Dvol are quite similar. Significant LD in cattle extends to 40 kb (when estimated as r2) and 8.2 Mb (when estimated as D'). The mean values for LD at large physical distances are close to those for non-syntenic SNPs. Minor allelic frequency threshold affects the distribution and extent of LD. For unbiased and accurate estimates of LD across marker intervals spanning 0.62). For estimation of LD by D' and Dvol with sufficient precision, a sample size of at least 400 is required, whereas for r2 a minimum sample of 75 is adequate

Topics: QL, QH426
Year: 2008
OAI identifier: oai:eprints.soton.ac.uk:59929
Provided by: e-Prints Soton

Suggested articles



  1. (2005). Applications of whole-genome highdensity SNP genotyping. Expert Rev Mol Diagn
  2. (2005). Daly MJ: Genome-wide association studies for common diseases and complex traits. Nat Rev Genet
  3. DH: Confirming single nucleotide polymorphisms Additional file 1
  4. (2005). MS: A genome-wide scalable SNP genotyping assay using microarray technology. Nat Genet
  5. (2005). TP: Linkage mapping bovine EST-based SNP. BMC Genomics

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.