45 research outputs found

High-resolution haplotype block structure in the cattle genome

Author: A Tenesa
AM Bowcock
B Rosner
BJ Hayes
Clare A Gill
CP Van Tassell
Curtis P Van Tassell
DJ Witherspoon
DL Hyten
DW Craig
H Nilsen
I Kershaw
International HapMap Consortium
IT Jolliffe
JA Sved
JL Mountain
John J Grefenstette
Jungwoo Choi
KA Frazer
Lakshmi K Matukumalli
LK Matukumalli
M Gautier
M Jakobsson
MJ Daly
MS Khatkar
P Scheet
PC Sabeti
Rafael Villa-Angulo
S Gu
SB Gabriel
SD McKay
The Bovine HapMap Consortium
V Guryev
X Wang
Publication venue: BioMed Central
Publication date: 01/01/2009

Abstract. Background: The Bovine HapMap Consortium has generated assay panels to genotype ~30,000 single nucleotide polymorphisms (SNPs) from 501 animals sampled from 19 worldwide taurine and indicine breeds, plus two outgroup species (Anoa and Water Buffalo). Within the larger set of SNPs we targeted 101 high-density regions spanning up to 7.6 Mb with an average density of approximately one SNP per 4 kb, and characterized the linkage disequilibrium (LD) and haplotype block structure within individual breeds and groups of breeds in relation to their geographic origin and use. Results: From the 101 targeted high-density regions on bovine chromosomes 6, 14, and 25, between 57 and 95% of the SNPs were informative in the individual breeds. The regions of high LD extend up to ~100 kb and the size of haplotype blocks ranges between 30 bases and 75 kb (10.3 kb average). On the scale from 1–100 kb the extent of LD and haplotype block structure in cattle is highly similar to that in humans. The estimation of effective population sizes over the previous 10,000 generations conforms to two main events in cattle history: the initiation of cattle domestication (~12,000 years ago), and the intensification of population isolation and current population bottleneck that breeds have experienced worldwide within the last ~700 years. Haplotype block density correlation, block boundary discordances, and haplotype sharing analyses were consistent in revealing unexpected similarities between some beef and dairy breeds, making them non-differentiable. Clustering techniques permitted grouping of breeds into different clades given their similarities and dissimilarities in genetic structure. Conclusion: This work presents the first high-resolution analysis of haplotype block structure in worldwide cattle samples. Several novel results were obtained. First, cattle and humans share a high similarity in LD and haplotype block structure on the scale of 1–100 kb. Second, unexpected similarities in haplotype block structure between dairy and beef breeds make them non-differentiable. Finally, our findings suggest that ~30,000 uniformly distributed SNPs would be necessary to construct a complete genome LD map in Bos taurus breeds, and ~580,000 SNPs would be necessary to characterize the haplotype block structure across the complete cattle genome.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

OAKTrust Digital Repository (Texas A&M Univ)

Double Digest RADseq: An Inexpensive Method for De Novo SNP Discovery and Genotyping in Model and Non-Model Species

Author: Brant K. Peterson
CM Ramsdell
CP van Tassell
D Altshuler
DA Pollard
DW Craig
EM Kenny
Emily H. Kay
G Lunter
GP Consortium 1000
H Li
Heidi S. Fisher
Hopi E. Hoekstra
J Felsenstein
JC Avise
Jesse N. Weber
JL Davey
JM Catchen
KJ Emerson
KW Broman
L Li
L Salmela
LM Turner
Ludovic Orlando
MA Depristo
MA Quail
MA White
MD Carling
N Patterson
NA Baird
NJ van Orsouw
P Andolfatto
PA Hohenlohe
RC Edgar
S Alon
TFC Mackay
WF Dietrich
WF Pfender
WJ Kent
Z Gompert
Publication venue: Public Library of Science
Publication date: 01/01/2012

The ability to efficiently and accurately determine genotypes is a keystone technology in modern genetics, crucial to studies ranging from clinical diagnostics, to genotype-phenotype association, to reconstruction of ancestry and the detection of selection. To date, high capacity, low cost genotyping has been largely achieved via “SNP chip” microarray-based platforms which require substantial prior knowledge of both genome sequence and variability, and once designed are suitable only for those targeted variable nucleotide sites. This method introduces substantial ascertainment bias and inherently precludes detection of rare or population-specific variants, a major source of information for both population history and genotype-phenotype association. Recent developments in reduced-representation genome sequencing experiments on massively parallel sequencers (commonly referred to as RAD-tag or RADseq) have brought direct sequencing to the problem of population genotyping, but increased cost and procedural and analytical complexity have limited their widespread adoption. Here, we describe a complete laboratory protocol, including a custom combinatorial indexing method, and accompanying software tools to facilitate genotyping across large numbers (hundreds or more) of individuals for a range of markers (hundreds to hundreds of thousands). Our method requires no prior genomic knowledge and achieves per-site and per-individual costs below those of current SNP chip technology, while requiring similar hands-on time investment, comparable amounts of input DNA, and downstream analysis times on the order of hours. Finally, we provide empirical results from the application of this method to genotyping both in a laboratory cross and in wild populations. Because of its flexibility, this modified RADseq approach promises to be applicable to a diversity of biological questions in a wide range of organisms.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

PubMed Central

Surface-associated MUC5B mucins promote protease activity in Lactobacillus fermentum biofilms

Author: A Klein
A Mellors
AJ MacPherson
AP Corfield
B Ma
C Badet
C Kindblom
C Wickström
Claes Wickström
DJ Thornton
DR Sutherland
E Farnell
ER Kunji
FJ Troost
G Musso
GD Stoyancheva
Gunnel Svensäter
I Strahinic
JL Round
JR Davies
Julia R Davies
K Hojo
KM Abdullah
LE Chávez de Paz
Luis Chávez de Paz
M Derrien
MA Watt
ME Macías-Rodríguez
ML Van Tassell
RJ Siezen
S Marchant
SM Naser
Y Song
Publication venue: Springer Science and Business Media LLC

Crossref

Whole-genome resequencing shows numerous genes with nonsynonymous SNPs in the Japanese native cattle Kuchinoshima-Ushi

Author: A Bagnato
A Stamatakis
BJ Hayes
BT Page
CG Elsik
CP Van Tassell
DA Wheeler
DR Bentley
F Ronquist
FC Buchanan
H Li
Hirofumi Yoshikawa
J Wang
JBS Ferraz
JC Venter
JD Nkrumah
JI Kim
JL Gill
Kaoru Tsuda
MA Harris
ME Goddard
MF Allan
MG Thomas
PM VanRaden
RA Gibbs
Ryouka Kawahara-Miki
S Eck
S Hiendleder
S Hoashi
S Levy
S MacEachern
S Pant
SC Liefers
Sen-ichi Oda
Shizufumi Ebihara
Shunsuke Yajima
SM Ahn
T Maniatis
Takashi Matsumoto
Tomohiro Kono
X Zhou
Yu Kanesaki
Yuh Shiwa
Yuko Arai-Kichise
Z Jiang
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Because the Japanese native cattle Kuchinoshima-Ushi have been isolated on a small island and their lineage has been intensely protected, it has been assumed to date that numerous and valuable genomic variations are conserved in this cattle breed. Results: In this study, we evaluated genetic features of this breed, including single nucleotide polymorphism (SNP) information, by whole-genome sequencing using a Genome Analyzer II. A total of 64.2 Gb of sequence was generated, of which 86% of the obtained reads were successfully mapped to the reference sequence (Btau 4.0) with BWA. On average, 93% of the genome was covered by the reads and the number of mapped reads corresponded to 15.8-fold coverage across the covered region. From these data, we identified 6.3 million SNPs, of which more than 5.5 million (87%) were found to be new. Out of the SNPs annotated in the bovine sequence assembly, 20,432 were found in protein-coding regions containing 11,713 nonsynonymous SNPs in 4,643 genes. Furthermore, phylogenetic analysis using sequence data from 10 genes (more than 10 kbp) showed that Kuchinoshima-Ushi is clearly distinct from European domestic breeds of cattle. Conclusions: These results provide a framework for further genetic studies in the Kuchinoshima-Ushi population and research on functions of SNP-containing genes, which would aid in understanding the molecular basis underlying phenotypic variation of economically important traits in cattle and in improving intrinsic defects in domestic cattle breeds.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Epistasis: Obstacle or Advantage for Mapping Complex Traits?

Identification of genetic loci in complex traits has focused largely on one-dimensional genome scans to search for associations between single markers and the phenotype. There is mounting evidence that locus interactions, or epistasis, are a crucial component of the genetic architecture of biologically relevant traits. However, epistasis is often viewed as a nuisance factor that reduces power for locus detection. Counter to expectations, recent work shows that fitting full models, instead of testing marker main effect and interaction components separately, in exhaustive multi-locus genome scans can have higher power to detect loci when epistasis is present than single-locus scans, an improvement that comes despite a much larger multiple testing alpha-adjustment in such searches. We demonstrate, both theoretically and via simulation, that the expected power to detect loci when fitting full models is often larger when these loci act epistatically than when they act additively. Additionally, we show that the power for single-locus detection may be improved in cases of epistasis compared to the additive model. Our exploration of a two-step model selection procedure shows that identifying the true model is difficult. However, this difficulty is certainly not exacerbated by the presence of epistasis; on the contrary, in some cases the presence of epistasis can aid in model selection. The impact of allele frequencies on both power and model selection is dramatic.

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Wageningen University & Research Publications

KNAW Repository

High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species

Abstract. Background: High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and interspecific breeding applications in less domesticated and less funded plant genera. Results: We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions: This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity ≥2%. The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus.

Repository Open Access to Scientific Information from Embrapa

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Universidade de São Paulo