Article thumbnail

A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species

By Robert J. Elshire, Jeffrey C. Glaubitz, Qi Sun, Jesse A. Poland, Ken Kawamoto, Edward S. Buckler and Sharon E. Mitchell


Advances in next generation technologies have driven the costs of DNA sequencing down to the point that genotyping-by-sequencing (GBS) is now feasible for high diversity, large genome species. Here, we report a procedure for constructing GBS libraries based on reducing genome complexity with restriction enzymes (REs). This approach is simple, quick, extremely specific, highly reproducible, and may reach important regions of the genome that are inaccessible to sequence capture approaches. By using methylation-sensitive REs, repetitive regions of genomes can be avoided and lower copy regions targeted with two to three fold higher efficiency. This tremendously simplifies computationally challenging alignment problems in species with high levels of genetic diversity. The GBS procedure is demonstrated with maize (IBM) and barley (Oregon Wolfe Barley) recombinant inbred populations where roughly 200,000 and 25,000 sequence tags were mapped, respectively. An advantage in species like barley that lack a complete genome sequence is that a reference map need only be developed around the restriction sites, and this can be done in the process of sample genotyping. In such cases, the consensus of the read clusters across the sequence tagged sites becomes the reference. Alternatively, for kinship analyses in the absence of a reference genome, the sequence tags can simply be treated as dominant markers. Future application of GBS to breeding, conservation, and global species and population surveys may allow plant breeders to conduct genomic selection on a novel germplasm or species without first having to develop any prior molecular tools, or conservation biologists to determine population structure without prior knowledge of the genome or diversity in the species

Topics: Research Article
Publisher: Public Library of Science
OAI identifier:
Provided by: PubMed Central

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.

Suggested articles


  1. (2006). A distant upstream enhancer at the maize domestication gene tb1 has pleiotropic effects on plant and inflorescent architecture.
  2. (2009). A firstgeneration haplotype map of maize.
  3. (1987). A rapid DNA isolation procedure for small quantities of fresh leaf tissue.
  4. (2008). Accurate whole genome sequencing using reversible terminator chemistry.
  5. (1995). AFLP: a new technique for DNA fingerprinting.
  6. (2009). An integrated resource for barley linkage map and malting quality QTL alignment. Plant Genome 2: 134–140. Genotyping Approach for High Diversity Species PLoS
  7. (2006). BTA, a novel reagent for DNA attachment on glass and efficient generation of solid–phase amplified DNA colonies.
  8. (2007). Conserved noncoding genomic sequences associated with a flowering-time quantitative trait locus in maize.
  9. (2008). Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplex.
  10. (2007). Evaluation of target preparation methods for single-feature polymorphism detection in large complex plant genomes. Crop Sci 47(S2): S135–S148.
  11. (2002). Expanding the genetic map of maize with the intermated B736Mo17 (IBM) population.
  12. (2009). Fast and accurate short read alignment with BurrowsWheeler transform.
  13. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
  14. (2009). Genetic characterization and linkage disequilibrium estimation of a global maize collection using SNP markers.
  15. (2009). Genetic properties of the maize nested association mapping population.
  16. (2010). Genome-wide association studies of 14 agronomic traits in rice landraces.
  17. (2010). Genome-wide patterns of genetic variation among elite maize inbred lines.
  18. (2001). Genome-wide variation in human and fruitfly: a comparison.
  19. (2009). High-throughput genotyping by whole-genome resequencing.
  20. (2008). Identification of genetic variants using bar-coded multiplexed sequencing.
  21. (2005). Insights on evolution of virulence and resistance from the complete genome analysis of an early methicillin-resistant Staphylococcus aureus strain and a biofilm-producing methicillin-resistant Staphlococcus epidermis strain.
  22. (1991). Low nucleotide diversity in man.
  23. (2009). Maize inbreds exhibit high levels of copy number variation (CNV) and presence/absence variation (PAV) in genome content.
  24. (2006). Molecular and functional diversity of maize.
  25. (2001). Molecular mapping of the Oregon Wolfe Barleys: a phenotypically polymorphic doubledhaploid population.
  26. (2008). Multiplex sequencing of plant chloroplast genomes using Solexa sequencing-by-synthesis technology.
  27. (2006). Nucleotide variation and haplotype diversity in a 10-kb noncoding region in three continental human populations.
  28. (2006). Organization and variability of the maize genome.
  29. (2010). Paramutation in maize: RNA mediated trans-generational gene silencing.
  30. (2001). Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp mays L.). Proc Natl Acad Sci
  31. (2008). Rapid SNP discovery and genetic mapping using sequenced RAD markers.
  32. (2000). Rienzo A
  33. (2010). Sequencing technologies – the next generation.
  34. (2001). Structure of linkage disequilibrium and phenotypic associations in the maize genome.
  35. (2010). Targetenrichment strategies for next-generation sequencing.
  36. (2009). The B73 maize genome: complexity, diversity and dynamics.