
Non PCR-amplified Transcripts and AFLP fragments as reduced representations of the quail genome for 454 Titanium sequencing

Author: AD Mills
Alain Vignal
Catherine Beaumont
Christine Leterrier
Christophe Klopp
CP Van Tassell
Céline Noirot
David Gourichon
F Minvielle
Florence Vignoles
Francis Minvielle
Frédérique Pitel
J Binladen
Katia Feve
M LeMeur
NJ van Orsouw
O Roussot
Olivier Bouchez
P Beldade
P Ng
R Pinard
Sabine Richard
Sophie Leroux
T Maricic
T Wicker
Y Shibata
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: SNP (Single Nucleotide Polymorphism) discovery is now routinely performed using high-throughput sequencing of reduced representation libraries. Our objective was to adapt 454 GS FLX based sequencing methodologies in order to obtain the largest possible dataset from two reduced representation libraries, produced by AFLP (Amplified Fragment Length Polymorphism) for genomic DNA, and EST (Expressed Sequence Tag) for the transcribed fraction of the genome. Findings: The expressed fraction was obtained by preparing cDNA libraries without PCR amplification from quail embryo and brain. To optimize the information content for SNP analyses, libraries were prepared from individuals selected in three quail lines and each individual in the AFLP library was tagged. Sequencing runs produced 399,189 sequence reads from cDNA and 373,484 from genomic fragments, covering close to 250 Mb of sequence in total. Conclusions: Both methods used to obtain reduced representations for high-throughput sequencing were successful after several improvements. The protocols may be used for several sequencing applications, such as de novo sequencing, tagged PCR fragments or long fragment sequencing of cDNA.

HAL-ENS-LYON

Public Library of Science (PLOS)

HAL Descartes

HAL Université de Tours

ProdInra

Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations

Author: A Diaz
A Platt
A. Marcos Ramos
AJ Cortes
AM Ramos
Antoine Janssen
C Alonso-Blanco
CN Stewart Jr
CP Van Tassell
E Bao
Feyruz Yalcin
H Li
H Shinozuka
Hein J. A. van der Poel
HG Nam
HJ Edenberg
Hoa T. Truong
J Buntjer
J Cockram
J van Oeveren
JA Poland
JW Davey
Koen H. J. Huvenaars
L Barchi
Leonora. J. G. van Enckevort
M Koornneef
M Tester
M Vuylsteke
MA DePristo
Marjo de Ruiter
Michiel J. T. van Eijk
MW Ganal
N Appleby
NA Baird
Nathalie J. van Orsouw
NJ van Orsouw
P Andolfatto
P Vos
PY Kwok
R van Poecke
René C. J. Hogers
RJ Elshire
RJ Hayes
SP Moose
SW Baxter
Tianzhen Zhang
X Huang
Y Chutimanitsakun
Publication venue: Public Library of Science
Publication date: 25/05/2012

Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless of whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for arabidopsis, and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for arabidopsis (n = 222 samples) and lettuce (n = 87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike.

FigShare

Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers

Author: Anthony L. Shiver
CP Van Tassell
CQ Lai
D Botstein
D Chen
Eric A. Johnson
Eric U. Selker
HL Stickney
J Berger
Justin C. Fay
KJ Coyne
Mark C. Currey
MD Shapiro
MR Miller
Nathan A. Baird
NJ van Orsouw
P Vos
P Wenzl
Paul D. Etter
PF Colosimo
SR Wicks
Tressa S. Atwood
WA Cresko
William A. Cresko
ZA Lewis
Zachary A. Lewis
Publication venue: Public Library of Science
Publication date: 01/01/2008

Single nucleotide polymorphism (SNP) discovery and genotyping are essential to genetic mapping. There remains a need for a simple, inexpensive platform that allows high-density SNP discovery and genotyping in large populations. Here we describe the sequencing of restriction-site associated DNA (RAD) tags, which identified more than 13,000 SNPs, and mapped three traits in two model organisms, using less than half the capacity of one Illumina sequencing run. We demonstrated that different marker densities can be attained by choice of restriction enzyme. Furthermore, we developed a barcoding system for sample multiplexing and fine mapped the genetic basis of lateral plate armor loss in threespine stickleback by identifying recombinant breakpoints in F2 individuals. Barcoding also facilitated mapping of a second trait, a reduction of pelvic structure, by in silico re-sorting of individuals. To further demonstrate the ease of the RAD sequencing approach we identified polymorphic markers and mapped an induced mutation in Neurospora crassa. Sequencing of RAD markers is an integrated platform for SNP discovery and genotyping. This approach should be widely applicable to genetic mapping in a variety of organisms

CiteSeerX

Public Library of Science (PLOS)

Double Digest RADseq: An Inexpensive Method for De Novo SNP Discovery and Genotyping in Model and Non-Model Species

Author: Brant K. Peterson
CM Ramsdell
CP van Tassell
D Altshuler
DA Pollard
DW Craig
EM Kenny
Emily H. Kay
G Lunter
GP Consortium 1000
H Li
Heidi S. Fisher
Hopi E. Hoekstra
J Felsenstein
JC Avise
Jesse N. Weber
JL Davey
JM Catchen
KJ Emerson
KW Broman
L Li
L Salmela
LM Turner
Ludovic Orlando
MA Depristo
MA Quail
MA White
MD Carling
N Patterson
NA Baird
NJ van Orsouw
P Andolfatto
PA Hohenlohe
RC Edgar
S Alon
TFC Mackay
WF Dietrich
WF Pfender
WJ Kent
Z Gompert
Publication venue: Public Library of Science
Publication date: 01/01/2012

The ability to efficiently and accurately determine genotypes is a keystone technology in modern genetics, crucial to studies ranging from clinical diagnostics, to genotype-phenotype association, to reconstruction of ancestry and the detection of selection. To date, high capacity, low cost genotyping has been largely achieved via “SNP chip” microarray-based platforms which require substantial prior knowledge of both genome sequence and variability, and once designed are suitable only for those targeted variable nucleotide sites. This method introduces substantial ascertainment bias and inherently precludes detection of rare or population-specific variants, a major source of information for both population history and genotype-phenotype association. Recent developments in reduced-representation genome sequencing experiments on massively parallel sequencers (commonly referred to as RAD-tag or RADseq) have brought direct sequencing to the problem of population genotyping, but increased cost and procedural and analytical complexity have limited their widespread adoption. Here, we describe a complete laboratory protocol, including a custom combinatorial indexing method, and accompanying software tools to facilitate genotyping across large numbers (hundreds or more) of individuals for a range of markers (hundreds to hundreds of thousands). Our method requires no prior genomic knowledge and achieves per-site and per-individual costs below that of current SNP chip technology, while requiring similar hands-on time investment, comparable amounts of input DNA, and downstream analysis times on the order of hours. Finally, we provide empirical results from the application of this method to both genotyping in a laboratory cross and in wild populations. Because of its flexibility, this modified RADseq approach promises to be applicable to a diversity of biological questions in a wide range of organisms

CiteSeerX

Harvard University - DASH

CLOTU: An online pipeline for processing and clustering of 454 amplicon reads into OTUs followed by taxonomic annotation

Author: A Giongo
AL Hartman
Bjørn-Helge Mevik
C Quince
DA Benson
GW Tyson
Håvard Kauserud
IA Dickie
J Binladen
J Bråte
J Falgueras
J Gans
JG Caporaso
JR Cole
Kamran Shalchian-Tabrizi
L Tedersoo
LF Roesch
M Gardes
M Hamady
M Margulies
ML Sogin
NJ van Orsouw
P Lopez-Garcia
PD Schloss
Pål Enger
RA Edwards
Rakel Blaalid
RM Atlas
RV Pandey
S Kumar
SB Needleman
SF Altschul
SM Huse
Surendra Kumar
T White
Tor Carlsen
V Torsvik
WB Whitman
WR Pearson
Y Huang
Y Yu
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: The implementation of high throughput sequencing for exploring biodiversity poses high demands on bioinformatics applications for automated data processing. Here we introduce CLOTU, an online and open access pipeline for processing 454 amplicon reads. CLOTU has been constructed to be highly user-friendly and flexible, since different types of analyses are needed for different datasets. Results: In CLOTU, the user can filter out low quality sequences, trim tags, primers, and adaptors, perform clustering of sequence reads, and run BLAST against NCBInr or a customized database in a high performance computing environment. The resulting data may be browsed in a user-friendly manner and easily forwarded to downstream analyses. Although CLOTU is specifically designed for analyzing 454 amplicon reads, other types of DNA sequence data can also be processed. A fungal ITS sequence dataset generated by 454 sequencing of environmental samples is used to demonstrate the utility of CLOTU. Conclusions: CLOTU is a flexible and easy to use bioinformatics pipeline that includes different options for filtering, trimming, clustering and taxonomic annotation of high throughput sequence reads. Some of these options are not included in comparable pipelines. CLOTU is implemented in a Linux computer cluster and is freely accessible to academic users through the Bioportal web-based bioinformatics service (http://www.bioportal.uio.no).

NORA - Norwegian Open Research Archives

Cancer genetics services: a systematic review of the economic evidence and issues

Author: A Netten
A Walker
AAPM Van der Riet
B Bapat
B Wilson
BAJ Ponder
C Lerman
C Sevilla
C Soravia
D Ford
D Schrag
D Stoppa-Lyonnet
DF Easton
DM Cromwell
DM Eccles
ER Maher
F Couch
G L Griffith
H Chaliki
HFA Vasen
J Gray
J Hall
JA Peters
JL Wagner
JP Struewing
K Brain
K Heimdal
M EuroQol Group (Buxton
M Steel
MC King
MF Drummond
ML Brown
NJ Van Orsouw
R Eeles
R Lidereau
R T Edwards
RC Haggitt
RT Edwards
S Syngal
SH Taplin
T Debniak
TO Tengs
VR Grann
Publication venue: Nature Publishing Group
Publication date: 01/01/2004

Online Research @ Cardiff

Single-nucleotide polymorphism discovery by high-throughput sequencing in sorghum

Author: A Howe
A Roberts
AH Paterson
AJ Amaral
AM Casa
C Castano-Sanchez
CLL Gowda
CP Van Tassell
D Altshuler
DA Nickerson
DL Hyten
DR Bentley
FA Feltus
FR Miller
Frank F White
Ginny Antony
H-M Lam
HHD Kerstens
IY Choi
J Lai
J Marchini
J Yu
JA Bedell
James C Nelson
JC Stephens
JD Faris
Jianming Yu
KL McNally
M Kimura
M Margulies
M Trick
MA Gore
NA Baird
NJ van Orsouw
PJ Brown
PJ Maughan
R Li
RM Clark
RT Wiedmann
S Atwell
S Deschamps
S Ossowski
Shichen Wang
SM Al-Janabi
T Murashige
T Sasaki
WB Barbazuk
WL Rooney
X Wu
XH Huang
Xianran Li
Y Arai-Kichise
Y Chutimanitsakun
Y Fu
Yuye Wu
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Eight diverse sorghum (Sorghum bicolor L. Moench) accessions were subjected to short-read genome sequencing to characterize the distribution of single-nucleotide polymorphisms (SNPs). Two strategies were used for DNA library preparation. Missing SNP genotype data were imputed by local haplotype comparison. The effects of library type and genomic diversity on SNP discovery and imputation are evaluated. Results: Alignment of eight genome equivalents (6 Gb) to the public reference genome revealed 283,000 SNPs at ≥82% confirmation probability. Sequencing from libraries constructed to limit sequencing to start at defined restriction sites led to genotyping 10-fold more SNPs in all 8 accessions, and correctly imputing 11% more missing data, than from semirandom libraries. The SNP yield advantage of the reduced-representation method was less than expected, since up to one fifth of reads started at noncanonical restriction sites and up to one third of restriction sites predicted in silico to yield unique alignments were not sampled at near-saturation. For imputation accuracy, the availability of a genomically similar accession in the germplasm panel was more important than panel size or sequencing coverage. Conclusions: A sequence quantity of 3 million 50-base reads per accession using a BsrFI library would conservatively provide satisfactory genotyping of 96,000 sorghum SNPs. For most reliable SNP-genotype imputation in shallowly sequenced genomes, germplasm panels should consist of pairs or groups of genomically similar entries. These results may help in designing strategies for economical genotyping-by-sequencing of large numbers of plant accessions.

High-throughput 454 resequencing for allele discovery and recombination mapping in Plasmodium falciparum

Author: Allison Regier
AR Quinlan
Asako Tan
Brendan Collins
Brian A Desany
DA Wheeler
DE Neafsey
DJ Begun
DL Hyten
E Mancera
E Martinez-Perez
E Novaes
ER Mardis
F Picard
H Jiang
HH Chou
I Kozarewa
J Qi
J Ragoussis
J San Filippo
JA Bailey
JC Tan
JM Chen
JM Rothberg
John C Tan
KE Holt
KL McNally
KV Voelkerding
M Margulies
M Shinohara
MA West
Michael T Ferdig
MJ Gardner
MJ Moore
MZ Man
NJ van Orsouw
NV Dharia
O Harismendy
PJ Campbell
R Li
RA Holt
RR Selzer
RS Malhi
Scott J Emrich
SKK Volkman
T Singer
T Wicker
TE Wellems
Upeka Samarakoon
W Brockman
W Huang
WB Barbazuk
X Huang
X Su
Y Shen
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Knowledge of the origins, distribution, and inheritance of variation in the malaria parasite (Plasmodium falciparum) genome is crucial for understanding its evolution; however, the 81% (A+T) genome poses challenges to high-throughput sequencing technologies. We explore the viability of the Roche 454 Genome Sequencer FLX (GS FLX) high throughput sequencing technology for both whole genome sequencing and fine-resolution characterization of genetic exchange in malaria parasites. Results: We present a scheme to survey recombination in the haploid stage genomes of two sibling parasite clones, using whole genome pyrosequencing that includes a sliding window approach to predict recombination breakpoints. Whole genome shotgun (WGS) sequencing generated approximately 2 million reads, with an average read length of approximately 300 bp. De novo assembly using a combination of WGS and 3 kb paired end libraries resulted in contigs ≤ 34 kb. More than 8,000 of the 24,599 SNP markers identified between parents were genotyped in the progeny, resulting in a marker density of approximately 1 marker/3.3 kb and allowing for the detection of previously unrecognized crossovers (COs) and many non crossover (NCO) gene conversions throughout the genome. Conclusions: By sequencing the 23 Mb genomes of two haploid progeny clones derived from a genetic cross at more than 30× coverage, we captured high resolution information on COs, NCOs and genetic variation within the progeny genomes. This study is the first to resequence progeny clones to examine fine structure of COs and NCOs in malaria parasites.

Development of a two-dimensional electrophoresis method to study soil bacterial diversity

Author: Cullen DW
Don RH
Dullaghan EM
Díez B
Hengstmann U
Jackson CR
Janatová M
Kang KS
Kassen R
Kent AD
Kiminori Itoh
Lane DJ
Malloff C
Malloff CA
Miyoshi E
Muyzer G
Nyström-Lahti M
O’Farrell PH
Rajendran Narasimmalu
Takashi Amemiya
Takumi Isshi
Torsvik V
Van Orsouw NJ
Yu Z
Publication venue: Wiley