
Identification of SNPs in RNA-seq data of two cultivars of Glycine max (soybean) differing in drought resistance

Author: Barbazuk WB
Baudet C
Cardon LR
Choi IY
Conesa A
Du CF
Duran C
Flint-Garcia SA
Gonçalo Amarante Guimarães Pereira
Huang X
Jorge Maurício Costa Mondego
Koboldt DC
Langmead B
Leandro Costa do Nascimento
Li H
Liao Y
Marcelo Falsarella Carazzolle
Melum E
Mochida K
Nascimento LC
Novaes E
Pindo M
Ramon Oliveira Vidal
Schiermeier Q
Schmutz J
Slater G
Stokstad E
Van Tassell CP
Wu X
Zhou QY
Publication venue: FapUNIFESP (SciELO)
Publication date: 01/01/2012

Single nucleotide polymorphism genotyping in polyploid wheat with the Illumina GoldenGate assay

Author: A Oliphant
AG Clark
B Ewing
BS Weir
C Lotti
Charles Nicolet
D Gordon
DL Hyten
ED Akhunov
Eduard Akhunov
ES McFadden
FJ Steemers
G Blanc
H Kihara
International HapMap Consortium
J Dvorak
J Shendure
Jan Dvorak
JB Fan
K Liu
K Zhao
KS Caldwell
KS Gill
M Akbari
M Margulies
M Stephens
M Troggio
MJ Aranzana
N Rostoks
P Hardenbol
P Sarkar
Q-J Song
R Shen
RT Brumfield
S Chao
SA Flint-Garcia
SJ Macdonald
WB Barbazuk
Publication venue: Springer-Verlag
Publication date: 01/01/2009

Single nucleotide polymorphisms (SNPs) are indispensable in such applications as association mapping and construction of high-density genetic maps. These applications usually require genotyping of thousands of SNPs in a large number of individuals. Although a number of SNP genotyping assays are available, most of them are designed for SNP genotyping in diploid individuals. Here, we demonstrate that the Illumina GoldenGate assay could be used for SNP genotyping of homozygous tetraploid and hexaploid wheat lines. Genotyping reactions could be carried out directly on genomic DNA without the necessity of preliminary PCR amplification. A total of 53 tetraploid and 38 hexaploid homozygous wheat lines were genotyped at 96 SNP loci. The genotyping error rate estimated after removal of low-quality data was 0% and 1% for tetraploid and hexaploid wheat, respectively. Developed SNP genotyping assays were shown to be useful for genotyping wheat cultivars. This study demonstrated that the GoldenGate assay is a very efficient tool for high-throughput genotyping of polyploid wheat, opening new possibilities for the analysis of genetic variation in wheat and dissection of the genetic basis of complex traits using an association mapping approach.

eScholarship - University of California

The whole-organism heavy chain B cell repertoire from Zebrafish self-organizes into distinct network features

Author: A Richard
AL Barabasi
C Berek
CE Willett
CJ Jolly
E Andersson
E Zotenko
EA Kabat
FN Papavasiliou
H Jeong
IR Cohen
JA Weinstein
JA Yoder
JD Hansen
JH Postlethwait
JP Rast
JY Park
LA Clark
M Girvan
M Muramatsu
N Danilova
NS Trede
NY Zheng
O Ozier
P Parham
Rotem Ben-Hamo
S Tavazoie
SH Kleinstein
SH Yook
SJ Foster
Sol Efroni
T Mora
TB Kepler
TX Liu
U Hershberg
VMA Batagelj
WB Barbazuk
XL He
Z Li
Publication venue: BioMed Central
Publication date: 01/01/2011

A sweetpotato gene index established by de novo assembly of pyrosequencing and Sanger sequences and mining for gene-based microsatellite markers

Author: A Conesa
A Kriegner
A Papanicolaou
B Chevreux
C Soderlund
Carlos Rivera
Cynthia Quispe
DA Shagin
Diogenes Cerna
DP Zhang
DR Bentley
G Aparicio
Genoveva Rossel
J Hu
J Low
J Quackenbush
Jack Hou
Jaime A Pacheco
JC Cervantes-Flores
JC Vera
Ji Young Kim
JJ Doyle
JR Miller
Julio Solis
JW Low
KL Childs
LC Da Maia
Luis Rojas
Luz R Tincopa
M Ghislain
MI Buteler
NR Thomson
O Harismendy
Omar Palomino
PC Bundock
Reinhard Simon
RL Jarret
Rocio Alagon
Roland Schafleitner
Ronald F Robles
WB Barbazuk
YT Tseng
YY Zhu
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Sweetpotato (Ipomoea batatas (L.) Lam.), a hexaploid outcrossing crop, is an important staple and food security crop in developing countries in Africa and Asia. The availability of genomic resources for sweetpotato is in striking contrast to its importance for human nutrition. Previously existing sequence data were restricted to around 22,000 expressed sequence tag (EST) sequences and ~1,500 GenBank sequences. We have used 454 pyrosequencing to augment the available gene sequence information to enhance functional genomics and marker design for this plant species. Results: Two quarter 454 pyrosequencing runs used two normalized cDNA collections from stems and leaves from drought-stressed sweetpotato clone Tanzania and yielded 524,209 reads, which were assembled together with 22,094 publicly available expressed sequence tags into 31,685 sets of overlapping DNA segments and 34,733 unassembled sequences. Blastx comparisons with the UniRef100 database allowed annotation of 23,957 contigs and 15,342 singletons resulting in 24,657 putatively unique genes. Further, 27,119 sequences had no match to protein sequences of the UniRef100 database. On the basis of this gene index, we have identified 1,661 gene-based microsatellite sequences, of which 223 were selected for testing and 195 were successfully amplified in a test panel of 6 hexaploid (I. batatas) and 2 diploid (I. trifida) accessions. Conclusions: The sweetpotato gene index is a useful source for functionally annotated sweetpotato gene sequences that contains three times more gene sequence information for sweetpotato than previous EST assemblies. A searchable version of the gene index, including a blastn function, is available at http://www.cipotato.org/sweetpotato_gene_index.

Comparing the normalization methods for the differential analysis of Illumina high-throughput RNA-Seq data

Author: A Mortazavi
A Oshlack
B Langmead
B Li
BM Bolstad
C Trapnell
CA Maher
ES Lander
F Li
GK Smyth
GM Church
H Rehrauer
Ho Sun Shon
J Hauke
JC Marioni
JH Schefe
JP Magalhães de
Keun Ho Ryu
M Li
MA Dillies
MD Robinson
MR Teixeira
MY Galperin
P Li
Peipei Li
R Edgar
R Morin
R Patro
S Anders
S Lee
SC Schuster
WB Barbazuk
Y Chu
Y Piao
Yongjun Piao
Z Wang
Publication venue: Springer Science and Business Media LLC
Publication date

Public Library of Science (PLOS)

Discovery of Pod Shatter-Resistant Associated SNPs by Deep Sequencing of a Representative Library Followed by Bulk Segregant Analysis in Rapeseed

Author: A Xu
AC Need
AS Salunkhe
CC Sánchez
CL Morgan
CP Van Tassell
D Altshuler
D Bhattramakki
DL Hyten
E Novaes
FA Feltus
Gaomiao Zhan
GP Kadkol
Guihua Liu
H Feng
H Zhang
Hanzhong Wang
Hongli Yang
IV Bi
J Batley
JS Price
JW Van Ooijen
JW Wenger
L Monna
M Hajduch
M Hasan
M Mittelbach
M Trick
N Liu
P Martin-Lopes
P Westermeier
R Li
R Venuprasad
R Wang
Rongling Wu
RT Wiedmann
RW Michelmore
S Amar
S Nasu
S Wang
Shunmou Huang
W Li
WB Barbazuk
Wei Hua
WenYC
X Wang
Xinfa Wang
Y Sun
Z Huang
Zhiyong Hu
Publication venue: Public Library of Science
Publication date: 17/04/2012

Background: Single nucleotide polymorphisms (SNPs) are an important class of genetic marker for target gene mapping. As yet, there is no rapid and effective method to identify SNPs linked with agronomic traits in rapeseed and other crop species. Methodology/Principal Findings: We demonstrate a novel method for identifying SNP markers in rapeseed by deep sequencing a representative library and performing bulk segregant analysis. With this method, SNPs associated with rapeseed pod shatter-resistance were discovered. Firstly, a reduced representation of the rapeseed genome was used. Genomic fragments ranging from 450–550 bp were prepared from the susceptible bulk (ten F2 plants with a silique shattering resistance index, SSRI, < 0.10) and the resistant bulk (ten F2 plants with SSRI > 0.90), and Solexa sequencing produced 90 bp reads. Approximately 50 million of these sequence reads were assembled into contigs to a depth of 20-fold coverage. Secondly, 60,396 ‘simple SNPs’ were identified, and the statistical significance was evaluated using Fisher’s exact test. Seventy associated SNPs with –log10(p) values over 16 were selected for further analysis. The distribution of these SNPs showed a tight cluster consisting of 14 associated SNPs within a 396 kb region on chromosome A09. Our evidence indicates that this region contains a major quantitative trait locus (QTL). Finally, two associated SNPs from this region were mapped on a major QTL region.
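
As a rough illustration of the significance test mentioned in this abstract, the Python sketch below computes a two-sided Fisher's exact test p-value for a single SNP from a 2x2 table of allele read counts in the two bulks. The function name, the table layout, and the read counts are hypothetical; the abstract does not say how the authors built their tables or which software they used.

```python
from math import comb, log10


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Hypothetical layout: rows are the two bulks, columns are the two alleles.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum over every table that is no more probable than the observed one.
    # The small relative tolerance guards against floating-point ties.
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(p, 1.0)


# Made-up read counts: each bulk carries only one of the two alleles.
p_value = fisher_exact_two_sided(40, 0, 0, 40)
score = -log10(p_value)
print(f"-log10(p) = {score:.1f}")
```

With these invented counts the score is well above 16, the cutoff quoted in the abstract; a table with equal counts in every cell gives a p-value of 1.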

CiteSeerX

The Francis Crick Institute

High-throughput gene and SNP discovery in Eucalyptus grandis, an uncharacterized genome

Author: AP Rooney
AP Weber
C Roth
CD Bustamante
Dario Grattapaglia
DB Neale
DE Stage
Derek R Drost
DL Hartl
Evandro Novaes
F Cheung
FAO
GA Tuskan
GA Watterson
Georgios J Pappas
GR Brown
J Bergelson
JM Cork
JM Eirin-Lopez
K Ohtsu
KB McIntosh
KV Krutovsky
M Barrier
M Heuertz
M Kirst
M Lynch
M Margulies
M Meyer
M Nei
Matias Kirst
MJ Moore
MW Jones-Rhoades
P Parameswaran
PK Ingvarsson
R Fluhr
RM Clark
Ronald R Sederoff
S Chang
SC Gonzalez-Martinez
SJ Emrich
SN Santos
The Arabidopsis Genome Initiative
WB Barbazuk
William G Farmerie
XF Ma
Y Matsuo
Publication venue: BioMed Central
Publication date: 01/01/2008

Background: Benefits from high-throughput sequencing using 454 pyrosequencing technology may be most apparent for species with high societal or economic value but few genomic resources. Rapid means of gene sequence and SNP discovery using this novel sequencing technology provide a set of baseline tools for genome-level research. However, it is questionable how effectively the sequencing of large numbers of short reads for species with essentially no prior gene sequence information will support contig assemblies and sequence annotation. Results: With the purpose of generating the first broad survey of gene sequences in Eucalyptus grandis, the most widely planted hardwood tree species, we used 454 technology to sequence and assemble 148 Mbp of expressed sequences (EST). EST sequences were generated from a normalized cDNA pool comprised of multiple tissues and genotypes, promoting discovery of homologues to almost half of Arabidopsis genes, and a comprehensive survey of allelic variation in the transcriptome. By aligning the sequencing reads from multiple genotypes we detected 23,742 SNPs, 83% of which were validated in a sample. Genome-wide nucleotide diversity was estimated for 2,392 contigs using a modified theta (θ) parameter, adapted for measuring genetic diversity from polymorphisms detected by randomly sequencing a multi-genotype cDNA pool. Diversity estimates in non-synonymous nucleotides were on average 4x smaller than in synonymous, suggesting purifying selection. Non-synonymous to synonymous substitutions (Ka/Ks) among 2,001 contigs averaged 0.30 and was skewed to the right, further supporting that most genes are under purifying selection. Comparison of these estimates among contigs identified major functional classes of genes under purifying and diversifying selection in agreement with previous research. Conclusion: In providing an abundance of foundational transcript sequences where limited prior genomic information existed, this work created part of the foundation for the annotation of the E. grandis genome that is being sequenced by the US Department of Energy. In addition we demonstrated that SNPs sampled in large-scale with 454 pyrosequencing can be used to detect evolutionary signatures among genes, providing one of the first genome-wide assessments of nucleotide diversity and Ka/Ks for a non-model plant species.
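
The abstract refers to a modified theta (θ) parameter but does not describe the modification. For orientation only, the Python sketch below shows the standard Watterson estimator of θ per site, which is not the authors' adapted version. The function name and the example numbers are hypothetical.

```python
def watterson_theta(segregating_sites, n_sequences, n_sites):
    """Standard Watterson estimator of theta per site.

    theta = S / (a_n * L), where a_n = sum_{i=1}^{n-1} 1/i.
    This is the textbook form, not the modified estimator from the abstract.
    """
    if n_sequences < 2:
        raise ValueError("need at least two sequences")
    if n_sites <= 0:
        raise ValueError("number of sites must be positive")
    a_n = sum(1.0 / i for i in range(1, n_sequences))
    return segregating_sites / (a_n * n_sites)


# Made-up example: 10 segregating sites among 4 sequences over 1,000 sites.
theta = watterson_theta(10, 4, 1000)
print(f"theta per site = {theta:.6f}")
```

The harmonic term a_n corrects for the fact that larger samples are expected to reveal more segregating sites; a pooled cDNA design, as in the abstract, would need a further adjustment because the number of sequences sampled at each site varies.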

Repository Open Access to Scientific Information from Embrapa

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Characterisation of the transcriptome of a wild great tit Parus major population by next generation sequencing

Author: A Charmantier
A Futschik
A Husby
A Kunstner
Anna W Santure
Ben C Sheldon
CW Wheat
D Brett
D Garant
D Lack
DA Dawson
DH Nussey
H Ellegren
H van Bakel
I Yanai
J Brown
J Galindo
J Mank
J Merilä
J Slate
J Stapley
Jake Gratten
JC Vera
Jim A Mossman
Jon Slate
L Liou
LEB Kruuk
LW Hillier
M Pigozzi
ME Visser
MS Boyce
NEM van Bers
P Gienapp
P Korsten
PJ Drent
RH McCleery
S Bouwhuis
T Clutton-Brock
The Gene Ontology Consortium
TW Nilsen
WB Barbazuk
WC Warren
WHO De Smet
Y Barash
YY Zhu
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: The recent development of next generation sequencing technologies has made it possible to generate very large amounts of sequence data in species with little or no genome information. Combined with the large phenotypic databases available for wild and non-model species, these data will provide an unprecedented opportunity to "genomicise" ecological model organisms and establish the genetic basis of quantitative traits in natural populations.

Oxford University Research Archive

White Rose Research Online

UQ eSpace (University of Queensland)

Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library

Background: To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time-consuming methodology of SNP identification in rainbow trout, and therefore not suitable for high-throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. Results: The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6-fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses, were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts. Conclusion: The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionarily recent genome duplication would be desirable.
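
The coverage and validation figures in this abstract can be checked with simple arithmetic. The Python sketch below uses only numbers quoted in the abstract and assumes, for illustration, that each read samples exactly one fragment end; the abstract does not state how reads were filtered or counted, and the variable names are made up.

```python
# Figures quoted in the abstract.
reads = 2_000_000
fragment_ends = 300_000  # 150,000 unique restriction fragments x 2 ends
genotyped = 384          # putative SNPs chosen for genotyping
validated = 183          # putative SNPs that validated
mapped = 167             # validated SNPs placed on the linkage map

# Assumption: every read samples exactly one fragment end.
reads_per_end = reads / fragment_ends
validation_rate = validated / genotyped
mapped_fraction = mapped / validated

print(f"reads per fragment end: {reads_per_end:.2f}")
print(f"validation rate: {validation_rate:.1%}")
print(f"validated SNPs placed on the map: {mapped_fraction:.1%}")
```

Under that assumption the raw read count gives about 6.7 reads per fragment end, close to the reported 6-fold average, and 183 of 384 matches the reported validation rate of approximately 48%.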