12 research outputs found
Recommended from our members
Conifer DBMagic: a database housing multiple de novo transcriptome assemblies for 12 diverse conifer species
Conifers comprise an ancient and widespread plant lineage of enormous commercial and ecological value. However, compared to model woody angiosperms, such as Populus and Eucalyptus, our understanding of conifers remains quite limited at a genomic level. Large genome sizes (10,000–40,000 Mbp) and large amounts of repetitive DNA have limited efforts to produce a conifer reference genome, and genomic resource development has focused primarily on characterization of expressed sequences. Here, we report the completion of a conifer transcriptome sequencing project undertaken in collaboration with the U.S. DOE Joint Genome Institute that resulted in production of almost 12 million sequence reads. Five loblolly pine (Pinus taeda) cDNA libraries representing multiple tissues, treatments, and genotypes produced over four million sequence reads that, along with available Sanger expressed sequence tags, were used to create contig assemblies using three different assembly algorithms: Newbler, MiraEST, and NGen. In addition, libraries from 11 other conifer species, as well as one member of the Gnetales (Gnetum gnemon), produced 0.4 to 1.2 million sequence reads each. Among the selected conifer species were representatives of each of the seven phylogenetic families in the Coniferales: Araucariaceae, Cephalotaxaceae, Cupressaceae, Pinaceae, Podocarpaceae, Sciadopityaceae, and Taxaceae. Transcriptome builds for each species were generated using each of the three assemblers. All contigs for every species generated using each assembler can be obtained from Conifer DBMagic, a public database for searching, viewing, and downloading contig sequences, the associated sequence reads, and their annotations.Keywords: Database,
Gene models,
Pinus,
Comparative phylogenomics,
Annotation,
Transcriptome,
Coniferale
Data from: Multilocus analyses reveal little evidence for lineage wide adaptive evolution within major clades of soft pines (Pinus subgenus Strobus)
Estimates from molecular data for the fraction of new nonsynonymous mutations that are adaptive vary strongly across plant species. Much of this variation is due to differences in life-history strategies as they influence the effective population size (Ne). Ample variation for these estimates, however, remains even when comparisons are made across species with similar values of Ne. An open question thus remains as to why the large disparity for estimates of adaptive evolution exists among plant species. Here, we have estimated the distribution of deleterious fitness effects (DFE) and the fraction of adaptive nonsynonymous substitutions (α) for 11 species of soft pines (subgenus Strobus) using DNA sequence data from 167 orthologous nuclear gene fragments. Most newly arising nonsynonymous mutations were inferred to be so strongly deleterious that they would rarely become fixed. Little evidence for long-term adaptive evolution was detected, as all 11 estimates for α were not significantly different from zero. Nucleotide diversity at synonymous sites, moreover, was strongly correlated with attributes of the DFE across species, thus illustrating a strong consistency with the expectations from the Nearly Neutral Theory of molecular evolution. Application of these patterns to genome-wide expectations for these species, however, was difficult as the loci chosen for analysis were a biased set of conserved loci which greatly influenced estimates of the DFE and α. This implies that genome-wide parameter estimates will need truly genome-wide data, so that many of existing patterns documented previously for forest trees (e.g. little evidence for signature of selection) may need revision
Association Genetics of Coastal Douglas Fir (Pseudotsuga menziesii var. menziesii, Pinaceae). I. Cold-Hardiness Related Traits
Adaptation to cold is one of the greatest challenges to forest trees. This process is highly synchronized with environmental cues relating to photoperiod and temperature. Here, we use a candidate gene-based approach to search for genetic associations between 384 single-nucleotide polymorphism (SNP) markers from 117 candidate genes and 21 cold-hardiness related traits. A general linear model approach, including population structure estimates as covariates, was implemented for each marker–trait pair. We discovered 30 highly significant genetic associations [false discovery rate (FDR) Q < 0.10] across 12 candidate genes and 10 of the 21 traits. We also detected a set of 7 markers that had elevated levels of differentiation between sampling sites situated across the Cascade crest in northeastern Washington. Marker effects were small (r2 < 0.05) and within the range of those published previously for forest trees. The derived SNP allele, as measured by a comparison to a recently diverged sister species, typically affected the phenotype in a way consistent with cold hardiness. The majority of markers were characterized as having largely nonadditive modes of gene action, especially underdominance in the case of cold-tolerance related phenotypes. We place these results in the context of trade-offs between the abilities to grow longer and to avoid fall cold damage, as well as putative epigenetic effects. These associations provide insight into the genetic components of complex traits in coastal Douglas fir, as well as highlight the need for landscape genetic approaches to the detection of adaptive genetic diversity
Recommended from our members
HoweGlennForestryConiferDBMagicDatabaseSupplementaryFile4.pptx
Conifers comprise an ancient and widespread plant lineage of enormous commercial and ecological value. However, compared to model woody angiosperms, such as Populus and Eucalyptus, our understanding of conifers remains quite limited at a genomic level. Large genome sizes (10,000–40,000 Mbp) and large amounts of repetitive DNA have limited efforts to produce a conifer reference genome, and genomic resource development has focused primarily on characterization of expressed sequences. Here, we report the completion of a conifer transcriptome sequencing project undertaken in collaboration with the U.S. DOE Joint Genome Institute that resulted in production of almost 12 million sequence reads. Five loblolly pine (Pinus taeda) cDNA libraries representing multiple tissues, treatments, and genotypes produced over four million sequence reads that, along with available Sanger expressed sequence tags, were used to create contig assemblies using three different assembly algorithms: Newbler, MiraEST, and NGen. In addition, libraries from 11 other conifer species, as well as one member of the Gnetales (Gnetum gnemon), produced 0.4 to 1.2 million sequence reads each. Among the selected conifer species were representatives of each of the seven phylogenetic families in the Coniferales: Araucariaceae, Cephalotaxaceae, Cupressaceae, Pinaceae, Podocarpaceae, Sciadopityaceae, and Taxaceae. Transcriptome builds for each species were generated using each of the three assemblers. All contigs for every species generated using each assembler can be obtained from Conifer DBMagic, a public database for searching, viewing, and downloading contig sequences, the associated sequence reads, and their annotations
Recommended from our members
HoweGlennForestryConiferDBMagicDatabaseSupplementaryFile1.xlsx
Conifers comprise an ancient and widespread plant lineage of enormous commercial and ecological value. However, compared to model woody angiosperms, such as Populus and Eucalyptus, our understanding of conifers remains quite limited at a genomic level. Large genome sizes (10,000–40,000 Mbp) and large amounts of repetitive DNA have limited efforts to produce a conifer reference genome, and genomic resource development has focused primarily on characterization of expressed sequences. Here, we report the completion of a conifer transcriptome sequencing project undertaken in collaboration with the U.S. DOE Joint Genome Institute that resulted in production of almost 12 million sequence reads. Five loblolly pine (Pinus taeda) cDNA libraries representing multiple tissues, treatments, and genotypes produced over four million sequence reads that, along with available Sanger expressed sequence tags, were used to create contig assemblies using three different assembly algorithms: Newbler, MiraEST, and NGen. In addition, libraries from 11 other conifer species, as well as one member of the Gnetales (Gnetum gnemon), produced 0.4 to 1.2 million sequence reads each. Among the selected conifer species were representatives of each of the seven phylogenetic families in the Coniferales: Araucariaceae, Cephalotaxaceae, Cupressaceae, Pinaceae, Podocarpaceae, Sciadopityaceae, and Taxaceae. Transcriptome builds for each species were generated using each of the three assemblers. All contigs for every species generated using each assembler can be obtained from Conifer DBMagic, a public database for searching, viewing, and downloading contig sequences, the associated sequence reads, and their annotations
Recommended from our members
HoweGlennForestryConiferDBMagicDatabaseSupplementaryFile2.xlsx
Conifers comprise an ancient and widespread plant lineage of enormous commercial and ecological value. However, compared to model woody angiosperms, such as Populus and Eucalyptus, our understanding of conifers remains quite limited at a genomic level. Large genome sizes (10,000–40,000 Mbp) and large amounts of repetitive DNA have limited efforts to produce a conifer reference genome, and genomic resource development has focused primarily on characterization of expressed sequences. Here, we report the completion of a conifer transcriptome sequencing project undertaken in collaboration with the U.S. DOE Joint Genome Institute that resulted in production of almost 12 million sequence reads. Five loblolly pine (Pinus taeda) cDNA libraries representing multiple tissues, treatments, and genotypes produced over four million sequence reads that, along with available Sanger expressed sequence tags, were used to create contig assemblies using three different assembly algorithms: Newbler, MiraEST, and NGen. In addition, libraries from 11 other conifer species, as well as one member of the Gnetales (Gnetum gnemon), produced 0.4 to 1.2 million sequence reads each. Among the selected conifer species were representatives of each of the seven phylogenetic families in the Coniferales: Araucariaceae, Cephalotaxaceae, Cupressaceae, Pinaceae, Podocarpaceae, Sciadopityaceae, and Taxaceae. Transcriptome builds for each species were generated using each of the three assemblers. All contigs for every species generated using each assembler can be obtained from Conifer DBMagic, a public database for searching, viewing, and downloading contig sequences, the associated sequence reads, and their annotations
Recommended from our members
HoweGlennForestryConiferDBMagicDatabaseSupplementaryFile3.xlsx
Conifers comprise an ancient and widespread plant lineage of enormous commercial and ecological value. However, compared to model woody angiosperms, such as Populus and Eucalyptus, our understanding of conifers remains quite limited at a genomic level. Large genome sizes (10,000–40,000 Mbp) and large amounts of repetitive DNA have limited efforts to produce a conifer reference genome, and genomic resource development has focused primarily on characterization of expressed sequences. Here, we report the completion of a conifer transcriptome sequencing project undertaken in collaboration with the U.S. DOE Joint Genome Institute that resulted in production of almost 12 million sequence reads. Five loblolly pine (Pinus taeda) cDNA libraries representing multiple tissues, treatments, and genotypes produced over four million sequence reads that, along with available Sanger expressed sequence tags, were used to create contig assemblies using three different assembly algorithms: Newbler, MiraEST, and NGen. In addition, libraries from 11 other conifer species, as well as one member of the Gnetales (Gnetum gnemon), produced 0.4 to 1.2 million sequence reads each. Among the selected conifer species were representatives of each of the seven phylogenetic families in the Coniferales: Araucariaceae, Cephalotaxaceae, Cupressaceae, Pinaceae, Podocarpaceae, Sciadopityaceae, and Taxaceae. Transcriptome builds for each species were generated using each of the three assemblers. All contigs for every species generated using each assembler can be obtained from Conifer DBMagic, a public database for searching, viewing, and downloading contig sequences, the associated sequence reads, and their annotations
Recommended from our members
HoweGlennForestryConiferDBMagicDatabase.pdf
Conifers comprise an ancient and widespread plant lineage of enormous commercial and ecological value. However, compared to model woody angiosperms, such as Populus and Eucalyptus, our understanding of conifers remains quite limited at a genomic level. Large genome sizes (10,000–40,000 Mbp) and large amounts of repetitive DNA have limited efforts to produce a conifer reference genome, and genomic resource development has focused primarily on characterization of expressed sequences. Here, we report the completion of a conifer transcriptome sequencing project undertaken in collaboration with the U.S. DOE Joint Genome Institute that resulted in production of almost 12 million sequence reads. Five loblolly pine (Pinus taeda) cDNA libraries representing multiple tissues, treatments, and genotypes produced over four million sequence reads that, along with available Sanger expressed sequence tags, were used to create contig assemblies using three different assembly algorithms: Newbler, MiraEST, and NGen. In addition, libraries from 11 other conifer species, as well as one member of the Gnetales (Gnetum gnemon), produced 0.4 to 1.2 million sequence reads each. Among the selected conifer species were representatives of each of the seven phylogenetic families in the Coniferales: Araucariaceae, Cephalotaxaceae, Cupressaceae, Pinaceae, Podocarpaceae, Sciadopityaceae, and Taxaceae. Transcriptome builds for each species were generated using each of the three assemblers. All contigs for every species generated using each assembler can be obtained from Conifer DBMagic, a public database for searching, viewing, and downloading contig sequences, the associated sequence reads, and their annotations