Exploring the Switchgrass Transcriptome Using Second-Generation Sequencing Technology
Background: Switchgrass (Panicum virgatum L.) is a C4 perennial grass and is widely popular as an important bioenergy crop. To accelerate the pace of developing high yielding switchgrass cultivars adapted to diverse environmental niches, the generation of genomic resources for this plant is necessary. The large genome size and polyploid nature of switchgrass make whole genome sequencing a daunting task even with current technologies. Exploring the transcriptional landscape using next generation sequencing technologies provides a viable alternative to whole genome sequencing in switchgrass. Principal Findings: Switchgrass cDNA libraries from germinating seedlings, emerging tillers, flowers, and dormant seeds were sequenced using Roche 454 GS-FLX Titanium technology, generating 980,000 reads with an average read length of 367 bp. De novo assembly generated 243,600 contigs with an average length of 535 bp. Using the foxtail millet genome as a reference greatly improved the assembly and annotation of switchgrass ESTs. Comparative analysis of the 454-derived switchgrass EST reads with other sequenced monocots including Brachypodium, sorghum, rice and maize indicated a 70–80 % overlap. RPKM analysis demonstrated unique transcriptional signatures of the four tissues analyzed in this study. More than 24,000 ESTs were identified in the dormant seed library. In silico analysis indicated that there are more than 2000 EST-SSRs in this collection. Expression of several orphan ESTs was confirmed by RT-PCR. Significance: We estimate that about 90 % of the switchgrass gene space has been covered in this analysis. This study nearl
Molecular Evolution of Regulatory Genes in Spruces from Different Species and Continents: Heterogeneous Patterns of Linkage Disequilibrium and Selection but Correlated Recent Demographic Changes
Genes involved in transcription regulation may represent valuable targets in association genetics studies because of their key roles in plant development and potential selection at the molecular level. Selection and demographic signatures at the sequence level were investigated for five regulatory genes belonging to the knox-I family (KN1, KN2, KN3, KN4) and the HD-Zip III family (HB-3) in three Picea species affected by post-glacial recolonization in North America and Europe. To disentangle neutral and selective forces and estimate linkage disequilibrium (LD) on a gene basis, complete or nearly complete gene sequences were analysed. Nucleotide variation within species, haplotype structure, LD, and neutrality tests, in addition to coalescent simulations based on Tajima’s D and Fay and Wu’s H, were estimated. Nucleotide diversity was generally low in all species (average π = 0.002–0.003) and much heterogeneity was seen in LD and selection signatures among genes and species. Most of the genes harboured an excess of both rare and frequent alleles in the three species. Simulations showed that this excess was significantly higher than that expected under neutrality, and that a bottleneck during the Last Glacial Maximum followed by population expansion at the Pleistocene/Holocene boundary or shortly after best explains the correlated sequence patterns. These results indicate that despite recent large demographic changes in the three boreal species from two continents, species-specific selection signatures could still be detected from the analysis of nearly complete regulatory gene sequences. Such different signatures indicate differential subfunctionalization of gene family members in the three congeneric species
Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers
Background: Sesame is an important oil crop, but limited transcriptomic and genomic data are currently available. This information is essential to clarify the molecular mechanisms of fatty acid and lignan biosynthesis. In addition, a shortage of sesame molecular markers limits the efficiency and accuracy of genetic breeding. High-throughput transcriptomic sequencing is essential to generate a large transcriptome sequence dataset for gene discovery and molecular marker development. Results: Sesame transcriptomes from five tissues were sequenced using Illumina paired-end sequencing technology. The cleaned raw reads were assembled into a total of 86,222 unigenes with an average length of 629 bp. Of the unigenes, 46,584 (54.03%) had significant similarity with proteins in the NCBI nonredundant protein database and Swiss-Prot database (E-value < 10⁻⁵). Of these annotated unigenes, 10,805 and 27,588 unigenes were assigned to gene ontology categories and clusters of orthologous groups, respectively. In total, 22,003 (25.52%) unigenes were mapped onto 119 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG). Furthermore, 44,750 unigenes showed homology to 15,460 Arabidopsis genes based on BLASTx analysis against The Arabidopsis Information Resource (TAIR, Version 10) and revealed relatively high gene coverage. In total, 7,702 unigenes were converted into SSR markers (EST-SSR). Dinucleotide SSRs were the dominant repeat motif (67.07%, 5,166), followed by trinucleotide (24.89%, 1,917), tetranucleotide (4.31%, 332), hexanucleotide (2.62%, 202), and pentanucleotide (1.10%, 85) SSRs. AG/CT (46.29%) was the dominant repeat motif, followed by AC/GT (16.07%), AT/AT (10.53%), AAG/CTT (6.23%), and AGG/CCT (3.39%). Fifty EST-SSRs were randomly selected to validate amplification and to determine the degree of polymorphism in the genomic DNA pools. Forty primer pairs successfully amplified DNA fragments and detected significant amounts of polymorphism among 24 sesame accessions. Conclusions: This study demonstrates that Illumina paired-end sequencing is a fast and cost-effective approach to gene discovery and molecular marker development in non-model organisms. Our results provide a comprehensive sequence resource for sesame research.
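To make the SSR classification in the sesame abstract concrete, the following is a minimal Python sketch of how perfect di- to hexanucleotide repeats can be located in a sequence. It is an illustration only: the abstract does not say which tool or thresholds were used, so the function name `find_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions.

```python
import re

# Hypothetical minimum repeat counts per motif length (bp).
# These thresholds are assumptions, not the settings used in the study.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each perfect SSR found in seq."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least (min_rep - 1) copies of itself.
        pattern = re.compile(r"(([ACGT]{%d})\2{%d,})" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(2)
            # Skip motifs that are just a shorter unit repeated (e.g. "AGAG" is "AG" x 2),
            # so each repeat tract is reported once, under its shortest motif.
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(1)) // unit_len))
    return hits
```

Calling `find_ssrs` on each assembled unigene and tallying hits by motif length would give a breakdown of the kind reported above. Real pipelines also handle compound and imperfect repeats, which this sketch ignores.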
Genetic Structure, Linkage Disequilibrium and Signature of Selection in Sorghum: Lessons from Physically Anchored DArT Markers
Population structure, extent of linkage disequilibrium (LD) as well as signatures of selection were investigated in sorghum using a core sample representative of worldwide diversity. A total of 177 accessions were genotyped with 1122 informative physically anchored DArT markers. The properties of DArTs to describe sorghum genetic structure were compared to those of SSRs and of previously published RFLP markers. Model-based (STRUCTURE software) and Neighbor-Joining diversity analyses led to the identification of 6 groups and confirmed previous evolutionary hypotheses. Results were globally consistent between the different marker systems. However, DArTs appeared more robust in terms of data resolution and Bayesian group assignment. Whole genome linkage disequilibrium as measured by mean r² decreased from 0.18 (between 0 and 10 kb) to 0.03 (between 100 kb and 1 Mb), stabilizing at 0.03 after 1 Mb. Effects of sample size and genetic structure on LD estimations were tested using (i) random sampling, (ii) the Maximum Length SubTree algorithm (MLST), and (iii) structure groups. Optimizing population composition by the MLST reduced the biases in small samples and seemed to be an efficient way of selecting samples to make the best use of LD as a genome mapping approach in structured populations. These results also suggested that more than 100,000 markers may be required to perform genome-wide association studies in collections covering worldwide sorghum diversity. Analysis of DArT marker differentiation between the identified genetic groups pointed out outlier loci potentially linked to genes controlling traits of interest, including disease resistance genes for which evidence of selection had already been reported. In addition, evidence of selection near a homologous locus of FAR1 concurred with sorghum phenotypic diversity for sensitivity to photoperiod
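The r² statistic reported in the sorghum abstract can be illustrated with a short Python sketch. It assumes presence/absence (0/1) marker scores in largely inbred accessions, so that scores can be treated like haplotypes. The abstract does not state the exact estimator used, and the function name `r_squared` is hypothetical.

```python
def r_squared(marker_a, marker_b):
    """Squared correlation between two 0/1 marker score vectors.

    Accessions with a missing score (None) at either marker are dropped.
    Returns NaN when either marker is monomorphic in the retained sample.
    """
    pairs = [
        (a, b) for a, b in zip(marker_a, marker_b)
        if a is not None and b is not None
    ]
    n = len(pairs)
    if n == 0:
        return float("nan")
    p_a = sum(a for a, _ in pairs) / n       # frequency of "present" at marker A
    p_b = sum(b for _, b in pairs) / n       # frequency of "present" at marker B
    p_ab = sum(a * b for a, b in pairs) / n  # frequency of "present" at both
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return float("nan")
    d = p_ab - p_a * p_b                     # disequilibrium coefficient D
    return d * d / denom
```

Averaging such pairwise values within bins of physical distance (for example 0 to 10 kb) would give a decay profile of the kind summarized above.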
Insect herbivory (Choristoneura fumiferana, Tortricidea) underlies tree population structure (Picea glauca, Pinaceae)
Variation in insect herbivory can lead to population structure in plant hosts as indicated by defence traits. In annual herbaceous plants, defence traits may vary between geographic areas but evidence of such patterns is lacking for long-lived species. This may result from the variety of selection pressures from herbivores, long distance gene flow, genome properties, and lack of research. We investigated the antagonistic interaction between white spruce (Picea glauca) and spruce budworm (SBW, Choristoneura fumiferana), the most devastating forest insect of eastern North America, in common garden experiments. White spruces that are able to resist SBW attack were reported to accumulate the acetophenones piceol and pungenol constitutively in their foliage. We show that levels of these acetophenones and transcripts of the gene responsible for their release are highly heritable and that their accumulation is synchronized with the most devastating stage of SBW. Piceol and pungenol concentrations negatively correlate with rate of development in female SBW and follow a non-random geographic variation pattern that is partially explained by historical damage from SBW and temperature. Our results show that accumulation of acetophenones is an efficient resistance mechanism against SBW in white spruce and that insects can affect population structure of a long-lived plant
Comparison of phenotypic and genetic clone delineation in quaking aspen, Populus tremuloides
Key message: Clonal delineation at nuclear microsatellites and phenotypic traits showed high correspondence and revealed an important role of both sexual and clonal reproduction for stand genetic structure. Abstract: Quaking aspen (Populus tremuloides Michx.) grows throughout the northern and central portions of North America. Reproduction occurs both sexually via seeds and clonally from root suckers. Clonal delineation using morphological/phenological traits and, more recently, highly variable nuclear microsatellites has shown considerable variation in the size of clonal assemblies and in the relative importance of sexual versus clonal reproduction across the species range. In order to provide reliable estimates of genet size (N/G; ramets per sampled genet) and genotypic diversity (G/N; genets/ramets), and to compare genetic and phenotypic clone delineation, we characterized 181 sampled stems (ramets) at seven nuclear microsatellites, and morphological and phenological traits from six clones (genet size ≥11). Genotypic diversity was moderate (G/N = 0.18) and within the range reported in other studies across North America. Multivariate statistics revealed a high correspondence between genetic and phenotypic clone delineation, both with and without predefined genetic groups (94.2 % and 81.7 %, respectively). Moderate average genet size (5.6 ramets per genet) and the occurrence of genetically distinct single-ramet genets surrounded by larger genets suggested intermediate levels of sexual reproduction contributing to the genetic structure of this stand. Significant differences among genets were found for phenological and morphological traits such as bark thickness and leaf shape. However, most clones showed no significant differences in diameter growth, which was likely caused by poor drainage in this high clay soil that inhibited the expression of genetic differences in growth
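The two summary ratios defined in the aspen abstract, G/N and N/G, follow directly from grouping ramets by multilocus genotype. The Python sketch below shows that bookkeeping under a simplifying assumption: ramets belong to the same genet only if their genotypes match exactly. Studies often allow for scoring errors or somatic mutation, which is not modelled here. The function name `clonal_summary` and the example genotypes are hypothetical.

```python
from collections import Counter


def clonal_summary(genotypes):
    """Summarize clonal structure from one multilocus genotype per sampled ramet.

    genotypes: list of hashable multilocus genotypes (e.g. tuples of allele pairs).
    Ramets with identical genotypes are assigned to the same genet (an assumption).
    """
    genet_sizes = Counter(genotypes)  # genotype -> number of ramets
    n_ramets = len(genotypes)         # N
    n_genets = len(genet_sizes)       # G
    return {
        "G/N": n_genets / n_ramets,   # genotypic diversity
        "N/G": n_ramets / n_genets,   # average genet size
        "single_ramet_genets": sum(1 for size in genet_sizes.values() if size == 1),
    }
```

For example, five ramets carrying three distinct genotypes give G/N = 0.6 and N/G ≈ 1.67.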