12 research outputs found

    Exploring the Switchgrass Transcriptome Using Second-Generation Sequencing Technology

    Get PDF
    Background: Switchgrass (Panicum virgatum L.) is a C4 perennial grass and widely popular as an important bioenergy crop. To accelerate the pace of developing high yielding switchgrass cultivars adapted to diverse environmental niches, the generation of genomic resources for this plant is necessary. The large genome size and polyploid nature of switchgrass makes whole genome sequencing a daunting task even with current technologies. Exploring the transcriptional landscape using next generation sequencing technologies provides a viable alternative to whole genome sequencing in switchgrass. Principal Findings: Switchgrass cDNA libraries from germinating seedlings, emerging tillers, flowers, and dormant seeds were sequenced using Roche 454 GS-FLX Titanium technology, generating 980,000 reads with an average read length of 367 bp. De novo assembly generated 243,600 contigs with an average length of 535 bp. Using the foxtail millet genome as a reference greatly improved the assembly and annotation of switchgrass ESTs. Comparative analysis of the 454-derived switchgrass EST reads with other sequenced monocots including Brachypodium, sorghum, rice and maize indicated a 70– 80 % overlap. RPKM analysis demonstrated unique transcriptional signatures of the four tissues analyzed in this study. More than 24,000 ESTs were identified in the dormant seed library. In silico analysis indicated that there are more than 2000 EST-SSRs in this collection. Expression of several orphan ESTs was confirmed by RT-PCR. Significance: We estimate that about 90 % of the switchgrass gene space has been covered in this analysis. This study nearl

    Distribution of simple sequence repeats in switchgrass ESTs.

    No full text
    <p>Di-, tri-, tetra-, penta- and hexa-nucleotide repeats were analyzed and their frequency plotted as a function of the repeat number.</p

    Heat map of switchgrass gene expression in four different tissues.

    No full text
    <p>Individual columns represent the four tissues used in this study while each row represents a unique contig. The dendrogram represents the similarity of expression profiles between the tissue samples based on average linkage clustering. The color key represents the log2 transformed RPKM values. Red indicates a low level of expression, green and blue are intermediate levels, while purple indicates maximal levels of gene expression. The highest value for RPKM after log 2 transformation was 17 and the lowest value was 0.1.</p

    Overview of the C4 NAD<sup>+</sup>-Malic enzyme photosynthesis.

    No full text
    <p>The numbers shown in red font in the figure correspond to the ESTs that were identified in the 454 sequencing analysis. Ala-Alanine; AlaAT – Alanine aminotransferase; Alpha-KG-alpha ketoglutarate; Asp-aspartate; AspAT-aspartate aminotransferase; CA-carbonic anhydrase; Glu-glutamate; OA-oxaloacetate; PEP-phosphoenol pyruvate; PEPC-phosphoenolpyruvate carboxylase; PPDK –pyruvate phosphate dikinase; Pyr-pyruvate; NAD<sup>+</sup>-MDH- NAD<sup>+</sup> dependent malate dehydrogenase; NAD<sup>+</sup>-ME- NAD<sup>+</sup> dependent malic enzyme.</p

    Switchgrass 454-based sequencing reads mapped to the foxtail millet genome.

    No full text
    <p>The number of contigs and unassembled ESTs that produced significant alignments to the foxtail millet genome are plotted for each 0.5 megabase interval. Radial axis line represents one log interval. Numbers on the circumference represent the Foxtail millet chromosomes (1–9) and each chromosome is given a different color. Assembled contigs aligned to the forward strand of the <i>Setaria italica</i> reference genome assembly are shown in blue for forward strand alignments and in yellow for reverse strand alignments. Singleton reads aligned to the forward strand are shown on the same figure in red, while singleton reads aligned to the reverse strand are shown in green. Diagram was prepared using Circos (<a href="http://mkweb.bcgsc.ca/circos" target="_blank">http://mkweb.bcgsc.ca/circos</a>).</p

    Unigene assembly features of switchgrass transcriptome.

    No full text
    <p>(A) Histogram of contig lengths following the 2-step <i>de novo</i> assembly process. The x-axis has been truncated at 3 kb. The longest contig is 12,437 base pairs. (B) Histogram of the average read-depth coverage for assembled contigs. Coverage values greater than 30Γ— have been binned together.</p

    Plant GO-slim terms associated with switchgrass.

    No full text
    <p>Venn diagram of the distribution of plant GO-slim terms associated with switchgrass contigs represented in molecular function, biological process and cellular component categories.</p

    Switchgrass gene expression analysis by RT-PCR.

    No full text
    <p>The four panels represents the amplifications from the cDNA derived from the four different tissues – flowers, tillers, seeds and seedlings. The lanes labeled 1–22 are amplifications of EST contigs with functional annotations. Lanes 23–42 contains amplifications of EST contigs that did not show any homologies to sequences in the databases. Lane 43 is the amplification of the teosinte branched 1 gene that shows tiller specific expression. M indicates the 100 bp DNA size ladder.</p
    corecore