27 research outputs found
Core reactions of fatty acid biosynthesis reconstructed based on the <i>de novo</i> assembly and annotation of <i>C. oleifera</i> transcriptome.
<p>During fatty acid biosynthesis, two-carbon units are added for each cycle reaction, and the four-step cycle is repeated until the appropriate chain-length is reached. Finally, different types of fatty acids are synthesized. The identified enzymes are shown in boxes and abbreviated as below: ACC, acetyl-CoA carboxylase (EC: 6.4.1.2); MAT, malonyl-CoA ACP transacylase (EC: 2.3.1.39); KAS, beta-ketoacyl-ACP synthase (KAS I, EC: 2.3.1.41; KASII, EC: 2.3.1.179; KAS III, EC: 2.3.1.180); KAR, beta-ketoacyl-ACP reductase (EC: 1.1.1.100); HAD, beta-hydroxyacyl-ACP dehydrase (EC: 4.2.1.-); EAR, enoyl-ACP reductase (EC: 1.3.1.9); AAD, acyl-ACP desaturase (EC: 1.14.19.2); OAH, oleoyl-ACP hydrolase (EC: 3.1.2.14); FatA, Acyl-ACP thioesterase A (EC: 3.1.2.-); Δ<sup>12</sup>D, Δ<sup>12</sup>(ω<sup>6</sup>)-desaturase (EC: 1.4.19.6). The numbers-in-circles indicates the repeat time of the condensation reaction.</p
Quantitative RT-PCR validations of the 17 candidate lipid-related genes in the <i>C. oleifera</i> transcriptome.
<p>17 candidate unigenes involved in lipid metabolism including (<b>a</b>) fatty acid and (<b>b</b>) TAG pathways were selected for the quantitative RT-PCR analysis. Standard error of the mean for three biological replicates (nested with three technical replicates) is represented by the error bars. Results represent the mean (± SD) of the three experiments. The translation elongation factor 1-alpha (TEF) gene was chosen as an internal standard.</p
Transcriptome Analysis of the Oil-Rich Tea Plant, <i>Camellia oleifera</i>, Reveals Candidate Genes Related to Lipid Metabolism
<div><p>Background</p><p>Rapidly driven by the need for developing sustainable sources of nutritionally important fatty acids and the rising concerns about environmental impacts after using fossil oil, oil-plants have received increasing awareness nowadays. As an important oil-rich plant in China, <i>Camellia oleifera</i> has played a vital role in providing nutritional applications, biofuel productions and chemical feedstocks. However, the lack of <i>C. oleifera</i> genome sequences and little genetic information have largely hampered the urgent needs for efficient utilization of the abundant germplasms towards modern breeding efforts of this woody oil-plant.</p><p>Results</p><p>Here, using the 454 GS-FLX sequencing platform, we generated approximately 600,000 RNA-Seq reads from four tissues of <i>C. oleifera</i>. These reads were trimmed and assembled into 104,842 non-redundant putative transcripts with a total length of ∼38.9 Mb, representing more than 218-fold of all the <i>C. oleifera</i> sequences currently deposited in the GenBank (as of March 2014). Based on the BLAST similarity searches, nearly 42.6% transcripts could be annotated with known genes, conserved domains, or Gene Ontology (GO) terms. Comparisons with the cultivated tea tree, <i>C. sinensis</i>, identified 3,022 pairs of orthologs, of which 211 exhibited the evidence under positive selection. Pathway analysis detected the majority of genes potentially related to lipid metabolism. Evolutionary analysis of omega-6 fatty acid desaturase (<i>FAD2</i>) genes among 20 oil-plants unexpectedly suggests that a parallel evolution may occur between <i>C. oleifera</i> and <i>Olea oleifera</i>. Additionally, more than 2,300 simple sequence repeats (SSRs) and 20,200 single-nucleotide polymorphisms (SNPs) were detected in the <i>C. oleifera</i> transcriptome.</p><p>Conclusions</p><p>The generated transcriptome represents a considerable increase in the number of sequences deposited in the public databases, providing an unprecedented opportunity to discover all related-genes associated with lipid metabolic pathway in <i>C. oleifera</i>. It will greatly enhance the generation of new varieties of <i>C. oleifera</i> with increased yields and high quality.</p></div
Phylogenetic analyses of the <i>FAD2</i> genes among 20 oil-plants.
<p>(<b>a</b>) The alignment of Cole|AFK31315 (<i>C. oleifera</i>, AFK31315), Cche|AGH32914 (<i>C. chekiangoleosa</i>, AGH32914) and ColeFAD2 (ColeIsotig4522:451–1599) amino acid sequences. The solid black lines indicate conserved amino acids. The filled boxes represent three H-boxes, including HECGH (red box), HRRHH (blue box), and HVAHH (green box). The position (left) is based on <i>FAD2</i> gene in <i>C. chekiangoleosa</i> (AGH32914). The three inconsistent amino acids were plotted in uppercase letters (black). Multiple sequence alignment was performed using ClustalW <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0104150#pone.0104150-Chenna1" target="_blank">[58]</a>, <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0104150#pone.0104150-Larkin1" target="_blank">[59]</a> package. (<b>b</b>) The amino acid sequences were used for phylogenetic tree analysis. The asterisk indicates the <i>FAD2</i> gene (ColeFAD2) detected in the assembled <i>C. oleifera</i> transcriptome (ColeIsotig4522:451–1599). I–V represent the five groups of all the 20 oil-plants classified by the sequence similarity. The GenBank accession numbers and the full species names of the genes used here are: Scom|CAA63432 (<i>Solanum commersonii</i>, CAA63432); Atha|NP_187819 (<i>Arabidopsis thaliana</i>, NP_187819); Hann|AAL68982 (<i>Helianthus annuus</i>, AAL68982); Brap|CAD30827 (<i>Brassica rapa</i>, CAD30827); Sole|BAC22091 (<i>Spinacia oleracea</i>, BAC22091); Oeur|AAL93620 (<i>Olea europaea</i>, AAL93620); Pgra|AAO37754 (<i>Punica granatum</i>, AAO37754); Oeur|AAW63041 (<i>Olea europaea</i>, AAW63041); Gmax|BAD89862 (<i>Glycine max</i>, BAD89862); Hbra|AAY87459 (<i>Hevea brasiliensis</i>, AAY87459); Jcur|ABA41034 (<i>Jatropha curcas</i>, ABA41034); Ptom|ABC41578 (<i>Populus tomentosa</i>, ABC41578); Vmon|ABL86147 (<i>Vernicia Montana</i>, ABL86147); Lusi|ACF49507 (<i>Linum usitatissimum</i>, ACF49507); Rcom|002530704 (<i>Ricinus communis</i>, XP_002530704); Ahyp|ACZ06072 (<i>Arachis hypogaea</i>, ACZ06072); Pvul|ADO17551 (<i>Phaseolus vulgaris</i>, ADO17551); Vfor|AEE69020 (<i>Vernicia fordii</i>, AEE69020); Vlab|AEI60128 (<i>Vitis labrusca</i>, AEI60128); Cole|AFK31315 (<i>C. oleifera</i>, AFK31315); Cche|AGH32914 (<i>C. chekiangoleosa</i>, AGH32914).</p
RNA editing detected by transcriptome reads mapping.
a<p>Strands are indicated with “+”, positive strand, and “−”, negative strand;</p>b<p>Base in the positive strand;</p>c<p>Transcriptome reads that represent corresponding base substitutions that were counted;</p>d<p>Underline indicates the edited base.</p
Overview of <i>C. oleifera</i> transcriptome sequencing and assembly.
<p>(<b>a</b>) Length distribution of 454 sequencing reads after filtering and trimming adapters. (<b>b</b>) Length distribution of the singletons and assembled isotigs.</p
Distribution of simple sequence repeats (SSRs), single nucleotide polymorphisms (SNPs) and insertion/deletions (InDels) in <i>C. oleifera</i> isotigs.
<p>(<b>a</b>) Di-, tri-, tetra-, penta- and hexa-nucleotide repeats were analyzed. The x-axis shows the type of the SSRs, whereas y-axis shows total number of SSRs in different classes. (<b>b</b>) Frequencies of different SNPs/InDels. The x-axis indicates the substitution type of SNPs/InDels, while y-axis represents the number of SNPs/InDels for each substitution type.</p
Summary of the chloroplast genome sequencing, assembly and features.
a<p>LSC, large single copy;</p>b<p>SSC, small single copy;</p>c<p>IR, inverted repeats.</p
Triacylglycerol (TAG) biosynthesis pathway reconstructed based on the <i>de novo</i> assembly and annotation of <i>C. oleifera</i> transcriptome.
<p>Identified enzymes are shown in boxes, including: GK, glycerol kinase (EC: 2.7.1.30); GPAT, glycerol-3-phosphate O-acyltransferase (EC: 2.3.1.15); AGPAT, 1-acyl-sn-glycerol-3-phosphate O-acyltransferase (EC: 2.3.1.51); PP, phosphatidate phosphatase (EC: 3.1.3.4); DGAT, diacylglycerol O-acyltransferase (EC: 2.3.1.20); and PDAT, phopholipid∶ diacyglycerol acyltransferase (EC: 2.3.1.158). The dashed arrows denote reaction(s) in which the enzymes are not shown.</p
Contradiction between Plastid Gene Transcription and Function Due to Complex Posttranscriptional Splicing: An Exemplary Study of <em>ycf15</em> Function and Evolution in Angiosperms
<div><p>Plant chloroplast genes are usually co-transcribed while its posttranscriptional splicing is fairly complex and remains largely unsolved. On basis of sequencing the three complete <i>Camellia</i> (Theaceae) chloroplast genomes for the first time, we comprehensively analyzed the evolutionary patterns of <i>ycf15</i>, a plastid gene quite paradoxical in terms of its function and evolution, along the inferred angiosperm phylogeny. Although many species in separate lineages including the three species reported here contained an intact <i>ycf15</i> gene in their chloroplast genomes, the phylogenetic mixture of both intact and obviously disabled <i>ycf15</i> genes imply that they are all non-functional. Both intracellular gene transfer (IGT) and horizontal gene transfer (HGT) failed to explain such distributional anomalies. While, transcriptome analyses revealed that <i>ycf15</i> was transcribed as precursor polycistronic transcript which contained <i>ycf2</i>, <i>ycf15</i> and antisense <i>trnL-CAA</i>. The transcriptome assembly was surprisingly found to cover near the complete <i>Camellia</i> chloroplast genome. Many non-coding regions including pseudogenes were mapped by multiple transcripts, indicating the generality of pseudogene transcriptions. Our results suggest that plastid DNA posttranscriptional splicing may involve complex cleavage of non-functional genes.</p> </div
