144 research outputs found

    Global isoform-specific transcript alterations and deregulated networks in clear cell renal cell carcinoma.

    Get PDF
    Extensive genome-wide analyses of deregulated gene expression have now been performed for many types of cancer. However, most studies have focused on deregulation at the gene-level, which may overlook the alterations of specific transcripts for a given gene. Clear cell renal cell carcinoma (ccRCC) is one of the best-characterized and most pervasive renal cancers, and ccRCCs are well-documented to have aberrant RNA processing. In the present study, we examine the extent of aberrant isoform-specific RNA expression by reporting a comprehensive transcript-level analysis, using the new kallisto-sleuth-RATs pipeline, investigating coding and non-coding differential transcript expression in ccRCC. We analyzed 50 ccRCC tumors and their matched normal samples from The Cancer Genome Altas datasets. We identified 7,339 differentially expressed transcripts and 94 genes exhibiting differential transcript isoform usage in ccRCC. Additionally, transcript-level coexpression network analyses identified vasculature development and the tricarboxylic acid cycle as the most significantly deregulated networks correlating with ccRCC progression. These analyses uncovered several uncharacterized transcripts, including lncRNAs FGD5-AS1 and AL035661.1, as potential regulators of the tricarboxylic acid cycle associated with ccRCC progression. As ccRCC still presents treatment challenges, our results provide a new resource of potential therapeutics targets and highlight the importance of exploring alternative methodologies in transcriptome-wide studies

    Extending the rapeseed gene pool with resynthesized Brassica napus II: Heterosis

    Get PDF
    Hybrid breeding relies on the combination of parents from two differing heterotic groups. However, the genetic diversity in adapted oilseed rape breeding material is rather limited. Therefore, the use of resynthesized Brassica napus as a distant gene pool was investigated. Hybrids were derived from crosses between 44 resynthesized lines with a diverse genetic background and two male sterile winter oilseed rape tester lines. The hybrids were evaluated together with their parents and check cultivars in 2 years and five locations in Germany. Yield, plant height, seed oil, and protein content were monitored, and genetic distances were estimated with molecular markers (127 polymorphic RFLP fragments). Resynthesized lines varied in yield between 40.9 dt/ha and 21.5 dt/ha, or between 85.1 and 44.6% of check cultivar yields. Relative to check cultivars, hybrids varied from 91.6 to 116.6% in yield and from 94.5 to 103.3% in seed oil content. Mid-parent heterosis varied from −3.5 to 47.2% for yield. The genetic distance of parental lines was not significantly correlated with heterosis or hybrid yield. Although resynthesized lines do not meet the elite rapeseed standards, they are a valuable source for hybrid breeding due to their large distance from present breeding material and their high heterosis when combined with European winter oilseed rape

    Application of kernel functions for accurate similarity search in large chemical databases

    Get PDF
    Background Similaritysearch in chemical structure databases is an important problem with many applications in chemical genomics, drug design, and efficient chemical probe screening among others. It is widely believed that structure based methods provide an efficient way to do the query. Recently various graph kernel functions have been designed to capture the intrinsic similarity of graphs. Though successful in constructing accurate predictive and classification models, graph kernel functions can not be applied to large chemical compound database due to the high computational complexity and the difficulties in indexing similarity search for large databases. Results To bridge graph kernel function and similarity search in chemical databases, we applied a novel kernel-based similarity measurement, developed in our team, to measure similarity of graph represented chemicals. In our method, we utilize a hash table to support new graph kernel function definition, efficient storage and fast search. We have applied our method, named G-hash, to large chemical databases. Our results show that the G-hash method achieves state-of-the-art performance for k-nearest neighbor (k-NN) classification. Moreover, the similarity measurement and the index structure is scalable to large chemical databases with smaller indexing size, and faster query processing time as compared to state-of-the-art indexing methods such as Daylight fingerprints, C-tree and GraphGrep. Conclusions Efficient similarity query processing method for large chemical databases is challenging since we need to balance running time efficiency and similarity search accuracy. Our previous similarity search method, G-hash, provides a new way to perform similarity search in chemical databases. Experimental study validates the utility of G-hash in chemical databases

    Alteration of the embryo transcriptome of hexaploid winter wheat (Triticum aestivum cv. Mercia) during maturation and germination

    Get PDF
    Grain dormancy and germination are areas of biology that are of considerable interest to the cereal community. We have used a 9,155-feature wheat unigene cDNA microarray resource to investigate changes in the wheat embryo transcriptome during late grain development and maturation and during the first 48 h of postimbibition germination. In the embryo 392 mRNAs accumulated by twofold or greater over the time course from 21 days postanthesis (dpa) to 40 dpa and on through 1 and 2 days postgermination. These included mRNAs encoding proteins involved in amino acid biosynthesis and metabolism, cell division and subsequent cell development, signal transduction, lipid metabolism, energy production, protein turnover, respiration, initiation of transcription, initiation of translation and ribosomal composition. A number of mRNAs encoding proteins of unknown function also accumulated over the time course. Conversely 163 sequences showed decreases of twofold or greater over the time course. A small number of mRNAs also showed rapid accumulation specifically during the first 48 h of germination. We also examined alterations in the accumulation of transcripts encoding proteins involved in abscisic acid signalling. Thus, we describe changes in the level of transcripts encoding wheat Viviparous 1 (Vp1) and other interacting proteins. Interestingly, the transcript encoding wheat Viviparous-interacting protein 1 showed a pattern of accumulation that correlates inversely with germination. Our data suggests that the majority of the transcripts required for germination accumulate in the embryo prior to germination and we discuss the implications of these findings with regard to manipulation of germination in wheat

    Gene coexpression clusters and putative regulatory elements underlying seed storage reserve accumulation in Arabidopsis

    Get PDF
    Abstract Background In Arabidopsis, a large number of genes involved in the accumulation of seed storage reserves during seed development have been characterized, but the relationship of gene expression and regulation underlying this physiological process remains poorly understood. A more holistic view of this molecular interplay will help in the further study of the regulatory mechanisms controlling seed storage compound accumulation. Results We identified gene coexpression networks in the transcriptome of developing Arabidopsis (Arabidopsis thaliana) seeds from the globular to mature embryo stages by analyzing publicly accessible microarray datasets. Genes encoding the known enzymes in the fatty acid biosynthesis pathway were found in one coexpression subnetwork (or cluster), while genes encoding oleosins and seed storage proteins were identified in another subnetwork with a distinct expression profile. In the triacylglycerol assembly pathway, only the genes encoding diacylglycerol acyltransferase 1 (DGAT1) and a putative cytosolic "type 3" DGAT exhibited a similar expression pattern with genes encoding oleosins. We also detected a large number of putative cis-acting regulatory elements in the promoter regions of these genes, and promoter motifs for LEC1 (LEAFY COTYLEDON 1), DOF (DNA-binding-with-One-Finger), GATA, and MYB transcription factors (TF), as well as SORLIP5 (Sequences Over-Represented in Light-Induced Promoters 5), are overrepresented in the promoter regions of fatty acid biosynthetic genes. The conserved CCAAT motifs for B3-domain TFs and binding sites for bZIP (basic-leucine zipper) TFs are enriched in the promoters of genes encoding oleosins and seed storage proteins. Conclusions Genes involved in the accumulation of seed storage reserves are expressed in distinct patterns and regulated by different TFs. The gene coexpression clusters and putative regulatory elements presented here provide a useful resource for further experimental characterization of protein interactions and regulatory networks in this process.</p

    Generation, Annotation and Analysis of First Large-Scale Expressed Sequence Tags from Developing Fiber of Gossypium barbadense L

    Get PDF
    BACKGROUND: Cotton fiber is the world's leading natural fiber used in the manufacture of textiles. Gossypium is also the model plant in the study of polyploidization, evolution, cell elongation, cell wall development, and cellulose biosynthesis. G. barbadense L. is an ideal candidate for providing new genetic variations useful to improve fiber quality for its superior properties. However, little is known about fiber development mechanisms of G. barbadense and only a few molecular resources are available in GenBank. METHODOLOGY AND PRINCIPAL FINDINGS: In total, 10,979 high-quality expressed sequence tags (ESTs) were generated from a normalized fiber cDNA library of G. barbadense. The ESTs were clustered and assembled into 5852 unigenes, consisting of 1492 contigs and 4360 singletons. The blastx result showed 2165 unigenes with significant similarity to known genes and 2687 unigenes with significant similarity to genes of predicted proteins. Functional classification revealed that unigenes were abundant in the functions of binding, catalytic activity, and metabolic pathways of carbohydrate, amino acid, energy, and lipids. The function motif/domain-related cytoskeleton and redox homeostasis were enriched. Among the 5852 unigenes, 282 and 736 unigenes were identified as potential cell wall biosynthesis and transcription factors, respectively. Furthermore, the relationships among cotton species or between cotton and other model plant systems were analyzed. Some putative species-specific unigenes of G. barbadense were highlighted. CONCLUSIONS/SIGNIFICANCE: The ESTs generated in this study are from the first large-scale EST project for G. barbadense and significantly enhance the number of G. barbadense ESTs in public databases. This knowledge will contribute to cotton improvements by studying fiber development mechanisms of G. barbadense, establishing a breeding program using marker-assisted selection, and discovering candidate genes related to important agronomic traits of cotton through oligonucleotide array. Our work will also provide important resources for comparative genomics, polyploidization, and genome evolution among Gossypium species
    corecore