51 research outputs found

    ProbCD: enrichment analysis accounting for categorization uncertainty

    Get PDF
    As in many other areas of science, systems biology makes extensive use of statistical association and significance estimates in contingency tables, a type of categorical data analysis known in this field as enrichment (also over-representation or enhancement) analysis. In spite of efforts to create probabilistic annotations, especially in the Gene Ontology context, or to deal with uncertainty in high throughput-based datasets, current enrichment methods largely ignore this probabilistic information since they are mainly based on variants of the Fisher Exact Test. We developed an open-source R package to deal with probabilistic categorical data analysis, ProbCD, that does not require a static contingency table. The contingency table for
the enrichment problem is built using the expectation of a Bernoulli Scheme stochastic process given the categorization probabilities. An on-line interface was created to allow usage by non-programmers and is available at: http://xerad.systemsbiology.net/ProbCD/. We present an analysis framework and software tools to address the issue of uncertainty in categorical data analysis. In particular, concerning the enrichment analysis, ProbCD can accommodate: (i) the stochastic nature of the high-throughput experimental techniques and (ii) probabilistic gene annotation

    MediPlEx - a tool to combine in silico & experimental gene expression profiles of the model legume Medicago truncatula

    Get PDF
    Henckel K, Küster H, Stutz L, Goesmann A. MediPlEx - a tool to combine in silico and experimental gene expression profiles of the model legume Medicago truncatula. BMC Research Notes. 2010;3(1): 262.BACKGROUND:Expressed Sequence Tags (ESTs) are in general used to gain a first insight into gene activities from a species of interest. Subsequently, and typically based on a combination of EST and genome sequences, microarray-based expression analyses are performed for a variety of conditions. In some cases, a multitude of EST and microarray experiments are conducted for one species, covering different tissues, cell states, and cell types. Under these circumstances, the challenge arises to combine results derived from the different expression profiling strategies, with the goal to uncover novel information on the basis of the integrated datasets.FINDINGS:Using our new application, MediPlEx (MEDIcago truncatula multiPLe EXpression analysis), expression data from EST experiments, oligonucleotide microarrays and Affymetrix GeneChips can be combined and analyzed, leading to a novel approach to integrated transcriptome analysis. We have validated our tool via the identification of a set of well-characterized AM-specific and AM-induced marker genes, identified by MediPlEx on the basis of in silico and experimental gene expression profiles from roots colonized with AM fungi.CONCLUSIONS:MediPlEx offers an integrated analysis pipeline for different sets of expression data generated for the model legume Medicago truncatula. As expected, in silico and experimental gene expression data that cover the same biological condition correlate well. The collection of differentially expressed genes identified via MediPlEx provides a starting point for functional studies in plant mutants. MediPlEx can freely be used at http://www.cebitec.uni-bielefeld.de/mediplex

    Effect of SGLT2 inhibitors on stroke and atrial fibrillation in diabetic kidney disease: Results from the CREDENCE trial and meta-analysis

    Get PDF
    BACKGROUND AND PURPOSE: Chronic kidney disease with reduced estimated glomerular filtration rate or elevated albuminuria increases risk for ischemic and hemorrhagic stroke. This study assessed the effects of sodium glucose cotransporter 2 inhibitors (SGLT2i) on stroke and atrial fibrillation/flutter (AF/AFL) from CREDENCE (Canagliflozin and Renal Events in Diabetes With Established Nephropathy Clinical Evaluation) and a meta-Analysis of large cardiovascular outcome trials (CVOTs) of SGLT2i in type 2 diabetes mellitus. METHODS: CREDENCE randomized 4401 participants with type 2 diabetes mellitus and chronic kidney disease to canagliflozin or placebo. Post hoc, we estimated effects on fatal or nonfatal stroke, stroke subtypes, and intermediate markers of stroke risk including AF/AFL. Stroke and AF/AFL data from 3 other completed large CVOTs and CREDENCE were pooled using random-effects meta-Analysis. RESULTS: In CREDENCE, 142 participants experienced a stroke during follow-up (10.9/1000 patient-years with canagliflozin, 14.2/1000 patient-years with placebo; hazard ratio [HR], 0.77 [95% CI, 0.55-1.08]). Effects by stroke subtypes were: ischemic (HR, 0.88 [95% CI, 0.61-1.28]; n=111), hemorrhagic (HR, 0.50 [95% CI, 0.19-1.32]; n=18), and undetermined (HR, 0.54 [95% CI, 0.20-1.46]; n=17). There was no clear effect on AF/AFL (HR, 0.76 [95% CI, 0.53-1.10]; n=115). The overall effects in the 4 CVOTs combined were: Total stroke (HRpooled, 0.96 [95% CI, 0.82-1.12]), ischemic stroke (HRpooled, 1.01 [95% CI, 0.89-1.14]), hemorrhagic stroke (HRpooled, 0.50 [95% CI, 0.30-0.83]), undetermined stroke (HRpooled, 0.86 [95% CI, 0.49-1.51]), and AF/AFL (HRpooled, 0.81 [95% CI, 0.71-0.93]). There was evidence that SGLT2i effects on total stroke varied by baseline estimated glomerular filtration rate (P=0.01), with protection in the lowest estimated glomerular filtration rate (45 mL/min/1.73 m2]) subgroup (HRpooled, 0.50 [95% CI, 0.31-0.79]). CONCLUSIONS: Although we found no clear effect of SGLT2i on total stroke in CREDENCE or across trials combined, there was some evidence of benefit in preventing hemorrhagic stroke and AF/AFL, as well as total stroke for those with lowest estimated glomerular filtration rate. Future research should focus on confirming these data and exploring potential mechanisms

    Deep RNA Sequencing Reveals Novel Cardiac Transcriptomic Signatures for Physiological and Pathological Hypertrophy

    Get PDF
    Although both physiological hypertrophy (PHH) and pathological hypertrophy (PAH) of the heart have similar morphological appearances, only PAH leads to fatal heart failure. In the present study, we used RNA sequencing (RNA-Seq) to determine the transcriptomic signatures for both PHH and PAH. Approximately 13–20 million reads were obtained for both models, among which PAH showed more differentially expressed genes (DEGs) (2,041) than PHH (245). The expression of 417 genes was barely detectable in the normal heart but was suddenly activated in PAH. Among them, Foxm1 and Plk1 are of particular interest, since Ingenuity Pathway Analysis (IPA) using DEGs and upstream motif analysis showed that they are essential hub proteins that regulate the expression of downstream proteins associated with PAH. Meanwhile, 52 genes related to collagen, chemokines, and actin showed opposite expression patterns between PHH and PAH. MAZ-binding motifs were enriched in the upstream region of the participating genes. Alternative splicing (AS) of exon variants was also examined using RNA-Seq data for PAH and PHH. We found 317 and 196 exon inclusions and exon exclusions, respectively, for PAH, and 242 and 172 exon inclusions and exclusions, respectively for PHH. The AS pattern was mostly related to gains or losses of domains, changes in activity, and localization of the encoded proteins. The splicing variants of 8 genes (i.e., Fhl1, Rcan1, Ndrg2, Synpo, Ttll1, Cxxc5, Egfl7, and Tmpo) were experimentally confirmed. Multilateral pathway analysis showed that the patterns of quantitative (DEG) and qualitative (AS) changes differ depending on the type of pathway in PAH and PHH. One of the most significant changes in PHH is the severe downregulation of autoimmune pathways accompanied by significant AS. These findings revealed the unique transcriptomic signatures of PAH and PHH and also provided a more comprehensive understanding at both the quantitative and qualitative levels

    Gene expression down-regulation in CD90+ prostate tumor-associated stromal cells involves potential organ-specific genes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The prostate stroma is a key mediator of epithelial differentiation and development, and potentially plays a role in the initiation and progression of prostate cancer. The tumor-associated stroma is marked by increased expression of CD90/THY1. Isolation and characterization of these stromal cells could provide valuable insight into the biology of the tumor microenvironment.</p> <p>Methods</p> <p>Prostate CD90<sup>+ </sup>stromal fibromuscular cells from tumor specimens were isolated by cell-sorting and analyzed by DNA microarray. Dataset analysis was used to compare gene expression between histologically normal and tumor-associated stromal cells. For comparison, stromal cells were also isolated and analyzed from the urinary bladder.</p> <p>Results</p> <p>The tumor-associated stromal cells were found to have decreased expression of genes involved in smooth muscle differentiation, and those detected in prostate but not bladder. Other differential expression between the stromal cell types included that of the CXC-chemokine genes.</p> <p>Conclusion</p> <p>CD90<sup>+ </sup>prostate tumor-associated stromal cells differed from their normal counterpart in expression of multiple genes, some of which are potentially involved in organ development.</p

    Signal transduction-related responses to phytohormones and environmental challenges in sugarcane

    Get PDF
    BACKGROUND: Sugarcane is an increasingly economically and environmentally important C4 grass, used for the production of sugar and bioethanol, a low-carbon emission fuel. Sugarcane originated from crosses of Saccharum species and is noted for its unique capacity to accumulate high amounts of sucrose in its stems. Environmental stresses limit enormously sugarcane productivity worldwide. To investigate transcriptome changes in response to environmental inputs that alter yield we used cDNA microarrays to profile expression of 1,545 genes in plants submitted to drought, phosphate starvation, herbivory and N(2)-fixing endophytic bacteria. We also investigated the response to phytohormones (abscisic acid and methyl jasmonate). The arrayed elements correspond mostly to genes involved in signal transduction, hormone biosynthesis, transcription factors, novel genes and genes corresponding to unknown proteins. RESULTS: Adopting an outliers searching method 179 genes with strikingly different expression levels were identified as differentially expressed in at least one of the treatments analysed. Self Organizing Maps were used to cluster the expression profiles of 695 genes that showed a highly correlated expression pattern among replicates. The expression data for 22 genes was evaluated for 36 experimental data points by quantitative RT-PCR indicating a validation rate of 80.5% using three biological experimental replicates. The SUCAST Database was created that provides public access to the data described in this work, linked to tissue expression profiling and the SUCAST gene category and sequence analysis. The SUCAST database also includes a categorization of the sugarcane kinome based on a phylogenetic grouping that included 182 undefined kinases. CONCLUSION: An extensive study on the sugarcane transcriptome was performed. Sugarcane genes responsive to phytohormones and to challenges sugarcane commonly deals with in the field were identified. Additionally, the protein kinases were annotated based on a phylogenetic approach. The experimental design and statistical analysis applied proved robust to unravel genes associated with a diverse array of conditions attributing novel functions to previously unknown or undefined genes. The data consolidated in the SUCAST database resource can guide further studies and be useful for the development of improved sugarcane varieties

    Canagliflozin and Cardiovascular and Renal Outcomes in Type 2 Diabetes Mellitus and Chronic Kidney Disease in Primary and Secondary Cardiovascular Prevention Groups

    Get PDF
    Background: Canagliflozin reduces the risk of kidney failure in patients with type 2 diabetes mellitus and chronic kidney disease, but effects on specific cardiovascular outcomes are uncertain, as are effects in people without previous cardiovascular disease (primary prevention). Methods: In CREDENCE (Canagliflozin and Renal Events in Diabetes With Established Nephropathy Clinical Evaluation), 4401 participants with type 2 diabetes mellitus and chronic kidney disease were randomly assigned to canagliflozin or placebo on a background of optimized standard of care. Results: Primary prevention participants (n=2181, 49.6%) were younger (61 versus 65 years), were more often female (37% versus 31%), and had shorter duration of diabetes mellitus (15 years versus 16 years) compared with secondary prevention participants (n=2220, 50.4%). Canagliflozin reduced the risk of major cardiovascular events overall (hazard ratio [HR], 0.80 [95% CI, 0.67-0.95]; P=0.01), with consistent reductions in both the primary (HR, 0.68 [95% CI, 0.49-0.94]) and secondary (HR, 0.85 [95% CI, 0.69-1.06]) prevention groups (P for interaction=0.25). Effects were also similar for the components of the composite including cardiovascular death (HR, 0.78 [95% CI, 0.61-1.00]), nonfatal myocardial infarction (HR, 0.81 [95% CI, 0.59-1.10]), and nonfatal stroke (HR, 0.80 [95% CI, 0.56-1.15]). The risk of the primary composite renal outcome and the composite of cardiovascular death or hospitalization for heart failure were also consistently reduced in both the primary and secondary prevention groups (P for interaction &gt;0.5 for each outcome). Conclusions: Canagliflozin significantly reduced major cardiovascular events and kidney failure in patients with type 2 diabetes mellitus and chronic kidney disease, including in participants who did not have previous cardiovascular disease

    An Expanded Evaluation of Protein Function Prediction Methods Shows an Improvement In Accuracy

    Get PDF
    Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent

    An expanded evaluation of protein function prediction methods shows an improvement in accuracy

    Get PDF
    Background: A major bottleneck in our understanding of the molecular underpinnings of life is the assignment of function to proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, assessing methods for protein function prediction and tracking progress in the field remain challenging. Results: We conducted the second critical assessment of functional annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. We evaluated 126 methods from 56 research groups for their ability to predict biological functions using Gene Ontology and gene-disease associations using Human Phenotype Ontology on a set of 3681 proteins from 18 species. CAFA2 featured expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis compared the best methods from CAFA1 to those of CAFA2. Conclusions: The top-performing methods in CAFA2 outperformed those from CAFA1. This increased accuracy can be attributed to a combination of the growing number of experimental annotations and improved methods for function prediction. The assessment also revealed that the definition of top-performing algorithms is ontology specific, that different performance metrics can be used to probe the nature of accurate predictions, and the relative diversity of predictions in the biological process and human phenotype ontologies. While there was methodological improvement between CAFA1 and CAFA2, the interpretation of results and usefulness of individual methods remain context-dependent. Keywords: Protein function prediction, Disease gene prioritizationpublishedVersio
    corecore