14 research outputs found

    Improving GSEA for Analysis of Biologic Pathways for Differential Gene Expression across a Binary Phenotype

    Get PDF
    Gene-set analysis evaluates the expression of biological pathways, or a priori defined gene sets, rather than that of single genes, in association with a binary phenotype, and is of great biologic interest in many DNA microarray studies. Gene Set Enrichment Analysis (GSEA) has been applied widely as a tool for gene-set analyses. We describe here some critical problems with GSEA and propose an alternative method by extending the single-gene analysis method, Significance Analysis of Microarray (SAM), to gene-set analyses (SAM-GS). Specifically, we illustrate, in a simulation study, that GSEA gives statistical significance to gene sets that have no gene associated with the phenotype (null gene sets), and has very low power to detect gene sets in which half the genes are highly associated with the phenotype (truly-associated gene sets). SAM-GS, on the other hand, performs perfectly in the simulation study: none of the null gene sets is identified with statistical significance, while all of the truly-associated gene sets are. The two methods are also compared in the analyses of three real microarray datasets and relevant pathways, the diverging results of which clearly show the advantages of SAM-GS over GSEA, both statistically and biologically

    Improving gene set analysis of microarray data by SAM-GS

    Get PDF
    <p>Abstract</p> <p>Background</p> <p><it>Gene-set </it>analysis evaluates the expression of biological pathways, or <it>a priori </it>defined gene sets, rather than that of individual genes, in association with a binary phenotype, and is of great biologic interest in many DNA microarray studies. Gene Set Enrichment Analysis (GSEA) has been applied widely as a tool for gene-set analyses. We describe here some critical problems with GSEA and propose an alternative method by extending the individual-gene analysis method, Significance Analysis of Microarray (SAM), to gene-set analyses (SAM-GS).</p> <p>Results</p> <p>Using a mouse microarray dataset with simulated gene sets, we illustrate that GSEA gives statistical significance to gene sets that have no gene associated with the phenotype (null gene sets), and has very low power to detect gene sets in which half the genes are moderately or strongly associated with the phenotype (truly-associated gene sets). SAM-GS, on the other hand, performs very well. The two methods are also compared in the analyses of three real microarray datasets and relevant pathways, the diverging results of which clearly show advantages of SAM-GS over GSEA, both statistically and biologically. In a microarray study for identifying biological pathways whose gene expressions are associated with <it>p53 </it>mutation in cancer cell lines, we found biologically relevant performance differences between the two methods. Specifically, there are 31 additional pathways identified as significant by SAM-GS over GSEA, that are associated with the presence vs. absence of <it>p53</it>. Of the 31 gene sets, 11 actually involve <it>p53 </it>directly as a member. A further 6 gene sets directly involve the extrinsic and intrinsic apoptosis pathways, 3 involve the cell-cycle machinery, and 3 involve cytokines and/or JAK/STAT signaling. Each of these 12 gene sets, then, is in a direct, well-established relationship with aspects of <it>p53 </it>signaling. Of the remaining 8 gene sets, 6 have plausible, if less well established, links with <it>p53</it>.</p> <p>Conclusion</p> <p>We conclude that GSEA has important limitations as a gene-set analysis approach for microarray experiments for identifying biological pathways associated with a binary phenotype. As an alternative statistically-sound method, we propose SAM-GS. A free Excel Add-In for performing SAM-GS is available for public use.</p

    A molecular classifier for predicting future graft loss in late kidney transplant biopsies

    No full text
    Kidney transplant recipients that develop signs of renal dysfunction or proteinuria one or more years after transplantation are at considerable risk for progression to renal failure. To assess the kidney at this time, a “for-cause” biopsy is performed, but this provides little indication as to which recipients will go on to organ failure. In an attempt to identify molecules that could provide this information, we used micorarrays to analyze gene expression in 105 for-cause biopsies taken between 1 and 31 years after transplantation. Using supervised principal components analysis, we derived a molecular classifier to predict graft loss. The genes associated with graft failure were related to tissue injury, epithelial dedifferentiation, matrix remodeling, and TGF-β effects and showed little overlap with rejection-associated genes. We assigned a prognostic molecular risk score to each patient, identifying those at high or low risk for graft loss. The molecular risk score was correlated with interstitial fibrosis, tubular atrophy, tubulitis, interstitial inflammation, proteinuria, and glomerular filtration rate. In multivariate analysis, molecular risk score, peritubular capillary basement membrane multilayering, arteriolar hyalinosis, and proteinuria were independent predictors of graft loss. In an independent validation set, the molecular risk score was the only predictor of graft loss. Thus, the molecular risk score reflects active injury and is superior to either scarring or function in predicting graft failure

    Transcriptomic Signatures of End-Stage Human Dilated Cardiomyopathy Hearts with and without Left Ventricular Assist Device Support

    No full text
    Left ventricular assist device (LVAD) use in patients with dilated cardiomyopathy (DCM) can lead to a differential response in the LV and right ventricle (RV), and RV failure remains the most common complication post-LVAD insertion. We assessed transcriptomic signatures in end-stage DCM, and evaluated changes in gene expression (mRNA) and regulation (microRNA/miRNA) following LVAD. LV and RV free-wall tissues were collected from end-stage DCM hearts with (n = 8) and without LVAD (n = 8). Non-failing control tissues were collected from donated hearts (n = 6). Gene expression (for mRNAs/miRNAs) was determined using microarrays. Our results demonstrate that immune response, oxygen homeostasis, and cellular physiological processes were the most enriched pathways among differentially expressed genes in both ventricles of end-stage DCM hearts. LV genes involved in circadian rhythm, muscle contraction, cellular hypertrophy, and extracellular matrix (ECM) remodelling were differentially expressed. In the RV, genes related to the apelin signalling pathway were affected. Following LVAD use, immune response genes improved in both ventricles; oxygen homeostasis and ECM remodelling genes improved in the LV and, four miRNAs normalized. We conclude that LVAD reduced the expression and induced additional transcriptomic changes of various mRNAs and miRNAs as an integral component of the reverse ventricular remodelling in a chamber-specific manner
    corecore