21 research outputs found

    Cross-study validation for the assessment of prediction algorithms

    Get PDF
    Motivation: Numerous competing algorithms for prediction in high-dimensional settings have been developed in the statistical and machine-learning literature. Learning algorithms and the prediction models they generate are typically evaluated on the basis of cross-validation error estimates in a few exemplary datasets. However, in most applications, the ultimate goal of prediction modeling is to provide accurate predictions for independent samples obtained in different settings. Cross-validation within exemplary datasets may not adequately reflect performance in the broader application context. Methods: We develop and implement a systematic approach to ‘cross-study validation’, to replace or supplement conventional cross-validation when evaluating high-dimensional prediction models in independent datasets. We illustrate it via simulations and in a collection of eight estrogen-receptor positive breast cancer microarray gene-expression datasets, where the objective is predicting distant metastasis-free survival (DMFS). We computed the C-index for all pairwise combinations of training and validation datasets. We evaluate several alternatives for summarizing the pairwise validation statistics, and compare these to conventional cross-validation. Results: Our data-driven simulations and our application to survival prediction with eight breast cancer microarray datasets, suggest that standard cross-validation produces inflated discrimination accuracy for all algorithms considered, when compared to cross-study validation. Furthermore, the ranking of learning algorithms differs, suggesting that algorithms performing best in cross-validation may be suboptimal when evaluated through independent validation. Availability: The survHD: Survival in High Dimensions package (http://www.bitbucket.org/lwaldron/survhd) will be made available through Bioconductor. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online

    Identification of glucocorticoid-related molecular signature by whole blood methylome analysis

    Full text link
    Objective Cushing's syndrome represents a state of excessive glucocorticoids related to glucocorticoid treatments or to endogenous hypercortisolism. Cushing's syndrome is associated with high morbidity, with significant inter-individual variability. Likewise, adrenal insufficiency is a life-threatening condition of cortisol deprivation. Currently, hormone assays contribute to identify Cushing's syndrome or adrenal insufficiency. However, no biomarker directly quantifies the biological glucocorticoid action. The aim of this study was to identify such markers. Design We evaluated whole blood DNA methylome in 94 samples obtained from patients with different glucocorticoid states (Cushing's syndrome, eucortisolism, adrenal insufficiency). We used an independent cohort of 91 samples for validation. Methods Leukocyte DNA was obtained from whole blood samples. Methylome was determined using the Illumina methylation chip array (~850 000 CpG sites). Both unsupervised (principal component analysis) and supervised (Limma) methods were used to explore methylome profiles. A Lasso-penalized regression was used to select optimal discriminating features. Results Whole blood methylation profile was able to discriminate samples by their glucocorticoid status: glucocorticoid excess was associated with DNA hypomethylation, recovering within months after Cushing's syndrome correction. In Cushing's syndrome, an enrichment in hypomethylated CpG sites was observed in the region of FKBP5 gene locus. A methylation predictor of glucocorticoid excess was built on a training cohort and validated on two independent cohorts. Potential CpG sites associated with the risk for specific complications, such as glucocorticoid-related hypertension or osteoporosis, were identified, needing now to be confirmed on independent cohorts. Conclusions Whole blood DNA methylome is dynamically impacted by glucocorticoids. This biomarker could contribute to better assessment of glucocorticoid action beyond hormone assays

    Whole blood methylome-derived features to discriminate endocrine hypertension

    Get PDF
    Background: Arterial hypertension represents a worldwide health burden and a major risk factor for cardiovascular morbidity and mortality. Hypertension can be primary (primary hypertension, PHT), or secondary to endocrine disorders (endocrine hypertension, EHT), such as Cushing's syndrome (CS), primary aldosteronism (PA), and pheochromocytoma/paraganglioma (PPGL). Diagnosis of EHT is currently based on hormone assays. Efficient detection remains challenging, but is crucial to properly orientate patients for diagnostic confirmation and specific treatment. More accurate biomarkers would help in the diagnostic pathway. We hypothesized that each type of endocrine hypertension could be associated with a specific blood DNA methylation signature, which could be used for disease discrimination. To identify such markers, we aimed at exploring the methylome profiles in a cohort of 255 patients with hypertension, either PHT (n = 42) or EHT (n = 213), and at identifying specific discriminating signatures using machine learning approaches. Results: Unsupervised classification of samples showed discrimination of PHT from EHT. CS patients clustered separately from all other patients, whereas PA and PPGL showed an overall overlap. Global methylation was decreased in the CS group compared to PHT. Supervised comparison with PHT identified differentially methylated CpG sites for each type of endocrine hypertension, showing a diffuse genomic location. Among the most differentially methylated genes, FKBP5 was identified in the CS group. Using four different machine learning methods—Lasso (Least Absolute Shrinkage and Selection Operator), Logistic Regression, Random Forest, and Support Vector Machine—predictive models for each type of endocrine hypertension were built on training cohorts (80% of samples for each hypertension type) and estimated on validation cohorts (20% of samples for each hypertension type). Balanced accuracies ranged from 0.55 to 0.74 for predicting EHT, 0.85 to 0.95 for predicting CS, 0.66 to 0.88 for predicting PA, and 0.70 to 0.83 for predicting PPGL. Conclusions: The blood DNA methylome can discriminate endocrine hypertension, with methylation signatures for each type of endocrine disorder

    Expression Profiling of Major Histocompatibility and Natural Killer Complex Genes Reveals Candidates for Controlling Risk of Graft versus Host Disease

    Get PDF
    Background: The major histocompatibility complex (MHC) is the most important genomic region that contributes to the risk of graft versus host disease (GVHD) after haematopoietic stem cell transplantation. Matching of MHC class I and II genes is essential for the success of transplantation. However, the MHC contains additional genes that also contribute to the risk of developing acute GVHD. It is difficult to identify these genes by genetic association studies alone due to linkage disequilibrium in this region. Therefore, we aimed to identify MHC genes and other genes involved in the pathophysiology of GVHD by mRNA expression profiling. Methodology/Principal Findings: To reduce the complexity of the task, we used genetically well-defined rat inbred strains and a rat skin explant assay, an in-vitro-model of the graft versus host reaction (GVHR), to analyze the expression of MHC, natural killer complex (NKC), and other genes in cutaneous GVHR. We observed a statistically significant and strong up or down regulation of 11 MHC, 6 NKC, and 168 genes encoded in other genomic regions, i.e. 4.9%, 14.0%, and 2.6% of the tested genes respectively. The regulation of 7 selected MHC and 3 NKC genes was confirmed by quantitative real-time PCR and in independent skin explant assays. In addition, similar regulations of most of the selected genes were observed in GVHD-affected skin lesions of transplanted rats and in human skin explant assays. Conclusions/Significance: We identified rat and human MHC and NKC genes that are regulated during GVHR in skin explant assays and could therefore serve as biomarkers for GVHD. Several of the respective human genes, including HLA-DMB, C2, AIF1, SPR1, UBD, and OLR1, are polymorphic. These candidates may therefore contribute to the genetic risk of GVHD in patients

    Comprehensive Molecular Characterization of Pheochromocytoma and Paraganglioma

    Get PDF
    SummaryWe report a comprehensive molecular characterization of pheochromocytomas and paragangliomas (PCCs/PGLs), a rare tumor type. Multi-platform integration revealed that PCCs/PGLs are driven by diverse alterations affecting multiple genes and pathways. Pathogenic germline mutations occurred in eight PCC/PGL susceptibility genes. We identified CSDE1 as a somatically mutated driver gene, complementing four known drivers (HRAS, RET, EPAS1, and NF1). We also discovered fusion genes in PCCs/PGLs, involving MAML3, BRAF, NGFR, and NF1. Integrated analysis classified PCCs/PGLs into four molecularly defined groups: a kinase signaling subtype, a pseudohypoxia subtype, a Wnt-altered subtype, driven by MAML3 and CSDE1, and a cortical admixture subtype. Correlates of metastatic PCCs/PGLs included the MAML3 fusion gene. This integrated molecular characterization provides a comprehensive foundation for developing PCC/PGL precision medicine

    Restructuring of the male mice peripheral circadian network after bariatric surgery

    No full text
    Bariatric surgery is still the most effective long-term weight-loss therapy. Recent data indicate that surgical outcomes may be affected by diurnal food intake patterns. In this study, we aimed to investigate how surgery-induced metabolic adaptations (i.e. weight loss) interact with circadian clock function. For that reason, vertical sleeve gastrectomy (VSG) was performed in obese mice and rhythms in behavior, tissue rhythmicity, and white adipose tissue transcriptome were evaluated. VSG under constant darkness conditions led to a maximum weight loss of 18% compared to a loss of 3% after sham surgery. Post-surgical weight development was characterized by two distinct intervals of catabolic and subsequent anabolic metabolic state. Locomotor activity was not affected. However, VSG significantly increased active phase meal frequency in the anabolic state. No significant effects on clock gene rhythmicity were detected in adrenal and white adipose tissue (WAT) explant cultures. Transcriptome rhythm analyses of subcutaneous WAT revealed a reduction of cycling genes after VSG (sham: 2493 vs VSG: 1013) independent of sustained rhythms in core clock gene expression. This may be a consequence of weight loss-induced morphological reconstruction of WAT that overwrites the direct influence of the local clock machinery on the transcriptome. However, VSG altered rhythmic transcriptional regulation of WAT lipid metabolism pathways. Thus, our data suggest a reorganization of diurnal metabolic rhythms after VSG downstream of the molecular clock machinery.Bariatric surgery is still the most effective long-term weight-loss therapy. Recent data indicate that surgical outcomes may be affected by diurnal food intake patterns. In this study, we aimed to investigate how surgery-induced metabolic adaptations (i.e. weight loss) interact with circadian clock function. For that reason, vertical sleeve gastrectomy (VSG) was performed in obese mice and rhythms in behavior, tissue rhythmicity, and white adipose tissue transcriptome were evaluated. VSG under constant darkness conditions led to a maximum weight loss of 18% compared to a loss of 3% after sham surgery. Post-surgical weight development was characterized by two distinct intervals of catabolic and subsequent anabolic metabolic state. Locomotor activity was not affected. However, VSG significantly increased active phase meal frequency in the anabolic state. No significant effects on clock gene rhythmicity were detected in adrenal and white adipose tissue (WAT) explant cultures. Transcriptome rhythm analyses of subcutaneous WAT revealed a reduction of cycling genes after VSG (sham: 2493 vs VSG: 1013) independent of sustained rhythms in core clock gene expression. This may be a consequence of weight loss-induced morphological reconstruction of WAT that overwrites the direct influence of the local clock machinery on the transcriptome. However, VSG altered rhythmic transcriptional regulation of WAT lipid metabolism pathways. Thus, our data suggest a reorganization of diurnal metabolic rhythms after VSG downstream of the molecular clock machinery
    corecore