395 research outputs found

    Proteomics biomarker discovery for individualized prevention of familial pancreatic cancer using statistical learning

    Get PDF
    BACKGROUND: The low five-year survival rate of pancreatic ductal adenocarcinoma (PDAC) and the low diagnostic rate of early-stage PDAC via imaging highlight the need to discover novel biomarkers and improve the current screening procedures for early diagnosis. Familial pancreatic cancer (FPC) describes the cases of PDAC that are present in two or more individuals within a circle of first-degree relatives. Using innovative high-throughput proteomics, we were able to quantify the protein profiles of individuals at risk from FPC families in different potential pre-cancer stages. However, the high-dimensional proteomics data structure challenges the use of traditional statistical analysis tools. Hence, we applied advanced statistical learning methods to enhance the analysis and improve the results’ interpretability. METHODS: We applied model-based gradient boosting and adaptive lasso to deal with the small, unbalanced study design via simultaneous variable selection and model fitting. In addition, we used stability selection to identify a stable subset of selected biomarkers and, as a result, obtain even more interpretable results. In each step, we compared the performance of the different analytical pipelines and validated our approaches via simulation scenarios. RESULTS: In the simulation study, model-based gradient boosting showed a more accurate prediction performance in the small, unbalanced, and high-dimensional datasets than adaptive lasso and could identify more relevant variables. Furthermore, using model-based gradient boosting, we discovered a subset of promising serum biomarkers that may potentially improve the current screening procedure of FPC. CONCLUSION: Advanced statistical learning methods helped us overcome the shortcomings of an unbalanced study design in a valuable clinical dataset. The discovered serum biomarkers provide us with a clear direction for further investigations and more precise clinical hypotheses regarding the development of FPC and optimal strategies for its early detection

    Comparison of genetic risk prediction models to improve prediction of coronary heart disease in two large cohorts of the MONICA/KORA study

    Get PDF
    It is still unclear how genetic information, provided as single-nucleotide polymorphisms (SNPs), can be most effectively integrated into risk prediction models for coronary heart disease (CHD) to add significant predictive value beyond clinical risk models. For the present study, a population-based case-cohort was used as a trainingset (451 incident cases, 1488 noncases) and an independent cohort as testset (160 incident cases, 2749 noncases). The following strategies to quantify genetic information were compared: A weighted genetic risk score including Metabochip SNPs associated with CHD in the literature (GRSMetabo ); selection of the most predictive SNPs among these literature-confirmed variants using priority-Lasso (PLMetabo ); validation of two comprehensive polygenic risk scores: GRSGola based on Metabochip data, and GRSKhera (available in the testset only) based on cross-validated genome-wide genotyping data. We used Cox regression to assess associations with incident CHD. C-index, category-free net reclassification index (cfNRI) and relative integrated discrimination improvement (IDIrel ) were used to quantify the predictive performance of genetic information beyond Framingham risk score variables. In contrast to GRSMetabo and PLMetabo , GRSGola significantly improved the prediction (delta C-index [95% confidence interval]: 0.0087 [0.0044, 0.0130]; IDIrel : 0.0509 [0.0131, 0.0894]; cfNRI improved only in cases: 0.1761 [0.0253, 0.3219]). GRSKhera yielded slightly worse prediction results than GRSGola

    Atrial fibrillation genetic risk differentiates cardioembolic stroke from other stroke subtypes

    Get PDF
    Objective We sought to assess whether genetic risk factors for atrial fibrillation (AF) can explain cardioembolic stroke risk. Methods We evaluated genetic correlations between a previous genetic study of AF and AF in the presence of cardioembolic stroke using genome-wide genotypes from the Stroke Genetics Network (N = 3,190 AF cases, 3,000 cardioembolic stroke cases, and 28,026 referents). We tested whether a previously validated AF polygenic risk score (PRS) associated with cardioembolic and other stroke subtypes after accounting for AF clinical risk factors. Results We observed a strong correlation between previously reported genetic risk for AF, AF in the presence of stroke, and cardioembolic stroke (Pearson r = 0.77 and 0.76, respectively, across SNPs with p 0.1). Conclusion: s Genetic risk of AF is associated with cardioembolic stroke, independent of clinical risk factors. Studies are warranted to determine whether AF genetic risk can serve as a biomarker for strokes caused by AF

    Genome-wide meta-analysis of phytosterols reveals five novel loci and a detrimental effect on coronary atherosclerosis

    Get PDF
    Phytosterol serum concentrations are under tight genetic control. The relationship between phytosterols and coronary artery disease (CAD) is controversially discussed. We perform a genome-wide meta-analysis of 32 phytosterol traits reflecting resorption, cholesterol synthesis and esterification in six studies with up to 9758 subjects and detect ten independent genome-wide significant SNPs at seven genomic loci. We confirm previously established associations at ABCG5/8 and ABO and demonstrate an extended locus heterogeneity at ABCG5/8 with different functional mechanisms. New loci comprise HMGCR, NPC1L1, PNLIPRP2, SCARB1 and APOE. Based on these results, we perform Mendelian Randomization analyses (MR) revealing a risk-increasing causal relationship of sitosterol serum concentrations and CAD, which is partly mediated by cholesterol. Here we report that phytosterols are polygenic traits. MR add evidence of both, direct and indirect causal effects of sitosterol on CAD

    Novel multiple sclerosis susceptibility loci implicated in epigenetic regulation

    Get PDF
    We conducted a genome-wide association study (GWAS) on multiple sclerosis (MS) susceptibility in German cohorts with 4888 cases and 10,395 controls. In addition to associations within the major histocompatibility complex (MHC) region, 15 non-MHC loci reached genome-wide significance. Four of these loci are novel MS susceptibility loci. They map to the genes L3MBTL3, MAZ, ERG, and SHMT1. The lead variant at SHMT1 was replicated in an independent Sardinian cohort. Products of the genes L3MBTL3, MAZ, and ERG play important roles in immune cell regulation. SHMT1 encodes a serine hydroxymethyltransferase catalyzing the transfer of a carbon unit to the folate cycle. This reaction is required for regulation of methylation homeostasis, which is important for establishment and maintenance of epigenetic signatures. Our GWAS approach in a defined population with limited genetic substructure detected associations not found in larger, more heterogeneous cohorts, thus providing new clues regarding MS pathogenesis

    Integrative clinical transcriptome analysis reveals TMPRSS2‐ERG dependency of prognostic biomarkers in prostate adenocarcinoma

    Get PDF
    In prostate adenocarcinoma (PCa), distinction between indolent and aggressive disease is challenging. Around 50% of PCa are characterized by TMPRSS2‐ERG (T2E)‐fusion oncoproteins defining two molecular subtypes (T2E‐positive/negative). However, current prognostic tests do not differ between both molecular subtypes, which might affect outcome prediction. To investigate gene‐signatures associated with metastasis in T2E‐positive and T2E‐negative PCa independently, we integrated tumor transcriptomes and clinicopathological data of two cohorts (total n = 783), and analyzed metastasis‐associated gene‐signatures regarding the T2E‐status. Here, we show that the prognostic value of biomarkers in PCa critically depends on the T2E‐status. Using gene‐set enrichment analyses, we uncovered that metastatic T2E‐positive and T2E‐negative PCa are characterized by distinct gene‐signatures. In addition, by testing genes shared by several functional gene‐signatures for their association with event‐free survival in a validation cohort (n = 272), we identified five genes (ASPN, BGN, COL1A1, RRM2 and TYMS)—three of which are included in commercially available prognostic tests—whose high expression was significantly associated with worse outcome exclusively in T2E‐negative PCa. Among these genes, RRM2 and TYMS were validated by immunohistochemistry in another validation cohort (n = 135), and several of them proved to add prognostic information to current clinicopathological predictors, such as Gleason score, exclusively for T2E‐negative patients. No prognostic biomarkers were identified exclusively for T2E‐positive tumors. Collectively, our study discovers that the T2E‐status, which is per se not a strong prognostic biomarker, crucially determines the prognostic value of other biomarkers. Our data suggest that the molecular subtype needs to be considered when applying prognostic biomarkers for outcome prediction in PCa
    corecore