70 research outputs found

    Super Learner

    Get PDF
    Previous articles (van der Laan and Dudoit (2003); van der Laan et al. (2006); Sinisi et al. (2007)) advertised and theoretically validated the use of cross-validation to select among many candidate estimators to compute a so called super learner which outperforms any of the given candidate estimators. The theoretical basis was provided for this super learner based on oracle results for the cross-validation selector (e.g., van der Laan and Dudoit (2003); van der Laan et al. (2006)) and in Sinisi et al. (2007). In addition, these papers contained a practical demonstration of the adaptivity of this so called super learner in the context of prediction of the fitness of the HIV virus as a function of its mutations. This article proposes a fast algorithm for constructing a super learner in prediction which uses V-fold cross-validation to select a functional form of an initial set of candidate predictors according to a parametric or semi-parametric model, or possibly, data adaptively. The paper contains a proof that the resulting super learner performs asymptotically as well as the oracle selector among the continuum of estimators defined by the (semi-)parametric functional forms of the initial set of candidate estimators. This approach also yields a new class of cross-validation methods to select among a family of candidate estimators by formulating the minimization of the cross-validated risk over the family of candidate estimators as a new least squares regression problem which itself can be carried out with any type of parametric or nonparametric regression methodology (e.g. using cross-validation itself), thereby preventing over-fitting of the cross-validated risk. Simulations and data analysis suggest this new proposed super learner superior to competing methods. This approach for construction of a super learner generalizes to any parameter which can be defined as a minimizer of a loss function

    Interaction between the microbiome and TP53 in human lung cancer.

    Get PDF
    BACKGROUND: Lung cancer is the leading cancer diagnosis worldwide and the number one cause of cancer deaths. Exposure to cigarette smoke, the primary risk factor in lung cancer, reduces epithelial barrier integrity and increases susceptibility to infections. Herein, we hypothesize that somatic mutations together with cigarette smoke generate a dysbiotic microbiota that is associated with lung carcinogenesis. Using lung tissue from 33 controls and 143 cancer cases, we conduct 16S ribosomal RNA (rRNA) bacterial gene sequencing, with RNA-sequencing data from lung cancer cases in The Cancer Genome Atlas serving as the validation cohort. RESULTS: Overall, we demonstrate a lower alpha diversity in normal lung as compared to non-tumor adjacent or tumor tissue. In squamous cell carcinoma specifically, a separate group of taxa are identified, in which Acidovorax is enriched in smokers. Acidovorax temporans is identified within tumor sections by fluorescent in situ hybridization and confirmed by two separate 16S rRNA strategies. Further, these taxa, including Acidovorax, exhibit higher abundance among the subset of squamous cell carcinoma cases with TP53 mutations, an association not seen in adenocarcinomas. CONCLUSIONS: The results of this comprehensive study show both microbiome-gene and microbiome-exposure interactions in squamous cell carcinoma lung cancer tissue. Specifically, tumors harboring TP53 mutations, which can impair epithelial function, have a unique bacterial consortium that is higher in relative abundance in smoking-associated tumors of this type. Given the significant need for clinical diagnostic tools in lung cancer, this study may provide novel biomarkers for early detection

    Triple-Negative Breast Cancer Risk Genes Identified by Multigene Hereditary Cancer Panel Testing

    Get PDF
    Background: Germline genetic testing with hereditary cancer gene panels can identify women at increased risk of breast cancer. However, those at increased risk of triple-negative (estrogen receptor-negative, progesterone receptor-negative, human epidermal growth factor receptor-negative) breast cancer (TNBC) cannot be identified because predisposition genes for TNBC, other than BRCA1, have not been established. The aim of this study was to define the cancer panel genes associated with increased risk of TNBC. Methods: Multigene panel testing for 21 genes in 8753 TNBC patients was performed by a clinical testing laboratory, and testing for 17 genes in 2148 patients was conducted by a Triple Negative Breast Cancer Consortium(TNBCC) of research studies. Associations between deleterious mutations in cancer predisposition genes and TNBC were evaluated using results from TNBC patients and reference controls. Results: Germline pathogenic variants in BARD1, BRCA1, BRCA2, PALB2, and RAD51D were associated with high risk (odds ratio > 5.0) of TNBC and greater than 20% lifetime risk for overall breast cancer among Caucasians. Pathogenic variants in BRIP1, RAD51C, and TP53 were associated with moderate risk (odds ratio > 2) of TNBC. Similar trends were observed for the African American population. Pathogenic variants in these TNBC genes were detected in 12.0% (3.7% non-BRCA1/2) of all participants. Conclusions: Multigene hereditary cancer panel testing can identify women with elevated risk of TNBC due to mutations in BARD1, BRCA1, BRCA2, PALB2, and RAD51D. These women can potentially benefit from improved screening, risk management, and cancer prevention strategies. Patients with mutations may also benefit from specific targeted therapeutic strategies.Peer reviewe

    Association of the CHEK2 c.1100delC variant, radiotherapy, and systemic treatment with contralateral breast cancer risk and breast cancer-specific survival

    Get PDF
    Background: Breast cancer (BC) patients with a germline CHEK2 c.1100delC variant have an increased risk of contralateral BC (CBC) and worse BC-specific survival (BCSS) compared to non-carriers.Aim: To assessed the associations of CHEK2 c.1100delC, radiotherapy, and systemic treatment with CBC risk and BCSS.Methods: Analyses were based on 82,701 women diagnosed with a first primary invasive BC including 963 CHEK2 c.1100delC carriers; median follow-up was 9.1 years. Differential associations with treatment by CHEK2 c.1100delC status were tested by including interaction terms in a multivariable Cox regression model. A multi-state model was used for further insight into the relation between CHEK2 c.1100delC status, treatment, CBC risk and death. Results: There was no evidence for differential associations of therapy with CBC risk by CHEK2 c.1100delC status. The strongest association with reduced CBC risk was observed for the combination of chemotherapy and endocrine therapy [HR (95% CI): 0.66 (0.55-0.78)]. No association was observed with radiotherapy.Results from the multi-state model showed shorter BCSS for CHEK2 c.1100delC carriers versus non-carriers also after accounting for CBC occurrence [HR (95% CI): 1.30 (1.09-1.56)].Conclusion: Systemic therapy was associated with reduced CBC risk irrespective of CHEK2 c.1100delC status. Moreover, CHEK2 c.1100delC carriers had shorter BCSS, which appears not to be fully explained by their CBC risk.Peer reviewe

    Imaging biomarker roadmap for cancer studies.

    Get PDF
    Imaging biomarkers (IBs) are integral to the routine management of patients with cancer. IBs used daily in oncology include clinical TNM stage, objective response and left ventricular ejection fraction. Other CT, MRI, PET and ultrasonography biomarkers are used extensively in cancer research and drug development. New IBs need to be established either as useful tools for testing research hypotheses in clinical trials and research studies, or as clinical decision-making tools for use in healthcare, by crossing 'translational gaps' through validation and qualification. Important differences exist between IBs and biospecimen-derived biomarkers and, therefore, the development of IBs requires a tailored 'roadmap'. Recognizing this need, Cancer Research UK (CRUK) and the European Organisation for Research and Treatment of Cancer (EORTC) assembled experts to review, debate and summarize the challenges of IB validation and qualification. This consensus group has produced 14 key recommendations for accelerating the clinical translation of IBs, which highlight the role of parallel (rather than sequential) tracks of technical (assay) validation, biological/clinical validation and assessment of cost-effectiveness; the need for IB standardization and accreditation systems; the need to continually revisit IB precision; an alternative framework for biological/clinical validation of IBs; and the essential requirements for multicentre studies to qualify IBs for clinical use.Development of this roadmap received support from Cancer Research UK and the Engineering and Physical Sciences Research Council (grant references A/15267, A/16463, A/16464, A/16465, A/16466 and A/18097), the EORTC Cancer Research Fund, and the Innovative Medicines Initiative Joint Undertaking (grant agreement number 115151), resources of which are composed of financial contribution from the European Union's Seventh Framework Programme (FP7/2007-2013) and European Federation of Pharmaceutical Industries and Associations (EFPIA) companies' in kind contribution

    ARTICLEAssociation of the CHEK2 c.1100delC variant, radiotherapy, and systemic treatment with contralateral breast cancer risk and breast cancer-specific survival

    Get PDF
    Aim To assessed the associations of CHEK2 c.1100delC, radiotherapy, and systemic treatment with CBC risk and BCSS. Methods Analyses were based on 82,701 women diagnosed with a first primary invasive BC including 963 CHEK2 c.1100delC carriers; median follow-up was 9.1 years. Differential associations with treatment by CHEK2 c.1100delC status were tested by including interaction terms in a multivariable Cox regression model. A multi-state model was used for further insight into the relation between CHEK2 c.1100delC status, treatment, CBC risk and death. Results There was no evidence for differential associations of therapy with CBC risk by CHEK2 c.1100delC status. The strongest association with reduced CBC risk was observed for the combination of chemotherapy and endocrine therapy [HR (95% CI): 0.66 (0.55–0.78)]. No association was observed with radiotherapy. Results from the multi-state model showed shorter BCSS for CHEK2 c.1100delC carriers versus non-carriers also after accounting for CBC occurrence [HR (95% CI): 1.30 (1.09–1.56)]. Conclusion Systemic therapy was associated with reduced CBC risk irrespective of CHEK2 c.1100delC status. Moreover, CHEK2 c.1100delC carriers had shorter BCSS, which appears not to be fully explained by their CBC risk

    Joint association of mammographic density adjusted for age and body mass index and polygenic risk score with breast cancer risk

    Get PDF
    Background Mammographic breast density, adjusted for age and body mass index, and a polygenic risk score (PRS), comprised of common genetic variation, are both strong risk factors for breast cancer and increase discrimination of risk models. Understanding their joint contribution will be important to more accurately predict risk. Methods Using 3628 breast cancer cases and 5126 controls of European ancestry from eight case-control studies, we evaluated joint associations of a 77-single nucleotide polymorphism (SNP) PRS and quantitative mammographic density measures with breast cancer. Mammographic percent density and absolute dense area were evaluated using thresholding software and examined as residuals after adjusting for age, 1/BMI, and study. PRS and adjusted density phenotypes were modeled both continuously (per 1 standard deviation, SD) and categorically. We fit logistic regression models and tested the null hypothesis of multiplicative joint associations for PRS and adjusted density measures using likelihood ratio and global and tail-based goodness of fit tests within the subset of six cohort or population-based studies. Results Adjusted percent density (odds ratio (OR) = 1.45 per SD, 95% CI 1.38–1.52), adjusted absolute dense area (OR = 1.34 per SD, 95% CI 1.28–1.41), and the 77-SNP PRS (OR = 1.52 per SD, 95% CI 1.45–1.59) were associated with breast cancer risk. There was no evidence of interaction of the PRS with adjusted percent density or dense area on risk of breast cancer by either the likelihood ratio (P > 0.21) or goodness of fit tests (P > 0.09), whether assessed continuously or categorically. The joint association (OR) was 2.60 in the highest categories of adjusted PD and PRS and 0.34 in the lowest categories, relative to women in the second density quartile and middle PRS quintile. Conclusions The combined associations of the 77-SNP PRS and adjusted density measures are generally well described by multiplicative models, and both risk factors provide independent information on breast cancer risk.Includes Cancer Research UK, Horizon 2020 and FP

    Combined Associations of a Polygenic Risk Score and Classical Risk Factors With Breast Cancer Risk.

    Get PDF
    We evaluated the joint associations between a new 313-variant PRS (PRS313) and questionnaire-based breast cancer risk factors for women of European ancestry, using 72 284 cases and 80 354 controls from the Breast Cancer Association Consortium. Interactions were evaluated using standard logistic regression and a newly developed case-only method for breast cancer risk overall and by estrogen receptor status. After accounting for multiple testing, we did not find evidence that per-standard deviation PRS313 odds ratio differed across strata defined by individual risk factors. Goodness-of-fit tests did not reject the assumption of a multiplicative model between PRS313 and each risk factor. Variation in projected absolute lifetime risk of breast cancer associated with classical risk factors was greater for women with higher genetic risk (PRS313 and family history) and, on average, 17.5% higher in the highest vs lowest deciles of genetic risk. These findings have implications for risk prevention for women at increased risk of breast cancer
    corecore