3,105 research outputs found

    Joint and individual analysis of breast cancer histologic images and genomic covariates

    Get PDF
    A key challenge in modern data analysis is understanding connections between complex and differing modalities of data. For example, two of the main approaches to the study of breast cancer are histopathology (analyzing visual characteristics of tumors) and genetics. While histopathology is the gold standard for diagnostics and there have been many recent breakthroughs in genetics, there is little overlap between these two fields. We aim to bridge this gap by developing methods based on Angle-based Joint and Individual Variation Explained (AJIVE) to directly explore similarities and differences between these two modalities. Our approach exploits Convolutional Neural Networks (CNNs) as a powerful, automatic method for image feature extraction to address some of the challenges presented by statistical analysis of histopathology image data. CNNs raise issues of interpretability that we address by developing novel methods to explore visual modes of variation captured by statistical algorithms (e.g. PCA or AJIVE) applied to CNN features. Our results provide many interpretable connections and contrasts between histopathology and genetics

    DNA polymerase B deficiency is linked to aggressive breast cancer: a comprehensive analysis of gene copy number, mRNA and protein expression in multiple cohorts

    Get PDF
    Short arm of chromosome 8 is a hot spot for chromosomal breaks, losses and amplifications in breast cancer. Although such genetic changes may have phenotypic consequences, the identity of candidate gene(s) remains to be clearly defined. Pol β gene is localized to chromosome 8p12 - p11 and encodes a key DNA base excision repair protein. Pol β may be a tumour suppressor and involved in breast cancer pathogenesis. We conducted the first and the largest study to comprehensively evaluate pol β in breast cancer. We investigated pol β gene copy number changes in two cohorts (n=128 & n=1952), pol β mRNA expression in two cohorts (n=249 & n=1952) and pol β protein expression in two cohorts (n=1406 & n=252). Artificial neural network analysis for pol β interacting genes was performed in 249 tumours. For mechanistic insights, pol β gene copy number changes, mRNA and protein levels were investigated together in 1 28 tumours and validated in 1952 tumours. Low pol β mRNA expression as well as low pol β protein expression was associated high grade, lymph node positivity, pleomorphism, triple negative, basal - like phenotypes and poor survival (ps<0.001). In oestrogen receptor (ER) positive sub - group that received tamoxifen, low pol β protein remains associated with aggressive phenotype and poor survival (ps<0.001). Artificial neural network analysis revealed ER as a top pol β interacting gene. Mechanistically, there was strong positive correlation between pol β gene copy number changes and pol β mRNA expression (p<0.0000001) and between pol β mRNA and pol β protein expression (p<0.0000001). This is the first study to provide evidence that pol β deficiency is linked to aggressive breast cancer and may have prognostic and predictive significance in patients

    Adverse prognostic and predictive significance of low DNA-dependent protein kinase catalytic subunit (DNA-PKcs) expression in early-stage breast cancers

    Get PDF
    Background: DNA-dependent protein kinase catalytic subunit (DNA-PKcs), a serine threonine kinase belonging to the PIKK family (phosphoinositide 3-kinase-like-family of protein kinase), is a critical component of the non-homologous end joining (NHEJ) pathway required for the repair of DNA double strand breaks. DNA-PKcs may be involved in breast cancer pathogenesis. Methods: We evaluated clinicopathological significance of DNA-PKcs protein expression in 1161 tumours and DNA-PKcs mRNA expression in 1950 tumours. We correlated DNA-PKcs to other markers of aggressive phenotypes, DNA repair, apoptosis and cell cycle regulation. Results: Low DNA-PKcs protein expression was associated with higher tumour grade, higher mitotic index, tumour de-differentiation and tumour type (ps<0.05). Absence of BRCA1, low XRCC1/SMUG1/APE1/Polβ were also more likely in low DNA-PKcs expressing tumours (ps<0.05). Low DNA-PKcs protein expression was significantly associated with worse breast cancer specific survival (BCCS) in univariate and multivariate analysis (ps<0.01). At the mRNA level, low DNA-PKcs was associated with PAM50.Her2 and PAM50.LumA molecular phenotypes (ps<0.01) and poor BCSS. In patients with ER positive tumours who received endocrine therapy, low DNA-PKcs (protein and mRNA) was associated with poor survival. In ER negative patients, low DNA-PKcs mRNA remains significantly associated with adverse outcome. Conclusions: Our study suggests that low DNA-PKcs expression may have prognostic and predictive significance in breast cancers

    An evaluation of DNA-damage response and cell-cycle pathways for breast cancer classification

    Get PDF
    Accurate subtyping or classification of breast cancer is important for ensuring proper treatment of patients and also for understanding the molecular mechanisms driving this disease. While there have been several gene signatures proposed in the literature to classify breast tumours, these signatures show very low overlaps, different classification performance, and not much relevance to the underlying biology of these tumours. Here we evaluate DNA-damage response (DDR) and cell cycle pathways, which are critical pathways implicated in a considerable proportion of breast tumours, for their usefulness and ability in breast tumour subtyping. We think that subtyping breast tumours based on these two pathways could lead to vital insights into molecular mechanisms driving these tumours. Here, we performed a systematic evaluation of DDR and cell-cycle pathways for subtyping of breast tumours into the five known intrinsic subtypes. Homologous Recombination (HR) pathway showed the best performance in subtyping breast tumours, indicating that HR genes are strongly involved in all breast tumours. Comparisons of pathway based signatures and two standard gene signatures supported the use of known pathways for breast tumour subtyping. Further, the evaluation of these standard gene signatures showed that breast tumour subtyping, prognosis and survival estimation are all closely related. Finally, we constructed an all-inclusive super-signature by combining (union of) all genes and performing a stringent feature selection, and found it to be reasonably accurate and robust in classification as well as prognostic value. Adopting DDR and cell cycle pathways for breast tumour subtyping achieved robust and accurate breast tumour subtyping, and constructing a super-signature which contains feature selected mix of genes from these molecular pathways as well as clinical aspects is valuable in clinical practice.Comment: 28 pages, 7 figures, 6 table

    Genomic and protein expression analysis reveals flap endonuclease 1 (FEN1) as a key biomarker in breast and ovarian cancer

    Get PDF
    FEN1 has key roles in Okazaki fragment maturation during replication, long patch base excision repair, rescue of stalled replication forks, maintenance of telomere stability and apoptosis. FEN1 may be dysregulated in breast and ovarian cancers and have clinicopathological significance in patients. We comprehensively investigated FEN1 mRNA expression in multiple cohorts of breast cancer [training set (128), test set (249), external validation (1952)]. FEN1 protein expression was evaluated in 568 oestrogen receptor (ER) negative breast cancers, 894 ER positive breast cancers and 156 ovarian epithelial cancers. FEN1 mRNA overexpression was highly significantly associated with high grade (p= 4.89 x 10 - 57) , high mitotic index (p= 5.25 x 10 - 28), pleomorphism (p= 6.31 x 10-19), ER negative (p= 9.02 x 10-35 ), PR negative (p= 9.24 x 10-24 ), triple negative phenotype (p= 6.67 x 10-21) , PAM50.Her2 (p=5.19 x 10-13 ), PAM50.Basal (p=2.7 x 10-41), PAM50.LumB (p=1.56 x 10-26), integrative molecular cluster 1 (intClust.1) ( p=7.47 x 10-12), intClust.5 (p=4.05 x 10-12) and intClust. 10 (p=7.59 x 10-38 ) breast cancers. FEN1 mRNA overexpression is associated with poor breast cancer specific survival in univariate (p=4.4 x 10-16) and multivariate analysis (p=9.19 x 10-7). At the protein level, in ER positive tumours , FEN1 overexpression remains significantly linked to high grade, high mitotic index and pleomorphism (ps< 0.01). In ER negative tumours, high FEN1 is significantly associated with pleomorphism, tumour type, lymphovascular invasion, triple negative phenotype, EGFR and HER2 expression (ps<0.05). In ER positive as well as in ER negative tumours, FEN1 protein over expression is associated with poor survival in univariate and multivariate analysis (ps<0.01). In ovarian epithelial cancers , similarly, FEN1 overexpression is associated with high grade, high stage and poor survival (ps<0.05). We conclude that FEN1 is a promising biomarker in breast and ovarian epithelial cancer

    Proliferation and estrogen signaling can distinguish patients at risk for early versus late relapse among estrogen receptor positive breast cancers

    Get PDF
    Introduction: We examined if a combination of proliferation markers and estrogen receptor (ER) activity could predict early versus late relapses in ER-positive breast cancer and inform the choice and length of adjuvant endocrine therapy. Methods: Baseline affymetrix gene-expression profiles from ER-positive patients who received no systemic therapy (n = 559), adjuvant tamoxifen for 5 years (cohort-1: n = 683, cohort-2: n = 282) and from 58 patients treated with neoadjuvant letrozole for 3 months (gene-expression available at baseline, 14 and 90 days) were analyzed. A proliferation score based on the expression of mitotic kinases (MKS) and an ER-related score (ERS) adopted from Oncotype DX® were calculated. The same analysis was performed using the Genomic Grade Index as proliferation marker and the luminal gene score from the PAM50 classifier as measure of estrogen-related genes. Median values were used to define low and high marker groups and four combinations were created. Relapses were grouped into time cohorts of 0-2.5, 0-5, 5-10 years. Results: In the overall 10 years period, the proportional hazards assumption was violated for several biomarker groups indicating time-dependent effects. In tamoxifen-treated patients Low-MKS/Low-ERS cancers had continuously increasing risk of relapse that was higher after 5 years than Low-MKS/High-ERS cancers [0 to 10 year, HR 3.36; p = 0.013]. High-MKS/High-ERS cancers had low risk of early relapse [0-2.5 years HR 0.13; p = 0.0006], but high risk of late relapse which was higher than in the High-MKS/Low-ERS group [after 5 years HR 3.86; p = 0.007]. The High-MKS/Low-ERS subset had most of the early relapses [0 to 2.5 years, HR 6.53; p < 0.0001] especially in node negative tumors and showed minimal response to neoadjuvant letrozole. These findings were qualitatively confirmed in a smaller independent cohort of tamoxifen-treated patients. Using different biomarkers provided similar results. Conclusions: Early relapses are highest in highly proliferative/low-ERS cancers, in particular in node negative tumors. Relapses occurring after 5 years of adjuvant tamoxifen are highest among the highly-proliferative/high-ERS tumors although their risk of recurrence is modest in the first 5 years on tamoxifen. These tumors could be the best candidates for extended endocrine therapy

    A transfer-learning approach to feature extraction from cancer transcriptomes with deep autoencoders

    Get PDF
    Publicado en Lecture Notes in Computer Science.The diagnosis and prognosis of cancer are among the more challenging tasks that oncology medicine deals with. With the main aim of fitting the more appropriate treatments, current personalized medicine focuses on using data from heterogeneous sources to estimate the evolu- tion of a given disease for the particular case of a certain patient. In recent years, next-generation sequencing data have boosted cancer prediction by supplying gene-expression information that has allowed diverse machine learning algorithms to supply valuable solutions to the problem of cancer subtype classification, which has surely contributed to better estimation of patient’s response to diverse treatments. However, the efficacy of these models is seriously affected by the existing imbalance between the high dimensionality of the gene expression feature sets and the number of sam- ples available for a particular cancer type. To counteract what is known as the curse of dimensionality, feature selection and extraction methods have been traditionally applied to reduce the number of input variables present in gene expression datasets. Although these techniques work by scaling down the input feature space, the prediction performance of tradi- tional machine learning pipelines using these feature reduction strategies remains moderate. In this work, we propose the use of the Pan-Cancer dataset to pre-train deep autoencoder architectures on a subset com- posed of thousands of gene expression samples of very diverse tumor types. The resulting architectures are subsequently fine-tuned on a col- lection of specific breast cancer samples. This transfer-learning approach aims at combining supervised and unsupervised deep learning models with traditional machine learning classification algorithms to tackle the problem of breast tumor intrinsic-subtype classification.Universidad de Málaga. Campus de Excelencia Internacional Andalucía Tech
    • …
    corecore