52 research outputs found
An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics
For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale. Analysis of clinicopathologic annotations for over 11,000 cancer patients in the TCGA program leads to the generation of TCGA Clinical Data Resource, which provides recommendations of clinical outcome endpoint usage for 33 cancer types
Integrated analysis of RNA and DNA from the phase III trial CALGB 40601 identifies predictors of response to trastuzumab-based neoadjuvant chemotherapy in HER2-positive breast cancer
Purpose: Response to a complex trastuzumab-based regimen is affected by multiple features of the tumor and its microenvironment. Developing a predictive algorithm is key to optimizing HER2-targeting therapy. Experimental Design: We analyzed 137 pretreatment tumors with mRNA-seq and DNA exome sequencing from CALGB 40601, a neoadjuvant phase III trial of paclitaxel plus trastuzumab with or without lapatinib in stage II to III HER2-positive breast cancer. We adopted an Elastic Net regularized regression approach that controls for covarying features within high-dimensional data. First, we applied 517 known gene expression signatures to develop an Elastic Net model to predict pCR, which we validated on 143 samples from four independent trials. Next, we performed integrative analyses incorporating clinicopathologic information with somatic mutation status, DNA copy number alterations (CNA), and gene signatures. Results: The Elastic Net model using only gene signatures predicted pCR in the validation sets (AUC ¼ 0.76). Integrative analyses showed that models containing gene signatures, clinical features, and DNA information were better pCR predictors than models containing a single data type. Frequently selected variables from the multiplatform models included amplifications of chromosome 6p, TP53 mutation, HER2-enriched subtype, and immune signatures. Variables predicting resistance included Luminal/ERþ features. Conclusions: Models using RNA only, as well as integrated RNA and DNA models, can predict pCR with improved accuracy over clinical variables. Somatic DNA alterations (mutation, CNAs), tumor molecular subtype (HER2E, Luminal), and the microenvironment (immune cells) were independent predictors of response to trastuzumab and paclitaxel-based regimens. This highlights the complexity of predicting response in HER2-positive breast cancer
The prognostic significance of low-frequency somatic mutations in metastatic cutaneous melanoma
Background: Little is known about the prognostic significance of somatically mutated genes in metastatic melanoma (MM). We have employed a combined clinical and bioinformatics approach on tumor samples from cutaneous melanoma (SKCM) as part of The Cancer Genome Atlas project (TCGA) to identify mutated genes with potential clinical relevance. Methods: After limiting our DNA sequencing analysis to MM samples (n = 356) and to the CANCER CENSUS gene list, we filtered out mutations with low functional significance (snpEFF). We performed Cox analysis on 53 genes that were mutated in ≥3% of samples, and had ≥50% difference in incidence of mutations in deceased subjects versus alive subjects. Results: Four genes were potentially prognostic [RAC1, FGFR1, CARD11, CIITA; false discovery rate (FDR) 75% of the samples that exhibited corresponding DNA mutations. The low frequency, UV signature type and RNA expression of the 22 genes in MM samples were confirmed in a separate multi-institution validation cohort (n = 413). An underpowered analysis within a subset of this validation cohort with available patient follow-up (n = 224) showed that somatic mutations in SPEN and RAC1 reached borderline prognostic significance [log-rank favorable (p = 0.09) and adverse (p = 0.07), respectively]. Somatic mutations in SPEN, and to a lesser extent RAC1, were not associated with definite gene copy number or RNA expression alterations. High (>2+) nuclear plus cytoplasmic expression intensity for SPEN was associated with longer melanoma-specific overall survival (OS) compared to lower (≤ 2+) nuclear intensity (p = 0.048). We conclude that expressed somatic mutations in infrequently mutated genes beyond the well-characterized ones (e.g., BRAF, RAS, CDKN2A, PTEN, TP53), such as RAC1 and SPEN, may have prognostic significance in MM
FOXA1 and adaptive response determinants to HER2 targeted therapy in TBCRC 036
Inhibition of the HER2/ERBB2 receptor is a keystone to treating HER2-positive malignancies, particularly breast cancer, but a significant fraction of HER2-positive (HER2+) breast cancers recur or fail to respond. Anti-HER2 monoclonal antibodies, like trastuzumab or pertuzumab, and ATP active site inhibitors like lapatinib, commonly lack durability because of adaptive changes in the tumor leading to resistance. HER2+ cell line responses to inhibition with lapatinib were analyzed by RNAseq and ChIPseq to characterize transcriptional and epigenetic changes. Motif analysis of lapatinib-responsive genomic regions implicated the pioneer transcription factor FOXA1 as a mediator of adaptive responses. Lapatinib in combination with FOXA1 depletion led to dysregulation of enhancers, impaired adaptive upregulation of HER3, and decreased proliferation. HER2-directed therapy using clinically relevant drugs (trastuzumab with or without lapatinib or pertuzumab) in a 7-day clinical trial designed to examine early pharmacodynamic response to antibody-based anti-HER2 therapy showed reduced FOXA1 expression was coincident with decreased HER2 and HER3 levels, decreased proliferation gene signatures, and increased immune gene signatures. This highlights the importance of the immune response to anti-HER2 antibodies and suggests that inhibiting FOXA1-mediated adaptive responses in combination with HER2 targeting is a potential therapeutic strategy
CALGB 40603 (Alliance): Long-Term Outcomes and Genomic Correlates of Response and Survival After Neoadjuvant Chemotherapy With or Without Carboplatin and Bevacizumab in Triple-Negative Breast Cancer
PURPOSECALGB 40603 (NCT00861705), a 2 × 2 randomized phase II trial, demonstrated that adding carboplatin or bevacizumab to weekly paclitaxel (wP) followed by doxorubicin and cyclophosphamide significantly increased the pathologic complete response (pCR) rate in stage II-III triple-negative breast cancer. We now report long-term outcomes (LTOs) and correlative science end points.PATIENTS AND METHODSThe Kaplan-Meier method was used to estimate LTOs in 443 patients who initiated study treatment. Log-rank tests and Cox proportional hazards models evaluated the impact of clinical characteristics, pathologic response, calculated residual cancer burden (RCB) in patients with residual disease (RD), treatment assignment, and dose delivery during wP on LTOs, including event-free survival (EFS). Genomic predictors of treatment response and outcomes were assessed on pretreatment tumor samples by mRNA sequencing.RESULTSAmong baseline characteristics, only the clinical stage was associated with LTOs. At a median follow-up of 7.9 years, LTOs were not significantly improved with either carboplatin or bevacizumab, overall or in patients with basal-like subtype cancers by genomic analysis. Patients with pCR (n = 205, 46.3%) had significantly higher 5-year EFS (85.5% v 56.6%, log-rank P <.0001) and overall survival (87.9% v 63.4%, P <.0001) rates compared with patients with RD, even those with RCB class I. Among clinical and genomic features, evidence of immune activation, including tumor-infiltrating lymphocytes and low B-cell receptor evenness, was associated with pCR and improved EFS.CONCLUSIONDespite higher pCR rates, neither carboplatin nor bevacizumab appeared to improve LTOs although the study was not powered to assess these secondary end points. pCR was associated with superior LTOs even when compared with minimal RD. Markers of immune activation in pretreatment tumor biopsies were independently associated with higher pCR rates and improved survival
Driver Fusions and Their Implications in the Development and Treatment of Human Cancers.
Gene fusions represent an important class of somatic alterations in cancer. We systematically investigated fusions in 9,624 tumors across 33 cancer types using multiple fusion calling tools. We identified a total of 25,664 fusions, with a 63% validation rate. Integration of gene expression, copy number, and fusion annotation data revealed that fusions involving oncogenes tend to exhibit increased expression, whereas fusions involving tumor suppressors have the opposite effect. For fusions involving kinases, we found 1,275 with an intact kinase domain, the proportion of which varied significantly across cancer types. Our study suggests that fusions drive the development of 16.5% of cancer cases and function as the sole driver in more than 1% of them. Finally, we identified druggable fusions involving genes such as TMPRSS2, RET, FGFR3, ALK, and ESR1 in 6.0% of cases, and we predicted immunogenic peptides, suggesting that fusions may provide leads for targeted drug and immune therapy
An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics
For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale. Analysis of clinicopathologic annotations for over 11,000 cancer patients in the TCGA program leads to the generation of TCGA Clinical Data Resource, which provides recommendations of clinical outcome endpoint usage for 33 cancer types
Perspective on Oncogenic Processes at the End of the Beginning of Cancer Genomics
The Cancer Genome Atlas (TCGA) has catalyzed systematic characterization of diverse genomic alterations underlying human cancers. At this historic junction marking the completion of genomic characterization of over 11,000 tumors from 33 cancer types, we present our current understanding of the molecular processes governing oncogenesis. We illustrate our insights into cancer through synthesis of the findings of the TCGA PanCancer Atlas project on three facets of oncogenesis: (1) somatic driver mutations, germline pathogenic variants, and their interactions in the tumor; (2) the influence of the tumor genome and epigenome on transcriptome and proteome; and (3) the relationship between tumor and the microenvironment, including implications for drugs targeting driver events and immunotherapies. These results will anchor future characterization of rare and common tumor types, primary and relapsed tumors, and cancers across ancestry groups and will guide the deployment of clinical genomic sequencing
The Cancer Genome Atlas Comprehensive Molecular Characterization of Renal Cell Carcinoma
Renal cell carcinoma(RCC) is not a single disease, but several histologically defined cancers with different genetic drivers, clinical courses, and therapeutic responses. The current study evaluated 843 RCC from the three major histologic subtypes, including 488 clear cell RCC, 274 papillary RCC, and 81 chromophobe RCC. Comprehensive genomic and phenotypic analysis of the RCC subtypes reveals distinctive features of each subtype that provide the foundation for the development of subtype-specific therapeutic and management strategies for patients affected with these cancers. Somatic alteration of BAP1, PBRM1, and PTEN and altered metabolic pathways correlated with subtype-specific decreased survival, while CDKN2A alteration, increased DNA hypermethylation, and increases in the immune-related Th2 gene expression signature correlated with decreased survival within all major histologic subtypes. CIMP-RCC demonstrated an increased immune signature, and a uniform and distinct metabolic expression pattern identified a subset of metabolically divergent (MD) ChRCC that associated with extremely poor survival
Scalable Open Science Approach for Mutation Calling of Tumor Exomes Using Multiple Genomic Pipelines
The Cancer Genome Atlas (TCGA) cancer genomics dataset includes over 10,000 tumor-normal exome pairs across 33 different cancer types, in total >400 TB of raw data files requiring analysis. Here we describe the Multi-Center Mutation Calling in Multiple Cancers project, our effort to generate a comprehensive encyclopedia of somatic mutation calls for the TCGA data to enable robust cross-tumor-type analyses. Our approach accounts for variance and batch effects introduced by the rapid advancement of DNA extraction, hybridization-capture, sequencing, and analysis methods over time. We present best practices for applying an ensemble of seven mutation-calling algorithms with scoring and artifact filtering. The dataset created by this analysis includes 3.5 million somatic variants and forms the basis for PanCan Atlas papers. The results have been made available to the research community along with the methods used to generate them. This project is the result of collaboration from a number of institutes and demonstrates how team science drives extremely large genomics projects
- …
