172 research outputs found
Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference
An important feature of Bayesian statistics is the opportunity to do
sequential inference: the posterior distribution obtained after seeing a
dataset can be used as prior for a second inference. However, when Monte Carlo
sampling methods are used for inference, we only have a set of samples from the
posterior distribution. To do sequential inference, we then either have to
evaluate the second posterior at only these locations and reweight the samples
accordingly, or we can estimate a functional description of the posterior
probability distribution from the samples and use that as prior for the second
inference. Here, we investigated to what extent we can obtain an accurate joint
posterior from two datasets if the inference is done sequentially rather than
jointly, under the condition that each inference step is done using Monte Carlo
sampling. To test this, we evaluated the accuracy of kernel density estimates,
Gaussian mixtures, vine copulas and Gaussian processes in approximating
posterior distributions, and then tested whether these approximations can be
used in sequential inference. In low dimensionality, Gaussian processes are
more accurate, whereas in higher dimensionality Gaussian mixtures or vine
copulas perform better. In our test cases, posterior approximations are
preferable over direct sample reweighting, although joint inference is still
preferable over sequential inference. Since the performance is case-specific,
we provide an R package mvdens with a unified interface for the density
approximation methods
Heterofusion:Fusing genomics data of different measurement scales
In systems biology, it is becoming increasingly common to measure biochemical entities at different levels of the same biological system. Hence, data fusion problems are abundant in the life sciences. With the availability of a multitude of measuring techniques, one of the central problems is the heterogeneity of the data. In this paper, we discuss a specific form of heterogeneity, namely, that of measurements obtained at different measurement scales, such as binary, ordinal, interval, and ratioāscaled variables. Three generic fusion approaches are presented of which two are new to the systems biology community. The methods are presented, put in context, and illustrated with a realālife genomics example
Detection of Colorectal Cancer by Serum and Tissue Protein Profiling: A Prospective Study in a Population at Risk
Colorectal cancer (CRC) is the second most common cause of cancer-related death in Europe and its prognosis is largely dependent on stage at diagnosis. Currently, there are no suitable tumour markers for early detection of CRC. In a retrospective study we previously found discriminative CRC serum protein profiles with surface enhanced laser desorption ionisationātime of flight mass spectrometry (SELDI-TOF MS). We now aimed at prospective validation of these profiles. Additionally, we assessed their applicability for follow-up after surgery and investigated tissue protein profiles of patients with CRC and adenomatous polyps (AP). Serum and tissue samples were collected from patients without known malignancy with an indication for colonoscopy and patients with AP and CRC during colonoscopy. Serum samples of controls (CON; n = 359), patients with AP (n = 177) and CRC (n = 73), as well as tissue samples from AP (n = 52) and CRC (n = 47) were analysed as described previously. Peak intensities were compared by non-parametric testing. Discriminative power of differentially expressed proteins was assessed with support vector machines (SVM). We confirmed the decreased serum levels of apolipoprotein C-1 in CRC in the current population. No differences were observed between CON and AP. Apolipoprotein C-I levels did not change significantly within 1 month post-surgery, although a gradual return to normal levels was observed. Several proteins differed between AP and CRC tissue, among which a peak with similar mass as apolipoprotein C-1. This peak was increased in CRC compared to AP. Although we prospectively validated the serum decrease of apolipoprotein C-1 in CRC, serum protein profiles did not yield SVM classifiers with suitable sensitivity and specificity for classification of our patient groups
Genomic patterns resembling BRCA1- and BRCA2-mutated breast cancers predict benefit of intensified carboplatin-based chemotherapy
Introduction: BRCA-mutated breast cancer cells lack the DNA-repair mechanism homologous recombination that is required for error-free DNA double-strand break (DSB) repair. Homologous recombination deficiency (HRD) may cause hypersensitivity to DNA DSB-inducing agents, such as bifunctional alkylating agents and platinum salts. HRD can be caused by BRCA mutations, and by other mechanisms. To identify HRD, studies have focused on triple-negative (TN) breast cancers as these resemble BRCA1-mutated breast cancer closely and might also share this hypersensitivity. However, ways to identify HRD in non-BRCA-mutated, estrogen receptor (ER)-positive breast cancers have remained elusive. The current study provides evidence that genomic patterns resembling BRCA1- or BRCA2- mutated breast cancers can identify breast cancer patients with TN as well as ER-positive, HER2-negative tumors that are sensitive to intensified, DSB-inducing chemotherapy. Methods: Array comparative genomic hybridization (aCGH) was used to classify breast cancers. Patients with tumors with similar aCGH patterns as BRCA1- and/or BRCA2-mutated breast cancers were defined as having a BRCA-like(CGH) status, others as non-BCRA-like(CGH). Stage-III patients (n = 249) had participated in a randomized controlled trial of adjuvant high-dose (HD) cyclophosphamide-thiotepa-carboplatin (CTC) versus 5-fluorouracil-epirubicin-cyclophosphamide (FE90C) chemotherapy. Results: Among patients with BRCA-like(CGH) tumors (81/249, 32%), a significant benefit of HD-CTC compared to FE90C was observed regarding overall survival (adjusted hazard ratio 0.19, 95% CI: 0.08 to 0.48) that was not seen for patients with non-BRCA-like(CGH) tumors (adjusted hazard ratio 0.90, 95% CI: 0.53 to 1.54) (P = 0.004). Half of all BRCA-like(CGH) tumors were ER-positive. Conclusions: Distinct aCGH patterns differentiated between HER2-negative patients with a markedly improved outcome after adjuvant treatment with an intensified DNA-DSB-inducing regimen (BRCA-like(CGH) patients) and those without benefit (non-BRCA-like(CGH) patients)
A critical evaluation of network and pathway based classifiers for outcome prediction in breast cancer
Recently, several classifiers that combine primary tumor data, like gene
expression data, and secondary data sources, such as protein-protein
interaction networks, have been proposed for predicting outcome in breast
cancer. In these approaches, new composite features are typically constructed
by aggregating the expression levels of several genes. The secondary data
sources are employed to guide this aggregation. Although many studies claim
that these approaches improve classification performance over single gene
classifiers, the gain in performance is difficult to assess. This stems mainly
from the fact that different breast cancer data sets and validation procedures
are employed to assess the performance. Here we address these issues by
employing a large cohort of six breast cancer data sets as benchmark set and by
performing an unbiased evaluation of the classification accuracies of the
different approaches. Contrary to previous claims, we find that composite
feature classifiers do not outperform simple single gene classifiers. We
investigate the effect of (1) the number of selected features; (2) the specific
gene set from which features are selected; (3) the size of the training set and
(4) the heterogeneity of the data set on the performance of composite feature
and single gene classifiers. Strikingly, we find that randomization of
secondary data sources, which destroys all biological information in these
sources, does not result in a deterioration in performance of composite feature
classifiers. Finally, we show that when a proper correction for gene set size
is performed, the stability of single gene sets is similar to the stability of
composite feature sets. Based on these results there is currently no reason to
prefer prognostic classifiers based on composite features over single gene
classifiers for predicting outcome in breast cancer
Role of variant allele fraction and rare SNP filtering to improve cellular DNA repair endpoint association
BackgroundLarge cancer genome studies continue to reveal new players in treatment response and tumorigenesis. The discrimination of functional alterations from the abundance of passenger genetic alterations still poses challenges and determines DNA sequence variant selection procedures. Here we evaluate variant selection strategies that select homozygous variants and rare SNPs and assess its value in detecting tumor cells with DNA repair defects.MethodsTo this end we employed a panel of 29 patient-derived head and neck squamous cell carcinoma (HNSCC) cell lines, of which a subset harbors DNA repair defects. Mitomycin C (MMC) sensitivity was used as functional endpoint of DNA crosslink repair deficiency. 556 genes including the Fanconi anemia (FA) and homologous recombination (HR) genes, whose products strongly determine MMC response, were capture-sequenced.ResultsWe show a strong association between MMC sensitivity, thus loss of DNA repair function, and the presence of homozygous and rare SNPs in the relevant FA/HR genes. Excluding such selection criteria impedes the discrimination of crosslink repair status by mutation analysis. Applied to all KEGG pathways, we find that the association with MMC sensitivity is strongest in the KEGG FA pathway, therefore also demonstrating the value of such selection strategies for exploratory analyses. Variant analyses in 56 clinical samples demonstrate that homozygous variants occur more frequently in tumor suppressor genes than oncogenes further supporting the role of a homozygosity criterion to improve gene function association or tumor suppressor gene identification studies.ConclusionTogether our data show that the detection of relevant genes or of repair pathway defected tumor cells can be improved by the consideration of allele zygosity and SNP allele frequencies
Multiple low dose therapy as an effective strategy to treat EGFR inhibitor-resistant NSCLC tumours
Resistance to targeted cancer drugs is thought to result from selective pressure exerted by a high drug dose. Partial inhibition of multiple components in the same oncogenic signalling pathway may add up to complete pathway inhibition, while decreasing the selective pressure on each component to acquire a resistance mutation. We report here testing of this Multiple Low Dose (MLD) therapy model in EGFR mutant NSCLC. We show that as little as 20% of the individual effective drug doses is sufficient to completely block MAPK signalling and proliferation when used in 3D (RAF + MEK + ERK) or 4D (EGFR + RAF + MEK + ERK) inhibidor combinations. Importantly, EGFR mutant NSCLC cells treated with MLD therapy do not develop resistance. Using several animal models, we find durable responses to MLD therapy without associated toxicity. Our data support the notion that MLD therapy could deliver clinical benefit, even for those having acquired resistance to third generation EGFR inhibidor therapy.This work was supported by a grant from the Dutch Cancer Society through the Oncode Institute. Al.V. was supported by the Fondo de Investigaciones Sanitarias, FIS (PI16-01898, and by the Spanish Association Against Cancer, AECC (CGB14142035THOM) and Ideas Semilla project (IDEAS098VILL-IDEAS16) and Generalitat de Catalunya (2014SGR364). L.F. received a European Unionās Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie, grant agreement number 799850. E.N. was funded by Instituto Carlos III through the project PI18/00920. We thank CERCA Program/Generalitat de Catalunya for their institutional support and grant 2017SGR448
- ā¦