354 research outputs found

    Improved personalized survival prediction of patients with diffuse large B-cell Lymphoma using gene expression profiling

    Get PDF
    BACKGROUND: Thirty to forty percent of patients with Diffuse Large B-cell Lymphoma (DLBCL) have an adverse clinical evolution. The increased understanding of DLBCL biology has shed light on the clinical evolution of this pathology, leading to the discovery of prognostic factors based on gene expression data, genomic rearrangements and mutational subgroups. Nevertheless, additional efforts are needed in order to enable survival predictions at the patient level. In this study we investigated new machine learning-based models of survival using transcriptomic and clinical data. METHODS: Gene expression profiling (GEP) of in 2 different publicly available retrospective DLBCL cohorts were analyzed. Cox regression and unsupervised clustering were performed in order to identify probes associated with overall survival on the largest cohort. Random forests were created to model survival using combinations of GEP data, COO classification and clinical information. Cross-validation was used to compare model results in the training set, and Harrel's concordance index (c-index) was used to assess model's predictability. Results were validated in an independent test set. RESULTS: Two hundred thirty-three and sixty-four patients were included in the training and test set, respectively. Initially we derived and validated a 4-gene expression clusterization that was independently associated with lower survival in 20% of patients. This pattern included the following genes: TNFRSF9, BIRC3, BCL2L1 and G3BP2. Thereafter, we applied machine-learning models to predict survival. A set of 102 genes was highly predictive of disease outcome, outperforming available clinical information and COO classification. The final best model integrated clinical information, COO classification, 4-gene-based clusterization and the expression levels of 50 individual genes (training set c-index, 0.8404, test set c-index, 0.7942). CONCLUSION: Our results indicate that DLBCL survival models based on the application of machine learning algorithms to gene expression and clinical data can largely outperform other important prognostic variables such as disease stage and COO. Head-to-head comparisons with other risk stratification models are needed to compare its usefulness

    Incidence, risk factors, and outcomes of second neoplasms in patients with acute promyelocytic leukemia:the PETHEMA-PALG experience

    Get PDF
    The most important challenges in acute promyelocytic leukemia (APL) is preventing early death and reducing long-term events, such as second neoplasms (s-NPLs). We performed a retrospective analysis of 2670 unselected APL patients, treated with PETHEMA “chemotherapy based” and “chemotherapy free” protocols. Only de novo APL patients who achieved complete remission (CR) and completed the three consolidation cycles were enrolled into the analysis. Out of 2670 APL patients, there were 118 (4.4%) who developed s-NPLs with the median latency period (between first CR and diagnosis of s-NPL) of 48.0 months (range 2.8–231.1): 43.3 (range: 2.8–113.9) for s-MDS/AML and 61.7 (range: 7.1–231.1) for solid tumour. The 5-year CI of all s-NPLs was of 4.43% and 10 years of 7.92%. Among s-NPLs, there were 58 cases of s-MDS/AML, 3 cases of other hematological neoplasms, 57 solid tumours and 1 non-identified neoplasm. The most frequent solid tumour was colorectal, lung and breast cancer. Overall, the 2-year OS from diagnosis of s-NPLs was 40.6%, with a median OS of 11.1 months. Multivariate analysis identified age of 35 years (hazard ratio = 0.2584; p &lt; 0.0001) as an independent prognostic factor for s-NPLs. There were no significant differences in CI of s-NPLs at 5 years between chemotherapy-based vs chemotherapy-free regimens (hazard ratio = 1.09; p = 0.932). Larger series with longer follow-up are required to confirm the potential impact of ATO+ATRA regimens to reduce the incidence of s-NPLs after front-line therapy for APL.</p

    Machine Learning Improves Risk Stratification in Myelodysplastic Neoplasms : An Analysis of the Spanish Group of Myelodysplastic Syndromes

    Get PDF
    Myelodysplastic neoplasms (MDS) are a heterogeneous group of hematological stem cell disorders characterized by dysplasia, cytopenias, and increased risk of acute leukemia. As prognosis differs widely between patients, and treatment options vary from observation to allogeneic stem cell transplantation, accurate and precise disease risk prognostication is critical for decision making. With this aim, we retrieved registry data from MDS patients from 90 Spanish institutions. A total of 7202 patients were included, which were divided into a training (80%) and a test (20%) set. A machine learning technique (random survival forests) was used to model overall survival (OS) and leukemia-free survival (LFS). The optimal model was based on 8 variables (age, gender, hemoglobin, leukocyte count, platelet count, neutrophil percentage, bone marrow blast, and cytogenetic risk group). This model achieved high accuracy in predicting OS (c-indexes; 0.759 and 0.776) and LFS (c-indexes; 0.812 and 0.845). Importantly, the model was superior to the revised International Prognostic Scoring System (IPSS-R) and the age-adjusted IPSS-R. This difference persisted in different age ranges and in all evaluated disease subgroups. Finally, we validated our results in an external cohort, confirming the superiority of the Artificial Intelligence Prognostic Scoring System for MDS (AIPSS-MDS) over the IPSS-R, and achieving a similar performance as the molecular IPSS. In conclusion, the AIPSS-MDS score is a new prognostic model based exclusively on traditional clinical, hematological, and cytogenetic variables. AIPSS-MDS has a high prognostic accuracy in predicting survival in MDS patients, outperforming other well-established risk-scoring systems

    Machine Learning Improves Risk Stratification in Myelodysplastic Neoplasms: An Analysis of the Spanish Group of Myelodysplastic Syndromes

    No full text
    Myelodysplastic neoplasms (MDS) are a heterogeneous group of hematological stem cell disorders characterized by dysplasia, cytopenias, and increased risk of acute leukemia. As prognosis differs widely between patients, and treatment options vary from observation to allogeneic stem cell transplantation, accurate and precise disease risk prognostication is critical for decision making. With this aim, we retrieved registry data from MDS patients from 90 Spanish institutions. A total of 7202 patients were included, which were divided into a training (80%) and a test (20%) set. A machine learning technique (random survival forests) was used to model overall survival (OS) and leukemia-free survival (LFS). The optimal model was based on 8 variables (age, gender, hemoglobin, leukocyte count, platelet count, neutrophil percentage, bone marrow blast, and cytogenetic risk group). This model achieved high accuracy in predicting OS (c-indexes; 0.759 and 0.776) and LFS (c-indexes; 0.812 and 0.845). Importantly, the model was superior to the revised International Prognostic Scoring System (IPSS-R) and the age-adjusted IPSS-R. This difference persisted in different age ranges and in all evaluated disease subgroups. Finally, we validated our results in an external cohort, confirming the superiority of the Artificial Intelligence Prognostic Scoring System for MDS (AIPSS-MDS) over the IPSS-R, and achieving a similar performance as the molecular IPSS. In conclusion, the AIPSS-MDS score is a new prognostic model based exclusively on traditional clinical, hematological, and cytogenetic variables. AIPSS-MDS has a high prognostic accuracy in predicting survival in MDS patients, outperforming other well-established risk-scoring systems

    Luminosity determination using Z boson production at the CMS experiment

    No full text
    International audienceThe measurement of Z boson production is presented as a method to determine the integrated luminosity of CMS data sets. The analysis uses proton-proton collision data, recorded by the CMS experiment at the CERN LHC in 2017 at a center-of-mass energy of 13 TeV. Events with Z bosons decaying into a pair of muons are selected. The total number of Z bosons produced in a fiducial volume is determined, together with the identification efficiencies and correlations from the same dataset, in small intervals of 2 pb1^{-1} of integrated luminosity, thus facilitating the efficiency and rate measurement as a function of time and instantaneous luminosity. Using the ratio of the efficiency-corrected numbers of Z bosons, the precisely measured integrated luminosity of one data set is used to determine the luminosity of another. For the first time, a full quantitative uncertainty analysis of the use of Z bosons for the integrated luminosity measurement is performed. The uncertainty in the extrapolation between two data sets, recorded in 2017 at low and high instantaneous luminosity, is less than 0.5%. We show that the Z boson rate measurement constitutes a precise method, complementary to traditional methods, with the potential to improve the measurement of the integrated luminosity

    Search for direct production of GeV-scale resonances decaying to a pair of muons in proton-proton collisions at s\sqrt{s} = 13 TeV

    No full text
    International audienceA search for direct production of low-mass dimuon resonances is performed using s\sqrt{s} = 13 TeV proton-proton collision data collected by the CMS experiment during the 2017-2018 operation of the CERN LHC with an integrated luminosity of 96.6 fb1^{-1}. The search exploits a dedicated high-rate trigger stream that records events with two muons with transverse momenta as low as 3 GeV but does not include the full event information. The search is performed by looking for narrow peaks in the dimuon mass spectrum in the ranges of 1.1-2.6 GeV and 4.2-7.9 GeV. No significant excess of events above the expectation from the standard model background is observed. Model-independent limits on production rates of dimuon resonances within the experimental fiducial acceptance are set. Competitive or world's best limits are set at 90% confidence level for a minimal dark photon model and for a scenario with two Higgs doublets and an extra complex scalar singlet (2HDM+S). Values of the squared kinetic mixing coefficient ε2\varepsilon^2 in the dark photon model above 106^{-6} are excluded over most of the mass range of the search. In the 2HDM+S, values of the mixing angle sin(θH)\sin(\theta_\text{H}) above 0.08 are excluded over most of the mass range of the search with a fixed ratio of the Higgs doublets vacuum expectation tanβ\tan\beta = 0.5
    corecore