8 research outputs found

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Full text link
    Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License

    Unified classification and risk-stratification in Acute Myeloid Leukemia.

    Get PDF
    Clinical recommendations for Acute Myeloid Leukemia (AML) classification and risk-stratification remain heavily reliant on cytogenetic findings at diagnosis, which are present in <50% of patients. Using comprehensive molecular profiling data from 3,653 patients we characterize and validate 16 molecular classes describing 100% of AML patients. Each class represents diverse biological AML subgroups, and is associated with distinct clinical presentation, likelihood of response to induction chemotherapy, risk of relapse and death over time. Secondary AML-2, emerges as the second largest class (24%), associates with high-risk disease, poor prognosis irrespective of flow Minimal Residual Disease (MRD) negativity, and derives significant benefit from transplantation. Guided by class membership we derive a 3-tier risk-stratification score that re-stratifies 26% of patients as compared to standard of care. This results in a unified framework for disease classification and risk-stratification in AML that relies on information from cytogenetics and 32 genes. Last, we develop an open-access patient-tailored clinical decision support tool

    Molecular, clinical and therapeutic determinants of outcome in NPM1 mutated AML

    No full text
    Although NPM1-mutated acute myeloid leukemia (AML) carries a generally favorable prognosis, many patients still relapse and die. Previous studies identified several molecular and clinical features associated with poor outcome, however only FLT3-ITD mutation and adverse karyotype are currently used for risk stratification due to inconsistent results and uncertainty around how other factors should influence treatment, particularly given the strong prognostic impact of post-induction measurable residual disease (MRD). Here we analyzed a large group of patients with NPM1mut AML enrolled in prospective trials (NCRI AML17 and AML19, n=1357) to delineate the impact of baseline molecular and clinical features, post induction MRD status and treatment intensity on outcome. FLT3-ITD (HR 1.28, 95%CI 1.01-1.63), DNMT3A (HR 1.65, 95%CI 1.32-2.05), WT1 (HR 1.74, 95%CI 1272-2.38) and non-ABD NPM1 mutations (HR 1.64, 95%CI 1.22-2.21) were independently associated with poorer overall survival (OS). These factors were also strongly associated with MRD positivity. For patients achieving MRD negativity, these mutations (except FLT3-ITD) were associated with an increased cumulative incidence of relapse (CIR) and poorer OS. However, apart from the few patients with adverse cytogenetics, we could not identify any group of MRD negative patients with a CIR >40% or with benefit from allograft in first remission. Intensified chemotherapy with the FLAG-Ida regimen was associated with improved outcomes in all subgroups, with greater benefits observed in the highest risk molecular subgroups

    Molecular subclasses of clear cell ovarian carcinoma and their impact on disease behavior and outcomes

    Get PDF
    PURPOSE: To identify molecular subclasses of clear cell ovarian carcinoma (CCOC) and assess their impact on clinical presentation and outcomes. EXPERIMENTAL DESIGN: We profiled 421 primary CCOCs that passed quality control using a targeted deep sequencing panel of 163 putative CCOC driver genes and whole transcriptome sequencing of 211 of these tumors. Molecularly defined subgroups were identified and tested for association with clinical characteristics and overall survival. RESULTS: We detected a putative somatic driver mutation in at least one candidate gene in 95% (401/421) of CCOC tumors including ARID1A (in 49% of tumors), PIK3CA (49%), TERT (20%), and TP53 (16%). Clustering of cancer driver mutations and RNA expression converged upon two distinct subclasses of CCOC. The first was dominated by ARID1A-mutated tumors with enriched expression of canonical CCOC genes and markers of platinum resistance; the second was largely comprised of tumors with TP53 mutations and enriched for the expression of genes involved in extracellular matrix organization and mesenchymal differentiation. Compared with the ARID1A-mutated group, women with TP53-mutated tumors were more likely to have advanced-stage disease, no antecedent history of endometriosis, and poorer survival, driven by their advanced stage at presentation. In women with ARID1A-mutated tumors, there was a trend toward a lower rate of response to first-line platinum-based therapy. CONCLUSIONS: Our study suggests that CCOC consists of two distinct molecular subclasses with distinct clinical presentation and outcomes, with potential relevance to both traditional and experimental therapy responsiveness. See related commentary by Lheureux, p. 483

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    No full text
    Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    No full text
    Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License
    corecore