73 research outputs found

    Self-Supervised and Controlled Multi-Document Opinion Summarization

    Full text link
    We address the problem of unsupervised abstractive summarization of collections of user generated reviews with self-supervision and control. We propose a self-supervised setup that considers an individual document as a target summary for a set of similar documents. This setting makes training simpler than previous approaches by relying only on standard log-likelihood loss. We address the problem of hallucinations through the use of control codes, to steer the generation towards more coherent and relevant summaries.Finally, we extend the Transformer architecture to allow for multiple reviews as input. Our benchmarks on two datasets against graph-based and recent neural abstractive unsupervised models show that our proposed method generates summaries with a superior quality and relevance.This is confirmed in our human evaluation which focuses explicitly on the faithfulness of generated summaries We also provide an ablation study, which shows the importance of the control setup in controlling hallucinations and achieve high sentiment and topic alignment of the summaries with the input reviews.Comment: 18 pages including 5 pages appendi

    Aligning Language Models with Preferences through f-divergence Minimization

    Full text link
    Aligning language models with preferences can be posed as approximating a target distribution representing some desired behavior. Existing approaches differ both in the functional form of the target distribution and the algorithm used to approximate it. For instance, Reinforcement Learning from Human Feedback (RLHF) corresponds to minimizing a reverse KL from an implicit target distribution arising from a KL penalty in the objective. On the other hand, Generative Distributional Control (GDC) has an explicit target distribution and minimizes a forward KL from it using the Distributional Policy Gradient (DPG) algorithm. In this paper, we propose a new approach, f-DPG, which allows the use of any f-divergence to approximate any target distribution. f-DPG unifies both frameworks (RLHF, GDC) and the approximation methods (DPG, RL with KL penalties). We show the practical benefits of various choices of divergence objectives and demonstrate that there is no universally optimal objective but that different divergences are good for approximating different targets. For instance, we discover that for GDC, the Jensen-Shannon divergence frequently outperforms forward KL divergence by a wide margin, leading to significant improvements over prior work

    Positional information resolves structural variations and uncovers an evolutionarily divergent genetic locus in accessions of Arabidopsis thaliana.

    Get PDF
    Genome sequencing of closely related individuals has yielded valuable insights that link genome evolution to phenotypic variations. However, advancement in sequencing technology has also led to an escalation in the number of poor quality–drafted genomes assembled based on reference genomes that can have highly divergent or haplotypic regions. The self-fertilizing nature of Arabidopsis thaliana poses an advantage to sequencing projects because its genome is mostly homozygous. To determine the accuracy of an Arabidopsis drafted genome in less conserved regions, we performed a resequencing experiment on a 3 ~71-kb genomic interval in the Landsberg erecta (Ler-0) accession. We identified novel structural variations (SVs) between Ler-0 and the reference accession Col-0 using a long-range polymerase chain reaction approach to generate an Illumina data set that has positional information, that is, a data set with reads that map to a known location. Positional information is important for accurate genome assembly and the resolution of SVs particularly in highly duplicated or repetitive regions. Sixty-one regions with misassembly signatures were identified from the Ler-0 draft, suggesting the presence of novel SVs that are not represented in the draft sequence. Sixty of those were resolved by iterative mapping using our data set. Fifteen large indels (>100 bp) identified from this study were found to be located either within protein-coding regions or upstream regulatory regions, suggesting the formation of novel alleles or altered regulation of existing genes in Ler-0. We propose future genome-sequencing experiments to follow a clone-based approach that incorporates positional information to ultimately reveal haplotype-specific differences between accessions

    Safety of intravenous ferric carboxymaltose versus oral iron in patients with nondialysis-dependent CKD: an analysis of the 1-year FIND-CKD trial.

    Get PDF
    Background: The evidence base regarding the safety of intravenous (IV) iron therapy in patients with chronic kidney disease (CKD) is incomplete and largely based on small studies of relatively short duration. Methods: FIND-CKD (ClinicalTrials.gov number NCT00994318) was a 1-year, open-label, multicenter, prospective study of patients with nondialysis-dependent CKD, anemia and iron deficiency randomized (1:1:2) to IV ferric carboxymaltose (FCM), targeting higher (400-600 µg/L) or lower (100-200 µg/L) ferritin, or oral iron. A post hoc analysis of adverse event rates per 100 patient-years was performed to assess the safety of FCM versus oral iron over an extended period. Results: The safety population included 616 patients. The incidence of one or more adverse events was 91.0, 100.0 and 105.0 per 100 patient-years in the high ferritin FCM, low ferritin FCM and oral iron groups, respectively. The incidence of adverse events with a suspected relation to study drug was 15.9, 17.8 and 36.7 per 100 patient-years in the three groups; for serious adverse events, the incidence was 28.2, 27.9 and 24.3 per 100 patient-years. The incidence of cardiac disorders and infections was similar between groups. At least one ferritin level ≥800 µg/L occurred in 26.6% of high ferritin FCM patients, with no associated increase in adverse events. No patient with ferritin ≥800 µg/L discontinued the study drug due to adverse events. Estimated glomerular filtration rate remained the stable in all groups. Conclusions: These results further support the conclusion that correction of iron deficiency anemia with IV FCM is safe in patients with nondialysis-dependent CKD

    Multitask Prompted Training Enables Zero-Shot Task Generalization

    Get PDF
    International audienceLarge language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models’ pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale, we develop a system for easily mapping any natural language tasks into a human-readable prompted form. We convert a large set of supervised datasets, each with multiple prompts with diverse wording. These prompted datasets allow for benchmarking the ability of a model to perform completely held-out tasks. We fine-tune a pre-trained encoder-decoder model (Raffel et al., 2020; Lester et al., 2021) on this multitask mixture covering a wide variety of tasks. The model attains strong zero-shot performance on several standard datasets, often outperforming models up to 16x its size. Further, our approach attains strong performance on a subset of tasks from the BIG-bench benchmark, outperforming models up to 6x its size. All trained models are available at https://github.com/bigscience-workshop/t-zero, and all prompts are available at https://github.com/bigscience-workshop/promptsource

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Full text link
    Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License

    Search for dark matter produced in association with bottom or top quarks in √s = 13 TeV pp collisions with the ATLAS detector

    Get PDF
    A search for weakly interacting massive particle dark matter produced in association with bottom or top quarks is presented. Final states containing third-generation quarks and miss- ing transverse momentum are considered. The analysis uses 36.1 fb−1 of proton–proton collision data recorded by the ATLAS experiment at √s = 13 TeV in 2015 and 2016. No significant excess of events above the estimated backgrounds is observed. The results are in- terpreted in the framework of simplified models of spin-0 dark-matter mediators. For colour- neutral spin-0 mediators produced in association with top quarks and decaying into a pair of dark-matter particles, mediator masses below 50 GeV are excluded assuming a dark-matter candidate mass of 1 GeV and unitary couplings. For scalar and pseudoscalar mediators produced in association with bottom quarks, the search sets limits on the production cross- section of 300 times the predicted rate for mediators with masses between 10 and 50 GeV and assuming a dark-matter mass of 1 GeV and unitary coupling. Constraints on colour- charged scalar simplified models are also presented. Assuming a dark-matter particle mass of 35 GeV, mediator particles with mass below 1.1 TeV are excluded for couplings yielding a dark-matter relic density consistent with measurements

    Determination of the strong coupling constant αs from transverse energy–energy correlations in multijet events at s√=8 TeV using the ATLAS detector

    Get PDF
    Measurements of transverse energy–energy correlations and their associated asymmetries in multi-jet events using the ATLAS detector at the LHC are presented. The data used correspond to s√=8 TeV proton–proton collisions with an integrated luminosity of 20.2 fb−1 . The results are presented in bins of the scalar sum of the transverse momenta of the two leading jets, unfolded to the particle level and compared to the predictions from Monte Carlo simulations. A comparison with next-to-leading-order perturbative QCD is also performed, showing excellent agreement within the uncertainties. From this comparison, the value of the strong coupling constant is extracted for different energy regimes, thus testing the running of αs(μ) predicted in QCD up to scales over 1 TeV . A global fit to the transverse energy–energy correlation distributions yields αs(mZ)=0.1162±0.0011(exp.) +0.0084−0.0070(theo.) , while a global fit to the asymmetry distributions yields a value of αs(mZ)=0.1196±0.0013(exp.) +0.0075−0.0045(theo.)

    Measurement of the W boson polarisation in ttˉt\bar{t} events from pp collisions at s\sqrt{s} = 8 TeV in the lepton + jets channel with ATLAS

    Get PDF

    Measurement of jet fragmentation in Pb+Pb and pppp collisions at sNN=2.76\sqrt{{s_\mathrm{NN}}} = 2.76 TeV with the ATLAS detector at the LHC

    Get PDF
    corecore