59 research outputs found

    F.A.R.O.G. FORUM, Vol. 2 No. 1

    Get PDF
    https://digitalcommons.library.umaine.edu/francoamericain_forum/1004/thumbnail.jp

    Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

    Full text link
    Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there has been relatively little public work systematizing its flaws. In this paper, we (1) survey open problems and fundamental limitations of RLHF and related methods; (2) overview techniques to understand, improve, and complement RLHF in practice; and (3) propose auditing and disclosure standards to improve societal oversight of RLHF systems. Our work emphasizes the limitations of RLHF and highlights the importance of a multi-faceted approach to the development of safer AI systems

    The United States COVID-19 Forecast Hub dataset

    Get PDF
    Academic researchers, government agencies, industry groups, and individuals have produced forecasts at an unprecedented scale during the COVID-19 pandemic. To leverage these forecasts, the United States Centers for Disease Control and Prevention (CDC) partnered with an academic research lab at the University of Massachusetts Amherst to create the US COVID-19 Forecast Hub. Launched in April 2020, the Forecast Hub is a dataset with point and probabilistic forecasts of incident cases, incident hospitalizations, incident deaths, and cumulative deaths due to COVID-19 at county, state, and national, levels in the United States. Included forecasts represent a variety of modeling approaches, data sources, and assumptions regarding the spread of COVID-19. The goal of this dataset is to establish a standardized and comparable set of short-term forecasts from modeling teams. These data can be used to develop ensemble models, communicate forecasts to the public, create visualizations, compare models, and inform policies regarding COVID-19 mitigation. These open-source data are available via download from GitHub, through an online API, and through R packages

    Precision Machine Learning

    No full text
    We explore unique considerations involved in fitting machine learning (ML) models to data with very high precision, as is often required for science applications. We empirically compare various function approximation methods and study how they scale with increasing parameters and data. We find that neural networks (NNs) can often outperform classical approximation methods on high-dimensional examples, by (we hypothesize) auto-discovering and exploiting modular structures therein. However, neural networks trained with common optimizers are less powerful for low-dimensional cases, which motivates us to study the unique properties of neural network loss landscapes and the corresponding optimization challenges that arise in the high precision regime. To address the optimization issue in low dimensions, we develop training tricks which enable us to train neural networks to extremely low loss, close to the limits allowed by numerical precision

    Multivalency effects in Pseudomonas aeruginosa biofilm inhibition and dispersal by glycopeptide dendrimers targeting lectin LecA

    Get PDF
    The galactose specific lectin LecA partly mediates the formation of antibiotic resistant biofilms by Pseudomonas aeruginosa, an opportunistic pathogen causing lethal airways infections in immunocompromised and cystic fibrosis patients, suggesting that preventing LecA binding to natural saccharides might provide new opportunities for treatment. Here 8-fold (G3) and 16-fold (G4) galactosylated analogs of GalAG2, a tetravalent G2 glycopeptide dendrimer LecA ligand and P. aeruginosa biofilm inhibitor, were obtained by convergent chloroacetyl thioether (ClAc) ligation between 4-fold or 8-fold chloroacetylated dendrimer cores and digalactosylated dendritic arms. Hemagglutination inhibition, isothermal titration calorimetry and biofilm inhibition assays showed that G3 dendrimers bind LecA slightly better than their parent G2 dendrimers and induce complete biofilm inhibition and dispersal of P. aeruginosa biofilms, while G4 dendrimers show reduced binding and no biofilm inhibition. A binding model accounting for the observed saturation of glycopeptide dendrimer galactosyl groups and LecA binding sites is proposed based on the crystal structure of a G3 dendrimer LecA complex
    corecore