27 research outputs found

    A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

    Full text link
    Pretraining is the preliminary and fundamental step in developing capable language models (LM). Despite this, pretraining data design is critically under-documented and often guided by empirically unsupported intuitions. To address this, we pretrain 28 1.5B parameter decoder-only models, training on data curated (1) at different times, (2) with varying toxicity and quality filters, and (3) with different domain compositions. First, we quantify the effect of pretraining data age. A temporal shift between evaluation data and pretraining data leads to performance degradation, which is not overcome by finetuning. Second, we explore the effect of quality and toxicity filters, showing a trade-off between performance on standard benchmarks and risk of toxic generations. Our findings indicate there does not exist a one-size-fits-all solution to filtering training data. We also find that the effects of different types of filtering are not predictable from text domain characteristics. Lastly, we empirically validate that the inclusion of heterogeneous data sources, like books and web, is broadly beneficial and warrants greater prioritization. These findings constitute the largest set of experiments to validate, quantify, and expose many undocumented intuitions about text pretraining, which we hope will help support more informed data-centric decisions in LM development

    Making 'chemical cocktails' - evolution of urban geochemical processes across the periodic table of elements.

    Get PDF
    Urbanization contributes to the formation of novel elemental combinations and signatures in terrestrial and aquatic watersheds, also known as 'chemical cocktails.' The composition of chemical cocktails evolves across space and time due to: (1) elevated concentrations from anthropogenic sources, (2) accelerated weathering and corrosion of the built environment, (3) increased drainage density and intensification of urban water conveyance systems, and (4) enhanced rates of geochemical transformations due to changes in temperature, ionic strength, pH, and redox potentials. Characterizing chemical cocktails and underlying geochemical processes is necessary for: (1) tracking pollution sources using complex chemical mixtures instead of individual elements or compounds; (2) developing new strategies for co-managing groups of contaminants; (3) identifying proxies for predicting transport of chemical mixtures using continuous sensor data; and (4) determining whether interactive effects of chemical cocktails produce ecosystem-scale impacts greater than the sum of individual chemical stressors. First, we discuss some unique urban geochemical processes which form chemical cocktails, such as urban soil formation, human-accelerated weathering, urban acidification-alkalinization, and freshwater salinization syndrome. Second, we review and synthesize global patterns in concentrations of major ions, carbon and nutrients, and trace elements in urban streams across different world regions and make comparisons with reference conditions. In addition to our global analysis, we highlight examples from some watersheds in the Baltimore-Washington DC region, which show increased transport of major ions, trace metals, and nutrients across streams draining a well-defined land-use gradient. Urbanization increased the concentrations of multiple major and trace elements in streams draining human-dominated watersheds compared to reference conditions. Chemical cocktails of major and trace elements were formed over diurnal cycles coinciding with changes in streamflow, dissolved oxygen, pH, and other variables measured by high-frequency sensors. Some chemical cocktails of major and trace elements were also significantly related to specific conductance (p<0.05), which can be measured by sensors. Concentrations of major and trace elements increased, peaked, or decreased longitudinally along streams as watershed urbanization increased, which is consistent with distinct shifts in chemical mixtures upstream and downstream of other major cities in the world. Our global analysis of urban streams shows that concentrations of multiple elements along the Periodic Table significantly increase when compared with reference conditions. Furthermore, similar biogeochemical patterns and processes can be grouped among distinct mixtures of elements of major ions, dissolved organic matter, nutrients, and trace elements as chemical cocktails. Chemical cocktails form in urban waters over diurnal cycles, decades, and throughout drainage basins. We conclude our global review and synthesis by proposing strategies for monitoring and managing chemical cocktails using source control, ecosystem restoration, and green infrastructure. We discuss future research directions applying the watershed chemical cocktail approach to diagnose and manage environmental problems. Ultimately, a chemical cocktail approach targeting sources, transport, and transformations of different and distinct elemental combinations is necessary to more holistically monitor and manage the emerging impacts of chemical mixtures in the world's fresh waters

    PaLM: Scaling Language Modeling with Pathways

    Full text link
    Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language model, which we call Pathways Language Model PaLM. We trained PaLM on 6144 TPU v4 chips using Pathways, a new ML system which enables highly efficient training across multiple TPU Pods. We demonstrate continued benefits of scaling by achieving state-of-the-art few-shot learning results on hundreds of language understanding and generation benchmarks. On a number of these tasks, PaLM 540B achieves breakthrough performance, outperforming the finetuned state-of-the-art on a suite of multi-step reasoning tasks, and outperforming average human performance on the recently released BIG-bench benchmark. A significant number of BIG-bench tasks showed discontinuous improvements from model scale, meaning that performance steeply increased as we scaled to our largest model. PaLM also has strong capabilities in multilingual tasks and source code generation, which we demonstrate on a wide array of benchmarks. We additionally provide a comprehensive analysis on bias and toxicity, and study the extent of training data memorization with respect to model scale. Finally, we discuss the ethical considerations related to large language models and discuss potential mitigation strategies

    Synthesis report of the IPCC Sixth Assessment Report (AR6), Longer report. IPCC.

    Get PDF
    This Synthesis Report (SYR) of the IPCC Sixth Assessment Report (AR6) summarises the state of knowledge of climate change, its widespread impacts and risks, and climate change mitigation and adaptation, based on the peer-reviewed scientific, technical and socio-economic literature since the publication of the IPCC’s Fifth Assessment Report (AR5) in 2014. The assessment is undertaken within the context of the evolving international landscape, in particular, developments in the UN Framework Convention on Climate Change (UNFCCC) process, including the outcomes of the Kyoto Protocol and the adoption of the Paris Agreement. It reflects the increasing diversity of those involved in climate action. This report integrates the main findings of the AR6 Working Group reports1 and the three AR6 Special Reports. It recognizes the interdependence of climate, ecosystems and biodiversity, and human societies; the value of diverse forms of knowledge; and the close linkages between climate change adaptation, mitigation, ecosystem health, human well-being and sustainable development. Building on multiple analytical frameworks, including those from the physical and social sciences, this report identifies opportunities for transformative action which are effective, feasible, just and equitable using concepts of systems transitions and resilient development pathways. Different regional classification schemes are used for physical, social and economic aspects, reflecting the underlying literature. After this introduction, Section 2, ‘Current Status and Trends’, opens with the assessment of observational evidence for our changing climate, historical and current drivers of human-induced climate change, and its impacts. It assesses the current implementation of adaptation and mitigation response options. Section 3, ‘Long-Term Climate and Development Futures’, provides a long-term assessment of climate change to 2100 and beyond in a broad range of socio-economic futures. It considers long-term characteristics, impacts, risks and costs in adaptation and mitigation pathways in the context of sustainable development. Section 4, ‘Near-Term Responses in a Changing Climate’, assesses opportunities for scaling up effective action in the period up to 2040, in the context of climate pledges, and commitments, and the pursuit of sustainable development. Based on scientific understanding, key findings can be formulated as statements of fact or associated with an assessed level of confidence using the IPCC calibrated language5 . The scientific findings are drawn from the underlying reports and arise from their Summary for Policymakers (hereafter SPM), Technical Summary (hereafter TS), and underlying chapters and are indicated by {} brackets. Figure 1.1 shows the Synthesis Report Figures Key, a guide to visual icons that are used across multiple figures within this report

    IPCC, 2023: Climate Change 2023: Synthesis Report, Summary for Policymakers. Contribution of Working Groups I, II and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change [Core Writing Team, H. Lee and J. Romero (eds.)]. IPCC, Geneva, Switzerland.

    Get PDF
    This Synthesis Report (SYR) of the IPCC Sixth Assessment Report (AR6) summarises the state of knowledge of climate change, its widespread impacts and risks, and climate change mitigation and adaptation. It integrates the main findings of the Sixth Assessment Report (AR6) based on contributions from the three Working Groups1 , and the three Special Reports. The summary for Policymakers (SPM) is structured in three parts: SPM.A Current Status and Trends, SPM.B Future Climate Change, Risks, and Long-Term Responses, and SPM.C Responses in the Near Term.This report recognizes the interdependence of climate, ecosystems and biodiversity, and human societies; the value of diverse forms of knowledge; and the close linkages between climate change adaptation, mitigation, ecosystem health, human well-being and sustainable development, and reflects the increasing diversity of actors involved in climate action. Based on scientific understanding, key findings can be formulated as statements of fact or associated with an assessed level of confidence using the IPCC calibrated language

    Selective dynamical imaging of interferometric data

    Get PDF
    Recent developments in very long baseline interferometry (VLBI) have made it possible for the Event Horizon Telescope (EHT) to resolve the innermost accretion flows of the largest supermassive black holes on the sky. The sparse nature of the EHT’s (u, v)-coverage presents a challenge when attempting to resolve highly time-variable sources. We demonstrate that the changing (u, v)-coverage of the EHT can contain regions of time over the course of a single observation that facilitate dynamical imaging. These optimal time regions typically have projected baseline distributions that are approximately angularly isotropic and radially homogeneous. We derive a metric of coverage quality based on baseline isotropy and density that is capable of ranking array configurations by their ability to produce accurate dynamical reconstructions. We compare this metric to existing metrics in the literature and investigate their utility by performing dynamical reconstructions on synthetic data from simulated EHT observations of sources with simple orbital variability. We then use these results to make recommendations for imaging the 2017 EHT Sgr A* data sethttp://iopscience.iop.org/2041-8205Physic

    Evaluating the acceptability of remote cognitive remediation from the perspective of psychosis service users

    No full text
    OBJECTIVES: Cognitive remediation (CR) can reduce the cognitive difficulties experienced by people with psychosis. Adapting CR to be delivered remotely provides new opportunities for extending its use. However, doing so requires further evaluation of its acceptability from service users' views. We evaluate the acceptability of therapist-supported remote CR from the perspectives of service users using participatory service user-centred methods.METHOD: After receiving 12 weeks of therapist-supported remote CR, service users were interviewed by a service user researcher following a semi-structured 18-question interview guide. Transcripts were analysed using reflexive thematic analysis with themes and codes further validated by a Lived Experience Advisory Panel and member checking.RESULTS: The study recruited 26 participants, almost all of whom reported high acceptability of remote CR, and some suggested improvements. Four themes emerged: (1) perceived treatment benefits, (2) remote versus in-person therapy, (3) the therapist's role, and (4) how it could be better.CONCLUSIONS: This study used comprehensive service user involvement methods. For some participants, technology use remained a challenge and addressing these difficulties detracted from the therapy experience. These outcomes align with existing research on remote therapy, suggesting that remote CR can expand choice and improve access to treatment for psychosis service users once barriers are addressed. Future use of remote CR should consider technology training and equipment provision to facilitate therapy for service users and therapists.</p
    corecore