6,176 research outputs found

    STEVE-1: A Generative Model for Text-to-Behavior in Minecraft

    Full text link
    Constructing AI models that respond to text instructions is challenging, especially for sequential decision-making tasks. This work introduces an instruction-tuned Video Pretraining (VPT) model for Minecraft called STEVE-1, demonstrating that the unCLIP approach, utilized in DALL-E 2, is also effective for creating instruction-following sequential decision-making agents. STEVE-1 is trained in two steps: adapting the pretrained VPT model to follow commands in MineCLIP's latent space, then training a prior to predict latent codes from text. This allows us to finetune VPT through self-supervised behavioral cloning and hindsight relabeling, bypassing the need for costly human text annotations. By leveraging pretrained models like VPT and MineCLIP and employing best practices from text-conditioned image generation, STEVE-1 costs just $60 to train and can follow a wide range of short-horizon open-ended text and visual instructions in Minecraft. STEVE-1 sets a new bar for open-ended instruction following in Minecraft with low-level controls (mouse and keyboard) and raw pixel inputs, far outperforming previous baselines. We provide experimental evidence highlighting key factors for downstream performance, including pretraining, classifier-free guidance, and data scaling. All resources, including our model weights, training scripts, and evaluation tools are made available for further research

    The regional economic impact of more graduates in the labour market: a “micro-to-macro” analysis for Scotland

    Get PDF
    This paper explores the system-wide impact of graduates on the regional economy. Graduates enjoy a significant wage premium, often interpreted as reflecting their greater productivity relative to non-graduates. If this is so there is a clear and direct supply-side impact of HEI activities on regional economies. We use an HEI-disaggregated computable general equilibrium model of Scotland to estimate the impact of the growing proportion of graduates in the Scottish labour force that is implied by the current participation rate and demographic change, taking the graduate wage premium in Scotland as an indicator of productivity enhancement. While the detailed results vary with alternative assumptions about the extent to which wage premia reflect productivity, they do suggest that the long-term supply-side impacts of HEIs provide a significant boost to regional GDP. Furthermore, the results suggest that the supply-side impacts of HEIs are likely to be more important than the expenditure impacts that are the focus of most HEI impact studies

    Large Language Models Are Human-Level Prompt Engineers

    Full text link
    By conditioning on natural language instructions, large language models (LLMs) have displayed impressive capabilities as general-purpose computers. However, task performance depends significantly on the quality of the prompt used to steer the model, and most effective prompts have been handcrafted by humans. Inspired by classical program synthesis and the human approach to prompt engineering, we propose Automatic Prompt Engineer (APE) for automatic instruction generation and selection. In our method, we treat the instruction as the "program," optimized by searching over a pool of instruction candidates proposed by an LLM in order to maximize a chosen score function. To evaluate the quality of the selected instruction, we evaluate the zero-shot performance of another LLM following the selected instruction. Experiments on 24 NLP tasks show that our automatically generated instructions outperform the prior LLM baseline by a large margin and achieve better or comparable performance to the instructions generated by human annotators on 19/24 tasks. We conduct extensive qualitative and quantitative analyses to explore the performance of APE. We show that APE-engineered prompts can be applied to steer models toward truthfulness and/or informativeness, as well as to improve few-shot learning performance by simply prepending them to standard in-context learning prompts. Please check out our webpage at https://sites.google.com/view/automatic-prompt-engineer

    Which older people decline participation in a primary care trial of physical activity and why: insights from a mixed methods approach

    Get PDF
    This article is available through the Brunel Open Access Publishing Fund. Copyright 2014 Rogers et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.Background: Physical activity is of vital importance to older peoples’ health. Physical activity intervention studies with older people often have low recruitment, yet little is known about non-participants. Methods: Patients aged 60–74 years from three UK general practices were invited to participate in a nurse-supported pedometer-based walking intervention. Demographic characteristics of 298 participants and 690 non-participants were compared. Health status and physical activity of 298 participants and 183 non-participants who completed a survey were compared using age, sex adjusted odds ratios (OR) (95% confidence intervals). 15 non-participants were interviewed to explore perceived barriers to participation. Results: Recruitment was 30% (298/988). Participants were more likely than non-participants to be female (54% v 47%; p = 0.04) and to live in affluent postcodes (73% v 62% in top quintile; p < 0.001). Participants were more likely than non-participants who completed the survey to have an occupational pension OR 2.06 (1.35-3.13), a limiting longstanding illness OR 1.72 (1.05-2.79) and less likely to report being active OR 0.55 (0.33-0.93) or walking fast OR 0.56 (0.37-0.84). Interviewees supported general practice-based physical activity studies, particularly walking, but barriers to participation included: already sufficiently active, reluctance to walk alone or at night, physical symptoms, depression, time constraints, trial equipment and duration. Conclusion: Gender and deprivation differences suggest some selection bias. However, trial participants reported more health problems and lower activity than non-participants who completed the survey, suggesting appropriate trial selection in a general practice population. Non-participant interviewees indicated that shorter interventions, addressing physical symptoms and promoting confidence in pursuing physical activity, might increase trial recruitment and uptake of practice-based physical activity endeavours.The National Institute for Health Research (NIHR) under its Research for Patient Benefit Programme (Grant Reference Number PB-PG-0909-20055)
    corecore