2,748 research outputs found

    Human Apprenticeship Learning via Kernel-based Inverse Reinforcement Learning

    Full text link
    It has been well demonstrated that inverse reinforcement learning (IRL) is an effective technique for teaching machines to perform tasks at human skill levels given human demonstrations (i.e., human to machine apprenticeship learning). This paper seeks to show that a similar application can be demonstrated with human learners. That is, given demonstrations from human experts inverse reinforcement learning techniques can be used to teach other humans to perform at higher skill levels (i.e., human to human apprenticeship learning). To show this two experiments were conducted using a simple, real-time web game where players were asked to touch targets in order to earn as many points as possible. For the experiment player performance was defined as the number of targets a player touched, irrespective of the points that a player actually earned. This allowed for in-game points to be modified and the effect of these alterations on performance measured. At no time were participants told the true performance metric. To determine the point modifications IRL was applied on demonstrations of human experts playing the game. The results of the experiment show with significance that performance improved over the control for select treatment groups. Finally, in addition to the experiment, we also detail the algorithmic challenges we faced when conducting the experiment and the techniques we used to overcome them.Comment: 31 pages, 23 figures, Submitted to Journal of Artificial Intelligence Research, "for source code, see https://github.com/mrucker/kpirl-kla

    Suitability of litter amendments for the Australian chicken meat industry

    Get PDF
    This project focused on litter amendment products, which are used overseas during the rearing of meat chickens. Litter amendments are primarily used to manage ammonia volatilisation, especially when litter is reused, but also provide antimicrobial and environmental benefits, and increase the nutrient value of spent litter. This report summarises the outcomes of consultation with representatives and stakeholders of the Australian chicken meat industry, and summarises key findings from a literature review on litter amendments

    Collaborative multidisciplinary learning : quantity surveying students’ perspectives

    Get PDF
    The construction industry is highly fragmented and is known for its adversarial culture, culminating in poor quality projects not completed on time or within budget. The aim of this study is thus to guide the design of QS programme curricula in order to help students develop the requisite knowledge and skills to work more collaboratively in their multi-disciplinary future workplaces. A qualitative approach was considered appropriate as the authors were concerned with gathering an initial understanding of what students think of multi-disciplinary learning. The data collection method used was a questionnaire which was developed by the Behaviours4Collaboration (B4C) team. Knowledge gaps were still found across all the key areas where a future QS practitioner needs to be collaborative (either as a project contributor or as a project leader) despite the need for change instigated by the multi-disciplinary (BIM) education revolution. The study concludes that universities will need to be selective in teaching, and innovative in reorienting, QS education so that a collaborative BIM education can be effected in stages, increasing in complexity as the students’ technical knowledge grows. This will help students to build the competencies needed to make them future leaders. It will also support programme currency and delivery

    Long-Term Testing of Rhodium-Based Catalysts for Mixed Alcohol Synthesis ? 2013 Progress Report

    Get PDF
    The U.S. Department of Energy’s Pacific Northwest National Laboratory has been conducting research since 2005 to develop a catalyst for the conversion of synthesis gas (carbon monoxide and hydrogen) into mixed alcohols for use in liquid transportation fuels. Initially, research involved screening possible catalysts based on a review of the literature, because at that time, there were no commercial catalysts available. The screening effort resulted in a decision to focus on catalysts containing rhodium and manganese. Subsequent research identified iridium as a key promoter for this catalyst system. Since then, research has continued to improve rhodium/manganese/iridium-based catalysts, optimizing the relative and total concentrations of the three metals, examining baseline catalysts on alternative supports, and examining effects of additional promoters. Testing was continued in FY 2013 to evaluate the performance and long-term stability of the best catalysts tested to date. Three tests were conducted. A long-term test of over 2300 hr duration at a single set of operating conditions was conducted with the best carbon-supported catalyst. A second test of about 650 hr duration at a single set of operating conditions was performed for comparison using the same catalyst formulation on an alternative carbon support. A third test of about 680 hr duration at a single set of operating conditions was performed using the best silica-supported catalyst tested to date

    Sequential quasi-Monte Carlo: Introduction for Non-Experts, Dimension Reduction, Application to Partly Observed Diffusion Processes

    Full text link
    SMC (Sequential Monte Carlo) is a class of Monte Carlo algorithms for filtering and related sequential problems. Gerber and Chopin (2015) introduced SQMC (Sequential quasi-Monte Carlo), a QMC version of SMC. This paper has two objectives: (a) to introduce Sequential Monte Carlo to the QMC community, whose members are usually less familiar with state-space models and particle filtering; (b) to extend SQMC to the filtering of continuous-time state-space models, where the latent process is a diffusion. A recurring point in the paper will be the notion of dimension reduction, that is how to implement SQMC in such a way that it provides good performance despite the high dimension of the problem.Comment: To be published in the proceedings of MCMQMC 201
    • …
    corecore