56,992 research outputs found

    SLIS Student Research Journal, Vol. 5, Iss. 1

    Get PDF

    Swings and Their Relation to Resiliency

    Get PDF

    Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds

    Full text link
    We describe a method to use discrete human feedback to enhance the performance of deep learning agents in virtual three-dimensional environments by extending deep-reinforcement learning to model the confidence and consistency of human feedback. This enables deep reinforcement learning algorithms to determine the most appropriate time to listen to the human feedback, exploit the current policy model, or explore the agent's environment. Managing the trade-off between these three strategies allows DRL agents to be robust to inconsistent or intermittent human feedback. Through experimentation using a synthetic oracle, we show that our technique improves the training speed and overall performance of deep reinforcement learning in navigating three-dimensional environments using Minecraft. We further show that our technique is robust to highly innacurate human feedback and can also operate when no human feedback is given

    Whewell\'s Wager: The Continuing Dialogue of Metaphysics and Physics in Science

    Full text link
    In his library at Trinity College, Cambridge University, around the year I860, William Whewell (1794-1866) engages in conversation with a company of thinkers on the province of metaphysics and physics, to form a comprehensive scientific belief. In attendance with him are Lord Francis Bacon (1561-1626), Sir Robert Boyle (1627-1691 ), Sir Isaac Newton (1642-1727) , John Henry Cardinal Newman (1801-1890), Professor Alfred North Whitehead (1861-1947), and Pope John Paul II (b. 1920). Whewell proposes a wager: Is there a possible remedy to be found for the schism between the metaphysical and the physical elements of science

    Making the most of the G8+5 Climate Change Process: Accelerating Structural Change and Technology Diffusion on a Global Scale. CEPS Task Force Reports, 5 June 2008

    Get PDF
    Under the chairmanship of Gunnar Still, Senior Vice President and Head of Environment Division at ThyssenKrupp, CEPS organized a Task Force to explore possible initiatives within the context of the G8+5 dialogue on tackling climate change. This report identifies a number of concrete measures that could reduce greenhouse gas (GHG) emissions, while at the same time stimulating structural change and technology development and diffusion. It calls for supporting action-based approaches, which are essential to achieve the necessary reductions in GHG emissions, inform the post-2012 negotiations and address the most urgent issues such as surging energy demand and the need for clean energy technologies in emerging economies. An action-based approach can be regarded as a way of integrating targets and timetables, as they are agreed, with consistent and comparable policies and measures. With a view to a long-term climate strategy, this report attempts to present a portfolio of actions that can be implemented and accelerated on a global scale – especially in the G8+5 countries and the EU, and could become a basis on which developed and developing countries can cooperate
    • …
    corecore