56,992 research outputs found
Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds
We describe a method to use discrete human feedback to enhance the
performance of deep learning agents in virtual three-dimensional environments
by extending deep-reinforcement learning to model the confidence and
consistency of human feedback. This enables deep reinforcement learning
algorithms to determine the most appropriate time to listen to the human
feedback, exploit the current policy model, or explore the agent's environment.
Managing the trade-off between these three strategies allows DRL agents to be
robust to inconsistent or intermittent human feedback. Through experimentation
using a synthetic oracle, we show that our technique improves the training
speed and overall performance of deep reinforcement learning in navigating
three-dimensional environments using Minecraft. We further show that our
technique is robust to highly innacurate human feedback and can also operate
when no human feedback is given
Whewell\'s Wager: The Continuing Dialogue of Metaphysics and Physics in Science
In his library at Trinity College, Cambridge University, around the year I860, William
Whewell (1794-1866) engages in conversation with a company of thinkers on the province
of metaphysics and physics, to form a comprehensive scientific belief. In attendance with him are Lord Francis Bacon (1561-1626), Sir Robert Boyle (1627-1691 ), Sir Isaac Newton (1642-1727) , John Henry Cardinal Newman (1801-1890), Professor Alfred North Whitehead (1861-1947), and Pope John Paul II (b. 1920). Whewell proposes a wager: Is there a possible remedy to be found for the schism between the metaphysical and the physical
elements of science
Making the most of the G8+5 Climate Change Process: Accelerating Structural Change and Technology Diffusion on a Global Scale. CEPS Task Force Reports, 5 June 2008
Under the chairmanship of Gunnar Still, Senior Vice President and Head of Environment Division at ThyssenKrupp, CEPS organized a Task Force to explore possible initiatives within the context of the G8+5 dialogue on tackling climate change. This report identifies a number of concrete measures that could reduce greenhouse gas (GHG) emissions, while at the same time stimulating structural change and technology development and diffusion. It calls for supporting action-based approaches, which are essential to achieve the necessary reductions in GHG emissions, inform the post-2012 negotiations and address the most urgent issues such as surging energy demand and the need for clean energy technologies in emerging economies. An action-based approach can be regarded as a way of integrating targets and timetables, as they are agreed, with consistent and comparable policies and measures. With a view to a long-term climate strategy, this report attempts to present a portfolio of actions that can be implemented and accelerated on a global scale – especially in the G8+5 countries and the EU, and could become a basis on which developed and developing countries can cooperate
- …