Search CORE

15,260 research outputs found

Data-efficient learning of feedback policies from image pixels using deep dynamical models

Author: Assael J-AM
Deisenroth MP
Schön TB
Wahlström N
Publication venue
Publication date: 08/10/2015
Field of study

Data-efficient reinforcement learning (RL) in continuous state-action spaces using very high-dimensional observations remains a key challenge in developing fully autonomous systems. We consider a particularly important instance of this challenge, the pixels-to-torques problem, where an RL agent learns a closed-loop control policy ( torques ) from pixel information only. We introduce a data-efficient, model-based reinforcement learning algorithm that learns such a closed-loop policy directly from pixel information. The key ingredient is a deep dynamical model for learning a low-dimensional feature embedding of images jointly with a predictive model in this low-dimensional feature space. Joint learning is crucial for long-term predictions, which lie at the core of the adaptive nonlinear model predictive control strategy that we use for closed-loop control. Compared to state-of-the-art RL methods for continuous states and actions, our approach learns quickly, scales to high-dimensional state spaces, is lightweight and an important step toward fully autonomous end-to-end learning from pixels to torques

arXiv.org e-Print Archive

Spiral - Imperial College Digital Repository

Robotic ubiquitous cognitive ecology for smart homes

Author: A Cesta
A Gaddam
A Gerevini
A Lotfi
A Moustapha
A. K. Ray
A. Micheli
A. Renteria
A. Saffiotti
AK Ray
AK Ray
AK Ray
C Gallicchio
C Liming
C Watkins
C. Gallicchio
C. Gennaro
C. Vairo
D Bacciu
D Bacciu
D Cook
D De
D Peebles
D Roggen
D Vernon
D Verstraeten
D. Bacciu
D. Swords
DJ Cook
G Edelman
G Leng
G. Amato
H Hongmei
H Jaeger
H. Lozano
JR Anderson
M Alam
M Kurz
M Lukosevicius
M Sokolova
M. Broxvall
M. Di Rocco
M. Dragone
MB Do
P Doherty
P Langley
P Langley
P Rashidi
P. Vance
R Kulkarni
R Sun
R Sun
S Fratini
S Knight
S Zhang
S. Chessa
S. Coleman
T. M. McGinnity
W Duch
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Robotic ecologies are networks of heterogeneous robotic devices pervasively embedded in everyday environments, where they cooperate to perform complex tasks. While their potential makes them increasingly popular, one fundamental problem is how to make them both autonomous and adaptive, so as to reduce the amount of preparation, pre-programming and human supervision that they require in real world applications. The project RUBICON develops learning solutions which yield cheaper, adaptive and efficient coordination of robotic ecologies. The approach we pursue builds upon a unique combination of methods from cognitive robotics, machine learning, planning and agent- based control, and wireless sensor networks. This paper illustrates the innovations advanced by RUBICON in each of these fronts before describing how the resulting techniques have been integrated and applied to a smart home scenario. The resulting system is able to provide useful services and pro-actively assist the users in their activities. RUBICON learns through an incremental and progressive approach driven by the feed- back received from its own activities and from the user, while also self-organizing the manner in which it uses available sensors, actuators and other functional components in the process. This paper summarises some of the lessons learned by adopting such an approach and outlines promising directions for future work

Crossref

Heriot Watt Pure

Nottingham Trent Institutional Repository (IRep)

Archivio della Ricerca - Università di Pisa

TECNALIA Publications