Developmental Bayesian Optimization of Black-Box with Visual Similarity-Based Transfer Learning

Author: Chen Liming
Dellandréa Emmanuel
Depierre Amaury
Petit Maxime
Wang Xiaofang
Publication date: 16/09/2018

We present a developmental framework based on a long-term memory and reasoning mechanisms (Vision Similarity and Bayesian Optimisation). This architecture allows a robot to autonomously optimize the hyper-parameters that need to be tuned for any action and/or vision module, treated as a black box. The learning can take advantage of past experiences (stored in the episodic and procedural memories) to warm-start the exploration using a set of hyper-parameters previously optimized for objects similar to the new, unknown one (stored in a semantic memory). As an example, the system has been used to optimize 9 continuous hyper-parameters of a professional software package (Kamido), both in simulation and with a real robot (a Fanuc industrial robotic arm), with a total of 13 different objects. The robot is able to find a good object-specific optimization in 68 (simulation) or 40 (real) trials. In simulation, we demonstrate the benefit of transfer learning based on visual similarity, as opposed to amnesic learning (i.e. learning from scratch every time). Moreover, with the real robot, we show that the method consistently outperforms manual optimization by an expert, with less than 2 hours of training time to achieve a success rate of more than 88%.

arXiv.org e-Print Archive

Crossref

HAL

Hal-Diderot

Robot-aided cloth classification using depth information and CNNs

Author: Alenyà Ribas Guillem
Corona Puyane Enric
Gabas Nova Antonio
Torras Carme
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

The final publication is available at link.springer.com. We present a system to deal with the problem of classifying garments from a pile of clothes. This system uses a robot arm to extract a garment and show it to a depth camera. Using only depth images of a partial view of the garment as input, a deep convolutional neural network has been trained to classify different types of garments. The robot can rotate the garment about the vertical axis to provide different views of it, increasing the prediction confidence and avoiding confusion between classes. In addition to obtaining very high classification scores, our system provides a fast and occlusion-robust solution to the problem, in contrast to previous approaches to cloth classification that match the sensed data against a database. Peer Reviewed. Postprint (author's final draft).

Crossref

UPCommons. Portal del coneixement obert de la UPC

Overcoming Exploration in Reinforcement Learning with Demonstrations

Author: Abbeel Pieter
Andrychowicz Marcin
McGrew Bob
Nair Ashvin
Zaremba Wojciech
Publication date: 25/02/2018

Exploration in environments with sparse rewards has been a persistent problem in reinforcement learning (RL). Many tasks are natural to specify with a sparse reward, and manually shaping a reward function can result in suboptimal performance. However, finding a non-zero reward is exponentially more difficult with increasing task horizon or action dimensionality. This puts many real-world tasks out of practical reach of RL methods. In this work, we use demonstrations to overcome the exploration problem and successfully learn to perform long-horizon, multi-step robotics tasks with continuous control such as stacking blocks with a robot arm. Our method, which builds on top of Deep Deterministic Policy Gradients and Hindsight Experience Replay, provides an order of magnitude of speedup over RL on simulated robotics tasks. It is simple to implement and makes only the additional assumption that we can collect a small set of demonstrations. Furthermore, our method is able to solve tasks not solvable by either RL or behavior cloning alone, and often ends up outperforming the demonstrator policy. Comment: 8 pages, ICRA 201

arXiv.org e-Print Archive

Crossref

Explore, Exploit or Listen: Combining Human Feedback and Policy Model to Speed up Deep Reinforcement Learning in 3D Worlds

Author: Ashley Haines (4342138)
David Gauthier (1893841)
David Taylor (140886)
Donald Landers (4342165)
Hamish Small (4342156)
Jeffrey Shields (4342159)
John Hoenig (4342153)
John Swenarton (4342132)
Mark Matsche (4342147)
Matthew Smith (326340)
Maya Groner (459554)
Philip Sadler (4342126)
Roger Pradel (111017)
Rémi Choquet (111021)
Wolfgang Vogelbein (4342162)
Publication date: 14/06/2017

We describe a method that uses discrete human feedback to enhance the performance of deep learning agents in virtual three-dimensional environments by extending deep reinforcement learning (DRL) to model the confidence and consistency of human feedback. This enables DRL algorithms to determine the most appropriate time to listen to the human feedback, exploit the current policy model, or explore the agent's environment. Managing the trade-off between these three strategies allows DRL agents to be robust to inconsistent or intermittent human feedback. Through experimentation using a synthetic oracle, we show that our technique improves the training speed and overall performance of deep reinforcement learning in navigating three-dimensional environments using Minecraft. We further show that our technique is robust to highly inaccurate human feedback and can also operate when no human feedback is given.

arXiv.org e-Print Archive

Dryad Digital Repository (Duke University)

FigShare

Retracing trajectories: the embodied experience of cycling, urban sensescapes and the commute between ‘neighbourhood’ and ‘city’ in Utrecht, NL

Author: Aldred
Bas Spierings
Bekkering
Bissell
Corbin
Crang
Cresswell
Csikszentmihalyi
De Certeau
Degen
Edensor
Fincham
Gehl
Heinen
Jan van Duppen
Jensen
Jones
Katz
Kusenbach
Lefebvre
Middleton
Municipality of Utrecht
Pelzer
Pile
Pinder
Pooley
Rodaway
Ruimtelijk
Seamon
Sheller
Spinney
Stoffers
Thrift
Tight
Van der Burg
Van der Hoeven
Virilio
Wunderlich
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013

This paper looks into the experience of “passing through different territories of the city” (Sennett, 2006, p. 3). Despite their importance for making sense of the city as a whole, these experiences are often not acknowledged in urban planning. This paper compares the everyday, embodied experiences of commuter cyclists with the planners’ perspective on Utrecht. ‘On the ground’ data were collected via ride-alongs with 15 inhabitants of the Leidsche Rijn neighbourhood. Our analysis reveals cycling trajectories composed of diverse sensescapes. It paints a much more complex picture of intra-urban divisions and connections than the planners’ perspective, in which the ‘new’ Leidsche Rijn neighbourhood is separated from the ‘old’ city by major infrastructure lines.

Crossref

Open Research Online (The Open University)

Utrecht University Repository