34,027 research outputs found
Feudal Graph Reinforcement Learning
We focus on learning composable policies to control a variety of physical
agents with possibly different structures. Among state-of-the-art methods,
prominent approaches exploit graph-based representations and weight-sharing
modular policies based on the message-passing framework. However, as shown by
recent literature, message passing can create bottlenecks in information
propagation and hinder global coordination. This drawback can become even more
problematic in tasks where high-level planning is crucial. In fact, in similar
scenarios, each modular policy - e.g., controlling a joint of a robot - would
request to coordinate not only for basic locomotion but also achieve high-level
goals, such as navigating a maze. A classical solution to avoid similar
pitfalls is to resort to hierarchical decision-making. In this work, we adopt
the Feudal Reinforcement Learning paradigm to develop agents where control
actions are the outcome of a hierarchical (pyramidal) message-passing process.
In the proposed Feudal Graph Reinforcement Learning (FGRL) framework,
high-level decisions at the top level of the hierarchy are propagated through a
layered graph representing a hierarchy of policies. Lower layers mimic the
morphology of the physical system and upper layers can capture more abstract
sub-modules. The purpose of this preliminary work is to formalize the framework
and provide proof-of-concept experiments on benchmark environments (MuJoCo
locomotion tasks). Empirical evaluation shows promising results on both
standard benchmarks and zero-shot transfer learning settings
Reactive with tags classifier system applied to real robot navigation
7th IEEE International Conference on Emerging Technologies and Factory Automation. Barcelona, 18-21 October 1999.A reactive with tags classifier system (RTCS) is a special classifier system. This system combines the execution capabilities of symbolic systems and the learning capabilities of genetic algorithms. A RTCS is able to learn symbolic rules that allow to generate sequence of actions, chaining rules among different time instants, and react to new environmental situations, considering the last environmental situation to take a decision. The capacity of RTCS to learn good rules has been prove in robotics navigation problem. Results show the suitability of this approximation to the navigation problem and the coherence of extracted rules
Applying classifier systems to learn the reactions in mobile robots
The navigation problem involves how to reach a goal avoiding obstacles in dynamic environments. This problem can be faced considering reactions and sequences of actions. Classifier systems (CSs) have proven their ability of continuous learning, however, they have some problems in reactive systems. A modified CS, namely a reactive classifier system (RCS), is proposed to overcome those problems. Two special mechanisms are included in the RCS: the non-existence of internal cycles inside the CS (no internal cycles) and the fusion of environmental message with the messages posted to the message list in the previous instant (generation list through fusion). These mechanisms allow the learning of both reactions and sequences of actions. This learning process involves two main tasks: first, discriminate between rules and, second, the discovery of new rules to obtain a successful operation in dynamic environments. DiVerent experiments have been carried out using a mini-robot Khepera to find a generalized solution. The results show the ability of the system for continuous learning and adaptation to new situations.Publicad
Parallelizing RRT on large-scale distributed-memory architectures
This paper addresses the problem of parallelizing the Rapidly-exploring Random Tree (RRT) algorithm on large-scale distributed-memory architectures, using the Message Passing Interface. We compare three parallel versions of RRT based on classical parallelization schemes. We evaluate them on different motion planning problems and analyze the various factors influencing their performance
Improving situation awareness of a single human operator interacting with multiple unmanned vehicles: first results
In the context of the supervision of one or several unmanned vehicles by a human operator, the design of an adapted user interface is a major challenge. Therefore, in the context of an existing experimental set up composed of a ground station and heterogeneous unmanned ground and air vehicles we aim at redesigning the human-robot interactions to improve the operator's situation awareness. We base our new design on a classical user centered approach
RTCS: a reactive with tags classifier system
In this work, a new Classifier System is proposed (CS). The system, a Reactive with Tags Classifier System (RTCS), is able to take into account environmental situations in intermediate decisions. CSs are special production systems, where conditions and actions are codified in order to learn new rules by means of Genetic Algorithms (GA). The RTCS has been designed to generate sequences of actions like the traditional classifier systems, but RTCS also has the capability of chaining rules among different time instants and reacting to new environmental situations, considering the last environmental situation to take a decision. In addition to the capability to react and generate sequences of actions, the design of a new rule codification allows the evolution of groups of specialized rules. This new codification is based on the inclusion of several bits, named tags, in conditions and actions, which evolve by means of GA. RTCS has been tested in robotic navigation. Results show the suitability of this approximation to the navigation problem and the coherence of tag values in rules classification.Publicad
- âŠ