625 research outputs found
Scale-free vision-based aerial control of a ground formation with hybrid topology
We present a novel vision-based control method to make a group of ground
mobile robots achieve a specified formation shape with unspecified size. Our
approach uses multiple aerial control units equipped with downward-facing
cameras, each observing a partial subset of the multirobot team. The units
compute the control commands from the ground robots' image projections, using
neither calibration nor scene scale information, and transmit them to the
robots. The control strategy relies on the calculation of image similarity
transformations, and we show it to be asymptotically stable if the overlaps
between the subsets of controlled robots satisfy certain conditions. The
presence of the supervisory units, which coordinate their motions to guarantee
a correct control performance, gives rise to a hybrid system topology. All in
all, the proposed system provides relevant practical advantages in simplicity
and flexibility. Within the problem of controlling a team shape, our
contribution lies in addressing several simultaneous challenges: the controller
needs only partial information of the robotic group, does not use distance
measurements or global reference frames, is designed for unicycle agents, and
can accommodate topology changes. We present illustrative simulation results.Comment: This is the accepted version an already published manuscript. See
journal reference for detail
Fuzzy Ensembles of Reinforcement Learning Policies for Robotic Systems with Varied Parameters
Reinforcement Learning (RL) is an emerging approach to control many dynamical
systems for which classical control approaches are not applicable or
insufficient. However, the resultant policies may not generalize to variations
in the parameters that the system may exhibit. This paper presents a powerful
yet simple algorithm in which collaboration is facilitated between RL agents
that are trained independently to perform the same task but with different
system parameters. The independency among agents allows the exploitation of
multi-core processing to perform parallel training. Two examples are provided
to demonstrate the effectiveness of the proposed technique. The main
demonstration is performed on a quadrotor with slung load tracking problem in a
real-time experimental setup. It is shown that integrating the developed
algorithm outperforms individual policies by reducing the RMSE tracking error.
The robustness of the ensemble is also verified against wind disturbance.Comment: arXiv admin note: text overlap with arXiv:2311.0501
Apprenticeship Bootstrapping for Autonomous Aerial Shepherding of Ground Swarm
Aerial shepherding of ground vehicles (ASGV) musters a group of uncrewed ground vehicles (UGVs) from the air using uncrewed aerial vehicles (UAVs). This inspiration enables robust uncrewed ground-air coordination where one or multiple UAVs effectively drive a group of UGVs towards a goal. Developing artificial intelligence (AI) agents for ASGV is a non-trivial task due to the sub-tasks, multiple skills, and their non-linear interaction required to synthesise a solution. One approach to developing AI agents is Imitation learning (IL), where humans demonstrate the task to the machine. However, gathering human data from complex tasks in human-swarm interaction (HSI) requires the human to perform the entire job, which could lead to unexpected errors caused by a lack of control skills and human workload due to the length and complexity of ASGV.
We hypothesise that we can bootstrap the overall task by collecting human data from simpler sub-tasks to limit errors and workload for humans. Therefore, this thesis attempts to answer the primary research question of how to design IL algorithms for multiple agents. We propose a new learning scheme called Apprenticeship Bootstrapping (AB). In AB, the low-level behaviours of the shepherding agents are trained from human data using our proposed hierarchical IL algorithms. The high-level behaviours are then formed using a proposed gesture demonstration framework to collect human data from synthesising more complex controllers. The transferring mechanism is performed by aggregating the proposed IL algorithms.
Experiments are designed using a mixed environment, where the UAV flies in a simulated robotic Gazebo environment, while the UGVs are physical vehicles in a natural environment. A system is designed to allow switching between humans controlling the UAVs using low-level actions and humans controlling the UAVs using high-level actions. The former enables data collection for developing autonomous agents for sub-tasks. At the same time, in the latter, humans control the UAV by issuing commands that call the autonomous agents for the sub-tasks. We baseline the learnt agents against Str\"{o}mbom scripted behaviours and show that the system can successfully generate autonomous behaviours for ASGV
Communication for Teams of Networked Robots
There are a large class of problems, from search and rescue to environmental monitoring, that can benefit from teams of mobile robots in environments where there is no existing infrastructure for inter-agent communication. We seek to address the problems necessary for a team of small, low-power, low-cost robots to deploy in such a way that they can dynamically provide their own multi-hop communication network. To do so, we formulate a situational awareness problem statement that specifies both the physical task and end-to-end communication rates that must be maintained. In pursuit of a solution to this problem, we address topics ranging from the modeling of point-to-point wireless communication to mobility control for connectivity maintenance. Since our focus is on developing solutions to these problems that can be experimentally verified, we also detail the design and implantation of a decentralized testbed for multi-robot research. Experiments on this testbed allow us to determine data-driven models for point-to-point wireless channel prediction, test relative signal-strength-based localization methods, and to verify that our algorithms for mobility control maintain the desired instantaneous rates when routing through the wireless network. The tools we develop are integral to the fielding of teams of robots with robust wireless network capabilities
- …