Search CORE

1,512 research outputs found

End-to-end Reinforcement Learning for Online Coverage Path Planning in Unknown Environments

Author: Felsberg Michael
Jonnarth Arvi
Zhao Jie
Publication venue
Publication date: 29/06/2023
Field of study

Coverage path planning is the problem of finding the shortest path that covers the entire free space of a given confined area, with applications ranging from robotic lawn mowing and vacuum cleaning, to demining and search-and-rescue tasks. While offline methods can find provably complete, and in some cases optimal, paths for known environments, their value is limited in online scenarios where the environment is not known beforehand, especially in the presence of non-static obstacles. We propose an end-to-end reinforcement learning-based approach in continuous state and action space, for the online coverage path planning problem that can handle unknown environments. We construct the observation space from both global maps and local sensory inputs, allowing the agent to plan a long-term path, and simultaneously act on short-term obstacle detections. To account for large-scale environments, we propose to use a multi-scale map input representation. Furthermore, we propose a novel total variation reward term for eliminating thin strips of uncovered space in the learned path. To validate the effectiveness of our approach, we perform extensive experiments in simulation with a distance sensor, surpassing the performance of a recent reinforcement learning-based approach

arXiv.org e-Print Archive

Development and Evaluation of Sensor Concepts for Ageless Aerospace Vehicles: Report 6 - Development and Demonstration of a Self-Organizing Diagnostic System for Structural Health Monitoring

Author: Batten Adam
Edwards Graeme
Gerasimov Vadim
Hoschke Nigel
Isaacs Peter
Lewis Chris
Moore Richard
Oppolzer Florien
Price Don
Prokopenko Mikhail
Scott Andrew
Wang Peter
Publication venue
Publication date
Field of study

This report describes a significant advance in the capability of the CSIRO/NASA structural health monitoring Concept Demonstrator (CD). The main thrust of the work has been the development of a mobile robotic agent, and the hardware and software modifications and developments required to enable the demonstrator to operate as a single, self-organizing, multi-agent system. This single-robot system is seen as the forerunner of a system in which larger numbers of small robots perform inspection and repair tasks cooperatively, by self-organization. While the goal of demonstrating self-organized damage diagnosis was not fully achieved in the time available, much of the work required for the final element that enables the robot to point the video camera and transmit an image has been completed. A demonstration video of the CD and robotic systems operating will be made and forwarded to NASA

Deep Reinforcement Learning Methods for Cooperative Robotic Navigation

Author: José Pedro Ferreira Pinheiro de Carvalho
Publication venue
Publication date: 11/10/2023
Field of study

Multiscale computation and dynamic attention in biological and artificial intelligence

Author: Akaishi Rei
Badman Ryan Paul
Hills Thomas Trenholm
Publication venue: 'MDPI AG'
Publication date: 20/06/2020
Field of study

Biological and artificial intelligence (AI) are often defined by their capacity to achieve a hierarchy of short-term and long-term goals that require incorporating information over time and space at both local and global scales. More advanced forms of this capacity involve the adaptive modulation of integration across scales, which resolve computational inefficiency and explore-exploit dilemmas at the same time. Research in neuroscience and AI have both made progress towards understanding architectures that achieve this. Insight into biological computations come from phenomena such as decision inertia, habit formation, information search, risky choices and foraging. Across these domains, the brain is equipped with mechanisms (such as the dorsal anterior cingulate and dorsolateral prefrontal cortex) that can represent and modulate across scales, both with top-down control processes and by local to global consolidation as information progresses from sensory to prefrontal areas. Paralleling these biological architectures, progress in AI is marked by innovations in dynamic multiscale modulation, moving from recurrent and convolutional neural networks—with fixed scalings—to attention, transformers, dynamic convolutions, and consciousness priors—which modulate scale to input and increase scale breadth. The use and development of these multiscale innovations in robotic agents, game AI, and natural language processing (NLP) are pushing the boundaries of AI achievements. By juxtaposing biological and artificial intelligence, the present work underscores the critical importance of multiscale processing to general intelligence, as well as highlighting innovations and differences between the future of biological and artificial intelligence

Warwick Research Archives Portal Repository

Born to learn: The inspiration, progress, and future of evolved plastic artificial neural networks

Author: Abbott
Abraham
Abraham
Alexander
Allis
Alpaydin
Anderson
Andrea Soltoggio
Angeline
Arifovic
Arnold
Arnold
Ay
Bailey
Baldwin
Baxter
Baxter
Bear
Beer
Bengio
Bengio
Bentley
Best
Bienenstock
Birmingham
Blynel
Boers
Bourlard
Brown
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Butz
Butz
Bäck
Cabessa
Carew
Carlson
Carver
Cervier
Chklovskii
Clark
Cliff
Clutton-Brock
Coleman
Cooper
Damasio
Darwin
Dawkins
Deary
Deng
de Vladar
Di Paolo
Di Paolo
Dobzhansky
Doidge
Downing
Downing
Doya
Doya
Draganski
Dudai
Durr
Edelman
Eiben
Ellefsen
Ellefsen
Eskridge
Fellous
Fernando
Fernando
Finnie
Floreano
Floreano
Floreano
Floreano
Floreano
Floreano
Fogel
Fontanari
Friston
Fujii
Funahashi
Gerstner
Gerstner
Goldberg
Goodfellow
Graves
Greve
Grossberg
Gruau
Gu
Gustafsson
Hansen
Happel
Harrington
Harris-Warrick
Harvey
Hasselmo
Hasselmo
Hasselmo
Hawkins
Hebb
Hensch
Hinton
Hinton
Hinton
Hochreiter
Hoinville
Holland
Holland
Hopkins
Hornby
Howard
Howard
Howard
Hull
Husbands
Izhikevich
Izhikevich
Jaeger
Jo
Kaas
Kandel
Kandel
Katz
Katz
Katz
Kenneth O. Stanley
Khan
Khan
Khan
Khan
Kirkpatrick
Kiyota
Klug
Kohonen
Kohonen
Kolb
Kolb
Kondo
Koutník
Koza
Krichmar
Krizhevsky
Kumaran
Kupfermann
Køppe
Lake
Lalejini
Lamprecht
Langton
Lanzi
LeCun
LeDoux
Lee
Lehman
Lehman
Lehman
Levy
Lüders
Lüders
Maass
Maniadakis
Marder
Marder
Markram
Mattiussi
Mayley
McClelland
McQuesten
Meng
Merzenich
Michalewicz
Michalski
Miconi
Miller
Miller
Millington
Monroe
Mouret
Murre
Niv
Nogueira
Nogueira
Nolfi
Nolfi
Nolfi
Nolfi
Norouzzadeh
Offerman
Oja
Oquab
Orchard
Ormrod
Pan
Pascual-Leone
Pavlov
Pehlevan
Pigliucci
Rauschecker
Rawal
Rescorla
Risi
Risi
Risi
Risi
Risi
Roberts
Robins
Roff
Rolls
Rumelhart
Russell
Russo
Sanchez
Schmidhuber
Schmidhuber
Schmidhuber
Schmidhuber
Schreiweis
Schroll
Schultz
Schultz
Schultz
Sebastian Risi
Silva
Silva
Silver
Sims
Sims
Sipper
Skinner
Skinner
Smith
Smith
Smith
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Sporns
Staddon
Stanley
Stanley
Stanley
Stanley
Stanley
Steels
Stone
Suri
Suri
Sutton
Taylor
Tessier-Lavigne
Thorndike
Thrun
Thrun
Thrun
Tonelli
Tonelli
Urzelai
Varela
Venkatesh
Vitay
Wagner
Walters
Widrow
Willshaw
Yamauchi
Yao
Yao
Yoder
Young
Ziemke
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018
Field of study

Biological plastic neural networks are systems of extraordinary computational capabilities shaped by evolution, development, and lifetime learning. The interplay of these elements leads to the emergence of adaptive behavior and intelligence. Inspired by such intricate natural phenomena, Evolved Plastic Artificial Neural Networks (EPANNs) use simulated evolution in-silico to breed plastic neural networks with a large variety of dynamics, architectures, and plasticity rules: these artificial systems are composed of inputs, outputs, and plastic components that change in response to experiences in an environment. These systems may autonomously discover novel adaptive algorithms, and lead to hypotheses on the emergence of biological adaptation. EPANNs have seen considerable progress over the last two decades. Current scientific and technological advances in artificial neural networks are now setting the conditions for radically new approaches and results. In particular, the limitations of hand-designed networks could be overcome by more flexible and innovative solutions. This paper brings together a variety of inspiring ideas that define the field of EPANNs. The main methods and results are reviewed. Finally, new opportunities and developments are presented

arXiv.org e-Print Archive

Environment Search Planning Subject to High Robot Localization Uncertainty

Author: Beard Jared Joseph
Publication venue: The Research Repository @ WVU
Publication date: 01/01/2020
Field of study

As robots find applications in more complex roles, ranging from search and rescue to healthcare and services, they must be robust to greater levels of localization uncertainty and uncertainty about their environments. Without consideration for such uncertainties, robots will not be able to compensate accordingly, potentially leading to mission failure or injury to bystanders. This work addresses the task of searching a 2D area while reducing localization uncertainty. Wherein, the environment provides low uncertainty pose updates from beacons with a short range, covering only part of the environment. Otherwise the robot localizes using dead reckoning, relying on wheel encoder and yaw rate information from a gyroscope. As such, outside of the regions with position updates, there will be unconstrained localization error growth over time. The work contributes a Belief Markov Decision Process formulation for solving the search problem and evaluates the performance using Partially Observable Monte Carlo Planning (POMCP). Additionally, the work contributes an approximate Markov Decision Process formulation and reduced complexity state representation. The approximate problem is evaluated using value iteration. To provide a baseline, the Google OR-Tools package is used to solve the travelling salesman problem (TSP). Results are verified by simulating a differential drive robot in the Gazebo simulation environment. POMCP results indicate planning can be tuned to prioritize constraining uncertainty at the cost of increasing path length. The MDP formulation provides consistently lower uncertainty with minimal increases in path length over the TSP solution. Both formulations show improved coverage outcomes

The Research Repository @ WVU (West Virginia University)

Aerial Remote Sensing in Agriculture: A Practical Approach to Area Coverage and Path Planning for Fleets of Mini Aerial Robots

Author: Barrientos Cruz Antonio
Cerro Giner Jaime del
Colorado Montaño Julián
Coral Cuellar William
Martínez Álvarez Alexander
Pereira Valente Joao Ricardo
Publication venue: E.T.S.I. Industriales (UPM)
Publication date: 01/01/2011
Field of study

In this paper, a system that allows applying precision agriculture techniques is described. The application is based on the deployment of a team of unmanned aerial vehicles that are able to take georeferenced pictures in order to create a full map by applying mosaicking procedures for postprocessing. The main contribution of this work is practical experimentation with an integrated tool. Contributions in different fields are also reported. Among them is a new one-phase automatic task partitioning manager, which is based on negotiation among the aerial vehicles, considering their state and capabilities. Once the individual tasks are assigned, an optimal path planning algorithm is in charge of determining the best path for each vehicle to follow. Also, a robust flight control based on the use of a control law that improves the maneuverability of the quadrotors has been designed. A set of field tests was performed in order to analyze all the capabilities of the system, from task negotiations to final performance. These experiments also allowed testing control robustness under different weather conditions

Force and impedance control for hydraulically driven hexapod robot walking on uneven terrain

Author: Ilea Marioara
Mărghitaș Liviu Alexandru
Popovici Anca Aurora
Publication venue
Publication date: 01/04/2012
Field of study

A variety approach of multi-legged robot designs, especially on a large scale design with hydraulically driven actuators exist, but most of it still unsolved and used primitive techniques on control solutions. This made this area of research still far from demonstrating the scientific solutions, which is more towards developing and optimizing the algorithm, control technique and software engineering for practical locomotion (flexibility and reliability). Therefore in this thesis,the study is done to propose two categories of solution for statically stable and hydraulically driven hexapod robot, named COMET-IV, which are dynamic walking trajectory generation and force/impedance control implementation (during body start patching), in order to solve the stability problems (horizontal) that encountered when walking on extremely uneven terrains.Only three sensors are used for control feedback; potentiometers (each leg joint), pressure sensors (hydraulic cylinders) and attitude sensor (center of body). For dynamic walking trajectory generation, the fixed/determined of tripod walking trajectory is modified with force threshold-based, named as environment trailed trajectory (ETT),on each first step of foot during support phase (preliminary sensing uneven terrain surfaces). Moreover,the proposed dynamic trajectory generation is then upgraded with capability of omni-directional walking with a proposed center of body rotational-based method. The instability of using the ETT module alone and with proposed hybrid force/position control in the previous progress, during body patching on walking session is then solved using the proposed pull-back position-based force control (PPF). PPF controller is derived from the ETT module itself and supported by proposed compliant (switching) mechanism, logical attitude control and dynamic swing rising control. The limitation of PPF controller applied with ETT module for walking on uneven terrain contains extreme soft surface makes the study narrowed to the impedance control approaches as a replacement of PPF controller. Three new adaptive impedance controller are designed and proposed: Optimal single leg impedance control based on body inertia, Optimal center of mass—based impedance control based on body inertia and Single leg impedance control with self-tuning stiffness. To reduce the hard swinging/shaking of the robot's body in motion that arise after applying the proposed impedance controllers, fuzzy logic control via Takagaki-Sugeno-Kang (TSK) model is proposed to be cascaded on the input feedback of the controller.The study has verified the effectiveness of both categories of control unit (dynamic trajectory,force controller and impedance controllers) combination throughout several experiments of COMET-IV walking on uneven/unstructured terrains

Directory of Open Access Journals

UMP Institutional Repository