16,078 research outputs found

Intrinsic Motivation and Mental Replay enable Efficient Online Adaptation in Stochastic Recurrent Networks

Author: Peters Jan
Rueckert Elmar
Tanneberg Daniel
Publication venue: 'Elsevier BV'
Publication date: 23/10/2018

Autonomous robots need to interact with unknown, unstructured and changing environments, constantly facing novel challenges. Therefore, continuous online adaptation for lifelong learning and sample-efficient mechanisms for adapting to changes in the environment, the constraints, the tasks, or the robot itself are crucial. In this work, we propose a novel framework for probabilistic online motion planning with online adaptation based on a bio-inspired stochastic recurrent neural network. By using learning signals which mimic the intrinsic motivation signal cognitive dissonance, in combination with a mental replay strategy to intensify experiences, the stochastic recurrent network can learn from few physical interactions and adapt to novel environments in seconds. We evaluate our online planning and adaptation framework on an anthropomorphic KUKA LWR arm. The rapid online adaptation is shown by learning unknown workspace constraints sample-efficiently from few physical interactions while following given waypoints. Comment: accepted in Neural Networks

arXiv.org e-Print Archive

TUbiblio

MPG.PuRe

Multi-robot team formation control in the GUARDIANS project

Author: Alboul Lyuba
Nomdedeu Leo
Penders Jacques
Saez-Pons Joan
Publication venue: 'Emerald'
Publication date: 05/06/2010

Purpose: The GUARDIANS multi-robot team is to be deployed in a large, smoke-filled warehouse. The team is to assist firefighters in searching the warehouse in the event or danger of a fire. The large dimensions of the environment, together with the development of smoke, which drastically reduces visibility, represent major challenges for search and rescue operations. The GUARDIANS robots guide and accompany the firefighters on site whilst indicating possible obstacles and the locations of danger and maintaining communication links. Design/methodology/approach: In order to fulfil the aforementioned tasks, the robots need to exhibit certain behaviours. Among the basic behaviours are capabilities to stay together as a group, that is, to generate a formation and navigate while keeping this formation. The control model used to generate these behaviours is based on the so-called social potential field framework, which we adapt to the specific tasks required for the GUARDIANS scenario. All tasks can be achieved without central control, and some of the behaviours can be performed without explicit communication between the robots. Findings: The GUARDIANS environment requires flexible formations of the robot team: the formation has to adapt itself to the circumstances. Thus the application has forced us to redefine the concept of a formation. Using graph-theoretic terminology, we can say that a formation may be stretched out as a path or be compact as a star or wheel. We have implemented the developed behaviours in simulation environments as well as on real ERA-MOBI robots, commonly referred to as Erratics. We discuss advantages and shortcomings of our model, based on the simulations as well as on the implementation with a team of Erratics.

Crossref

Sheffield Hallam University Research Archive

Learning obstacle avoidance with an operant behavioral model

Author: Gutnisky D. A.
Zanutto Bonifacio Silvano
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2004

Artificial intelligence researchers have been attracted by the idea of having robots learn how to accomplish a task, rather than being told explicitly. Reinforcement learning has been proposed as an appealing framework for controlling mobile agents. Robot learning research, like research in biological systems, faces many similar problems in achieving high flexibility across a variety of tasks. In this work, the control of a vehicle in an avoidance task by a previously developed operant learning model (a form of animal learning) is studied. An environment is simulated in which a mobile robot with proximity sensors has to minimize the punishment for colliding with obstacles. The results were compared with the Q-Learning algorithm, and the proposed model performed better. In this way, a new artificial intelligence agent inspired by neurobiology, psychology, and ethology research is proposed.

Affiliation: Gutnisky, D. A. Universidad de Buenos Aires. Facultad de Ingeniería. Instituto de Ingeniería Biomédica; Argentina
Affiliation: Zanutto, Bonifacio Silvano. Consejo Nacional de Investigaciones Científicas y Técnicas. Instituto de Biología y Medicina Experimental. Fundación de Instituto de Biología y Medicina Experimental. Instituto de Biología y Medicina Experimental; Argentina. Universidad de Buenos Aires. Facultad de Ingeniería. Instituto de Ingeniería Biomédica; Argentina

CONICET Digital

Adaptive planning for distributed systems using goal accomplishment tracking

Author: Lee K
Mann G
Small N
Publication venue: Australian Computer Society
Publication date: 01/01/2015

Goal accomplishment tracking is the process of monitoring the progress of a task or series of tasks towards completing a goal. It is used to monitor goal progress in a variety of domains, including workflow processing, teleoperation and industrial manufacturing. In practice, it involves the constant monitoring of task execution, analysis of this data to determine task progress, and notification of interested parties. This information is usually used passively to observe goal progress. However, responding to this information may prevent goal failures. In addition, responding proactively and opportunistically can lead to goals being completed faster. This paper proposes an architecture to support the adaptive planning of tasks for fault tolerance or opportunistic task execution based on goal accomplishment tracking. It argues that dramatically increased performance can be gained by monitoring task execution and altering plans dynamically.

CiteSeerX

Deakin Research Online

Nottingham Trent Institutional Repository (IRep)

Research Repository

Robotic Wireless Sensor Networks

Author: A Balasubramanian
A Chattopadhyay
A Fida
A Gasparri
A Ghaffarkhah
A Gonzalez-Ruiz
A Sanfeliu
A Tiderko
AM Hsieh
AM Ladd
B Mohar
C Dixon
C Lochert
CE Perkins
CR Lin
D Calkins
D Son
D Tardioli
DV Dimarogonas
E Şahin
E Prassler
EA Thompson
EW Dijkstra
F Knorn
FT Dagefu
G Holland
G Sun
G Tuna
H Choset
HG Nguyen
Hui Liu
I Guvenc
J Baber
J Cortes
J Fink
J Ny Le
J Penders
J Zhou
JB Petelin
JC Curlander
JG Dai
JK Erickson
JR Pinta De La
K Fall
K Kamei
K Konolige
K Savla
KA Qaraqe
L Oliveira
L Sabattini
L Tassiulas
LE Parker
M Fiedler
M Franceschelli
M Guo
M Malmirchegini
M Mauve
M Michael
M Naghshvar
M Saumitra
M Schuresko
MA Batalin
MA Hsieh
MD Weiss
N Bezzo
N Boillot
N Hazon
N Xiong
NP Papanikolopoulos
O Tekdas
P Brass
P Wilke
P Yang
P. Ibach
Pradipta Ghosh
PX Liu
Q Dong
R Olfati-Saber
RC Arkin
RR Murphy
S Depatla
S Gil
S Manfredi
S Thrun
S Wang
SJ Lee
SR Theodore
T Gustavi
V Milanés
Y Mostofi
Y Uchimura
Y Yan
Z Lin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/09/2018

In this chapter, we present a literature survey of an emerging, cutting-edge, and multi-disciplinary field of research at the intersection of Robotics and Wireless Sensor Networks (WSN), which we refer to as Robotic Wireless Sensor Networks (RWSN). We define an RWSN as an autonomous networked multi-robot system that aims to achieve certain sensing goals while meeting and maintaining certain communication performance requirements, through cooperative control, learning and adaptation. While both of the component areas, i.e., Robotics and WSN, are well known and well explored, there exists a whole set of new opportunities and research directions at the intersection of these two fields which are relatively or even completely unexplored. One such example would be the use of a set of robotic routers to set up a temporary communication path between a sender and a receiver, using controlled mobility to the advantage of packet routing. We find that only a limited number of articles can be directly categorized as RWSN-related works, whereas a range of articles in the robotics and WSN literature are also relevant to this new field of research. To connect the dots, we first identify the core problems and research trends related to RWSN, such as connectivity, localization, routing, and robust flow of information. Next, we classify the existing research on RWSN, as well as the relevant state of the art from the robotics and WSN communities, according to the problems and trends identified in the first step. Lastly, we analyze what is missing in the existing literature, and identify topics that require more research attention in the future.

arXiv.org e-Print Archive

Crossref