Search CORE

30,229 research outputs found

Learning Sensor Feedback Models from Demonstrations via Phase-Modulated Neural Networks

Author: Meier Franziska
Schaal Stefan
Su Zhe
Sutanto Giovanni
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

In order to robustly execute a task under environmental uncertainty, a robot needs to be able to reactively adapt to changes arising in its environment. The environment changes are usually reflected in deviation from expected sensory traces. These deviations in sensory traces can be used to drive the motion adaptation, and for this purpose, a feedback model is required. The feedback model maps the deviations in sensory traces to the motion plan adaptation. In this paper, we develop a general data-driven framework for learning a feedback model from demonstrations. We utilize a variant of a radial basis function network structure --with movement phases as kernel centers-- which can generally be applied to represent any feedback models for movement primitives. To demonstrate the effectiveness of our framework, we test it on the task of scraping on a tilt board. In this task, we are learning a reactive policy in the form of orientation adaptation, based on deviations of tactile sensor traces. As a proof of concept of our method, we provide evaluations on an anthropomorphic robot. A video demonstrating our approach and its results can be seen in https://youtu.be/7Dx5imy1KcwComment: 8 pages, accepted to be published at the International Conference on Robotics and Automation (ICRA) 201

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Comparative evaluation of approaches in T.4.1-4.3 and working definition of adaptive module

Author: Ajallooeian Mostafa
Billard Aude
Carbajal Juan Pablo
Gay Sébastien
Ijspeert Auke
Khansari-Zadeh Mohammad
Kim Seungsu
Kuppuswamy Naveen
Lemme Andre
Neumann Gerhard
Reinhart Felix
Rolf Matthias
Rückert Elmar
Schrauwen Benjamin
Steil Jochen
Sumioka Hidenobu
Waegeman Tim
wyffels Francis
Zhao Qian
Publication venue
Publication date: 01/01/2010
Field of study

The goal of this deliverable is two-fold: (1) to present and compare different approaches towards learning and encoding movements us- ing dynamical systems that have been developed by the AMARSi partners (in the past during the first 6 months of the project), and (2) to analyze their suitability to be used as adaptive modules, i.e. as building blocks for the complete architecture that will be devel- oped in the project. The document presents a total of eight approaches, in two groups: modules for discrete movements (i.e. with a clear goal where the movement stops) and for rhythmic movements (i.e. which exhibit periodicity). The basic formulation of each approach is presented together with some illustrative simulation results. Key character- istics such as the type of dynamical behavior, learning algorithm, generalization properties, stability analysis are then discussed for each approach. We then make a comparative analysis of the different approaches by comparing these characteristics and discussing their suitability for the AMARSi project

Ghent University Academic Bibliography

Towards a learning-theoretic analysis of spike-timing dependent plasticity

Author: Balduzzi David
Besserve Michel
Publication venue
Publication date: 01/01/2012
Field of study

This paper suggests a learning-theoretic perspective on how synaptic plasticity benefits global brain functioning. We introduce a model, the selectron, that (i) arises as the fast time constant limit of leaky integrate-and-fire neurons equipped with spiking timing dependent plasticity (STDP) and (ii) is amenable to theoretical analysis. We show that the selectron encodes reward estimates into spikes and that an error bound on spikes is controlled by a spiking margin and the sum of synaptic weights. Moreover, the efficacy of spikes (their usefulness to other reward maximizing selectrons) also depends on total synaptic strength. Finally, based on our analysis, we propose a regularized version of STDP, and show the regularization improves the robustness of neuronal learning when faced with multiple stimuli.Comment: To appear in Adv. Neural Inf. Proc. System

arXiv.org e-Print Archive

MPG.PuRe

In silico case studies of compliant robots: AMARSI deliverable 3.3

Author: Alessandro Cristiano
Caluwaerts Ken
Carbajal Juan Pablo
Dhaene Michiel
d’Avella Andrea
Ivanenko Yuri
Martinez Salazar Harold Roberto
Nakajima Kohei
Pfeifer Rolf
Schrauwen Benjamin
Sumioka Hidenobu
Verstraeten David
Yu Xiaoxiang
Zhao Qian
Publication venue
Publication date: 01/01/2014
Field of study

In the deliverable 3.2 we presented how the morphological computing ap- proach can significantly facilitate the control strategy in several scenarios, e.g. quadruped locomotion, bipedal locomotion and reaching. In particular, the Kitty experimental platform is an example of the use of morphological computation to allow quadruped locomotion. In this deliverable we continue with the simulation studies on the application of the different morphological computation strategies to control a robotic system

Ghent University Academic Bibliography

Recurrent cerebellar architecture solves the motor-error problem

Author: Brindley G. S.
James V. Stone
John Porrill
Markert G.
Nicolelis M. A. L.
Paul Dean
Publication venue: 'The Royal Society'
Publication date: 01/01/2004
Field of study

Current views of cerebellar function have been heavily influenced by the models of Marr and Albus, who suggested that the climbing fibre input to the cerebellum acts as a teaching signal for motor learning. It is commonly assumed that this teaching signal must be motor error (the difference between actual and correct motor command), but this approach requires complex neural structures to estimate unobservable motor error from its observed sensory consequences. We have proposed elsewhere a recurrent decorrelation control architecture in which Marr-Albus models learn without requiring motor error. Here, we prove convergence for this architecture and demonstrate important advantages for the modular control of systems with multiple degrees of freedom. These results are illustrated by modelling adaptive plant compensation for the three-dimensional vestibular ocular reflex. This provides a functional role for recurrent cerebellar connectivity, which may be a generic anatomical feature of projections between regions of cerebral and cerebellar cortex

CiteSeerX

Crossref

PubMed Central

White Rose Research Online