Search CORE

377 research outputs found

Online Learning of a Memory for Learning Rates

Author: Kappler Daniel
Meier Franziska
Schaal Stefan
Publication venue
Publication date: 01/01/2018
Field of study

The promise of learning to learn for robotics rests on the hope that by extracting some information about the learning process itself we can speed up subsequent similar learning tasks. Here, we introduce a computationally efficient online meta-learning algorithm that builds and optimizes a memory model of the optimal learning rate landscape from previously observed gradient behaviors. While performing task specific optimization, this memory of learning rates predicts how to scale currently observed gradients. After applying the gradient scaling our meta-learner updates its internal memory based on the observed effect its prediction had. Our meta-learner can be combined with any gradient-based optimizer, learns on the fly and can be transferred to new optimization tasks. In our evaluations we show that our meta-learning algorithm speeds up learning of MNIST classification and a variety of learning control tasks, either in batch or online learning settings.Comment: accepted to ICRA 2018, code available: https://github.com/fmeier/online-meta-learning ; video pitch available: https://youtu.be/9PzQ25FPPO

arXiv.org e-Print Archive

MPG.PuRe

Präoperatives sensomotorisches Training : Einfluss auf die postoperative Gleichgewichts- und körperliche Funktionsfähigkeit

Author: Meier Franziska
Publication venue: ZHAW Zürcher Hochschule für Angewandte Wissenschaften
Publication date: 01/01/2012
Field of study

ZHAW digitalcollection

Learning Sensor Feedback Models from Demonstrations via Phase-Modulated Neural Networks

Author: Meier Franziska
Schaal Stefan
Su Zhe
Sutanto Giovanni
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

In order to robustly execute a task under environmental uncertainty, a robot needs to be able to reactively adapt to changes arising in its environment. The environment changes are usually reflected in deviation from expected sensory traces. These deviations in sensory traces can be used to drive the motion adaptation, and for this purpose, a feedback model is required. The feedback model maps the deviations in sensory traces to the motion plan adaptation. In this paper, we develop a general data-driven framework for learning a feedback model from demonstrations. We utilize a variant of a radial basis function network structure --with movement phases as kernel centers-- which can generally be applied to represent any feedback models for movement primitives. To demonstrate the effectiveness of our framework, we test it on the task of scraping on a tilt board. In this task, we are learning a reactive policy in the form of orientation adaptation, based on deviations of tactile sensor traces. As a proof of concept of our method, we provide evaluations on an anthropomorphic robot. A video demonstrating our approach and its results can be seen in https://youtu.be/7Dx5imy1KcwComment: 8 pages, accepted to be published at the International Conference on Robotics and Automation (ICRA) 201

arXiv.org e-Print Archive

Crossref

MPG.PuRe

A New Data Source for Inverse Dynamics Learning

Author: Kappler Daniel
Meier Franziska
Ratliff Nathan
Schaal Stefan
Publication venue
Publication date: 01/01/2017
Field of study

Modern robotics is gravitating toward increasingly collaborative human robot interaction. Tools such as acceleration policies can naturally support the realization of reactive, adaptive, and compliant robots. These tools require us to model the system dynamics accurately -- a difficult task. The fundamental problem remains that simulation and reality diverge--we do not know how to accurately change a robot's state. Thus, recent research on improving inverse dynamics models has been focused on making use of machine learning techniques. Traditional learning techniques train on the actual realized accelerations, instead of the policy's desired accelerations, which is an indirect data source. Here we show how an additional training signal -- measured at the desired accelerations -- can be derived from a feedback control signal. This effectively creates a second data source for learning inverse dynamics models. Furthermore, we show how both the traditional and this new data source, can be used to train task-specific models of the inverse dynamics, when used independently or combined. We analyze the use of both data sources in simulation and demonstrate its effectiveness on a real-world robotic platform. We show that our system incrementally improves the learned inverse dynamics model, and when using both data sources combined converges more consistently and faster.Comment: IROS 201

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Learning Feedback Terms for Reactive Planning and Control

Author: Meier Franziska
Rai Akshara
Schaal Stefan
Sutanto Giovanni
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 03/03/2017
Field of study

With the advancement of robotics, machine learning, and machine perception, increasingly more robots will enter human environments to assist with daily tasks. However, dynamically-changing human environments requires reactive motion plans. Reactivity can be accomplished through replanning, e.g. model-predictive control, or through a reactive feedback policy that modifies on-going behavior in response to sensory events. In this paper, we investigate how to use machine learning to add reactivity to a previously learned nominal skilled behavior. We approach this by learning a reactive modification term for movement plans represented by nonlinear differential equations. In particular, we use dynamic movement primitives (DMPs) to represent a skill and a neural network to learn a reactive policy from human demonstrations. We use the well explored domain of obstacle avoidance for robot manipulation as a test bed. Our approach demonstrates how a neural network can be combined with physical insights to ensure robust behavior across different obstacle settings and movement durations. Evaluations on an anthropomorphic robotic system demonstrate the effectiveness of our work.Comment: 8 pages, accepted to be published at ICRA 2017 conferenc

arXiv.org e-Print Archive

Crossref

Myristoyl CoA:Protein N-Myristoyl Transferase: A Target for a Novel Antimalarial Drug

Author: Meier Franziska
Meier Franziska
Publication venue: Department of Chemistry, Imperial College London
Publication date: 01/10/2009
Field of study

Malaria, an illness caused by protozoan parasites of the genus Plasmodium, continues to be a key global health issue; around 40% of the world’s population are at risk and more than one million people are killed each year according to the World Health Organisation (WHO). It is transmitted via bites of infected female mosquitoes (Anopheles) and its severest form, falciparum malaria, can lead to death if left untreated. Effective malarial treatment is complex due to drug resistance and socioeconomic issues in many of the most affected areas. An enzyme from the parasite, myristoyl CoA:protein N-myristoyl transferase (NMT), has been identified as a potential target for antimalarial drugs. N-Myristoyl transferase, which catalyses the co-translational transfer of myristic acid to an N-terminal glycine of certain substrate proteins, has been shown to be essential for various pathogens. This thesis demonstrates the design, synthesis and analysis of potential inhibitors of Plasmodium falciparum NMT. Approximately 50 inhibitors with systematic variations based on a benzothiazole scaffold have been synthesised. It is known that these benzothiazoles compete with binding of peptide substrate within the NMT enzyme binding cleft. Differences between the peptide binding pockets of P. falciparum and human NMTs were exploited to design effective and selective new antimalarial treatments. The level of inhibition was measured using SPA that monitors the transfer of 3H-labelled myristoyl CoA to the N-terminus of a polypeptide substrate. A plot of enzyme activity as a function of inhibitor concentration gave inhibition curves from which IC50-values were derived. In vitro tests resulted in four hits with improved activity in the low micromolar region against P. falciparum NMT compared to the lead compound. Nevertheless, the inhibitors were not exceptionally selective over Homo sapiens NMT with an IC50 in the low micromolar region also. Selections of the most promising inhibitors have been tested in vivo and considerable reductions in parasitemia were noted

Spiral - Imperial College Digital Repository