Search CORE

12 research outputs found

Learning Constrained Dynamics with Gauss Principle adhering Gaussian Processes

Author: Geist A. Rene
Trimpe Sebastian
Publication venue
Publication date: 01/01/2020
Field of study

The identification of the constrained dynamics of mechanical systems is often challenging. Learning methods promise to ease an analytical analysis, but require considerable amounts of data for training. We propose to combine insights from analytical mechanics with Gaussian process regression to improve the model's data efficiency and constraint integrity. The result is a Gaussian process model that incorporates a priori constraint knowledge such that its predictions adhere to Gauss' principle of least constraint. In return, predictions of the system's acceleration naturally respect potentially non-ideal (non-)holonomic equality constraints. As corollary results, our model enables to infer the acceleration of the unconstrained system from data of the constrained system and enables knowledge transfer between differing constraint configurations.Comment: To be published in 2nd Annual Conference on Learning for Dynamics and Control (L4DC), Proceedings of Machine Learning Research 202

arXiv.org e-Print Archive

MPG.PuRe

Controle por Modos Deslizantes de um Atuador Eletro-hidráulico com Compensação por Processo Gaussiano / Sliding Mode Control of an Electric-Hydraulic Actuator with Gaussian Process Compensation

Author: Bessa Wallace Moreira
Lima Gabriel da Silva
Publication venue: Brazilian Journals Publicações de Periódicos e Editora Ltda.
Publication date: 27/05/2020
Field of study

O desenvolvimento de sistemas de controle precisos para atuadores eletro-hidráulicos depende de uma adequada compensação dos efeitos dinâmicos desconhecidos. Neste trabalho, um controlador por Modos Deslizantes é combinado com um compensador por Processo Gaussiano para proporcionar um adequado rastreamento de trajetória. Processo Gaussiano é uma conhecida estratégia de aprendizagem de máquinas que pode ser utilizada no reconhecimento de funções. As propriedades de convergência do sistema em malha fechada são analisadas pela Teoria de Estabilidade de Lyapunov. Resultados numéricos confirmam uma forte melhora no desempenho do controlador ao ser inserido o compensador proposto

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Brazilian Journals

Probabilistic Recurrent State-Space Models

Author: Daniel Christian
Doerr Andreas
Nguyen-Tuong Duy
Schaal Stefan
Schiegg Martin
Toussaint Marc
Trimpe Sebastian
Publication venue
Publication date: 01/01/2018
Field of study

State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification. Deterministic versions of SSMs (e.g. LSTMs) proved extremely successful in modeling complex time series data. Fully probabilistic SSMs, however, are often found hard to train, even for smaller problems. To overcome this limitation, we propose a novel model formulation and a scalable training algorithm based on doubly stochastic variational inference and Gaussian processes. In contrast to existing work, the proposed variational approximation allows one to fully capture the latent state temporal correlations. These correlations are the key to robust training. The effectiveness of the proposed PR-SSM is evaluated on a set of real-world benchmark datasets in comparison to state-of-the-art probabilistic model learning methods. Scalability and robustness are demonstrated on a high dimensional problem

arXiv.org e-Print Archive

MPG.PuRe

A survey on policy search algorithms for learning robot controllers in a handful of trials

Author: Calinon Sylvain
Chatzilygeroudis Konstantinos
Mouret Jean-Baptiste
Stulp Freek
Vassiliades Vassilis
Publication venue
Publication date: 04/12/2019
Field of study

Most policy search algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning". We show that a first strategy is to leverage prior knowledge on the policy structure (e.g., dynamic movement primitives), on the policy parameters (e.g., demonstrations), or on the dynamics (e.g., simulators). A second strategy is to create data-driven surrogate models of the expected reward (e.g., Bayesian optimization) or the dynamical model (e.g., model-based policy search), so that the policy optimizer queries the model instead of the real system. Overall, all successful micro-data algorithms combine these two strategies by varying the kind of model and prior knowledge. The current scientific challenges essentially revolve around scaling up to complex robots (e.g., humanoids), designing generic priors, and optimizing the computing time.Comment: 21 pages, 3 figures, 4 algorithms, accepted at IEEE Transactions on Robotic

arXiv.org e-Print Archive

Institute of Transport Research:Publications

ZENODO

INRIA a CCSD electronic archive server

HAL Descartes

HAL-Rennes 1