12 research outputs found

    Learning Constrained Dynamics with Gauss Principle adhering Gaussian Processes

    Full text link
    The identification of the constrained dynamics of mechanical systems is often challenging. Learning methods promise to ease an analytical analysis, but require considerable amounts of data for training. We propose to combine insights from analytical mechanics with Gaussian process regression to improve the model's data efficiency and constraint integrity. The result is a Gaussian process model that incorporates a priori constraint knowledge such that its predictions adhere to Gauss' principle of least constraint. In return, predictions of the system's acceleration naturally respect potentially non-ideal (non-)holonomic equality constraints. As corollary results, our model enables to infer the acceleration of the unconstrained system from data of the constrained system and enables knowledge transfer between differing constraint configurations.Comment: To be published in 2nd Annual Conference on Learning for Dynamics and Control (L4DC), Proceedings of Machine Learning Research 202

    Controle por Modos Deslizantes de um Atuador Eletro-hidráulico com Compensação por Processo Gaussiano / Sliding Mode Control of an Electric-Hydraulic Actuator with Gaussian Process Compensation

    Get PDF
    O desenvolvimento de sistemas de controle precisos para atuadores eletro-hidráulicos depende de uma adequada compensação dos efeitos dinâmicos desconhecidos. Neste trabalho, um controlador por Modos Deslizantes é combinado com um compensador por Processo Gaussiano para proporcionar um adequado rastreamento de trajetória. Processo Gaussiano é uma conhecida estratégia de aprendizagem de máquinas que pode ser utilizada no reconhecimento de funções. As propriedades de convergência do sistema em malha fechada são analisadas pela Teoria de Estabilidade de Lyapunov. Resultados numéricos confirmam uma forte melhora no desempenho do controlador ao ser inserido o compensador proposto

    Probabilistic Recurrent State-Space Models

    Full text link
    State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification. Deterministic versions of SSMs (e.g. LSTMs) proved extremely successful in modeling complex time series data. Fully probabilistic SSMs, however, are often found hard to train, even for smaller problems. To overcome this limitation, we propose a novel model formulation and a scalable training algorithm based on doubly stochastic variational inference and Gaussian processes. In contrast to existing work, the proposed variational approximation allows one to fully capture the latent state temporal correlations. These correlations are the key to robust training. The effectiveness of the proposed PR-SSM is evaluated on a set of real-world benchmark datasets in comparison to state-of-the-art probabilistic model learning methods. Scalability and robustness are demonstrated on a high dimensional problem

    A survey on policy search algorithms for learning robot controllers in a handful of trials

    Get PDF
    Most policy search algorithms require thousands of training episodes to find an effective policy, which is often infeasible with a physical robot. This survey article focuses on the extreme other end of the spectrum: how can a robot adapt with only a handful of trials (a dozen) and a few minutes? By analogy with the word "big-data", we refer to this challenge as "micro-data reinforcement learning". We show that a first strategy is to leverage prior knowledge on the policy structure (e.g., dynamic movement primitives), on the policy parameters (e.g., demonstrations), or on the dynamics (e.g., simulators). A second strategy is to create data-driven surrogate models of the expected reward (e.g., Bayesian optimization) or the dynamical model (e.g., model-based policy search), so that the policy optimizer queries the model instead of the real system. Overall, all successful micro-data algorithms combine these two strategies by varying the kind of model and prior knowledge. The current scientific challenges essentially revolve around scaling up to complex robots (e.g., humanoids), designing generic priors, and optimizing the computing time.Comment: 21 pages, 3 figures, 4 algorithms, accepted at IEEE Transactions on Robotic
    corecore