
    A survey of motion planning techniques for humanoid robots


    Fast interpolation and time-optimization with contact


    Learning with opponent-learning awareness

    Multi-agent settings are quickly gathering importance in machine learning. This includes a plethora of recent work on deep multi-agent reinforcement learning, but also extends to hierarchical reinforcement learning, generative adversarial networks and decentralised optimization. In all these settings the presence of multiple learning agents renders the training problem non-stationary and often leads to unstable training or undesired final results. We present Learning with Opponent-Learning Awareness (LOLA), a method in which each agent shapes the anticipated learning of the other agents in the environment. The LOLA learning rule includes an additional term that accounts for the impact of one agent's policy on the anticipated parameter update of the other agents. Preliminary results show that the encounter of two LOLA agents leads to the emergence of tit-for-tat and therefore cooperation in the iterated prisoners' dilemma (IPD), while independent learning does not. In this domain, LOLA also receives higher payouts compared to a naive learner, and is robust against exploitation by higher-order gradient-based methods. Applied to infinitely repeated matching pennies, LOLA agents converge to the Nash equilibrium. In a round-robin tournament we show that LOLA agents can successfully shape the learning of a range of multi-agent learning algorithms from the literature, resulting in the highest average returns on the IPD. We also show that the LOLA update rule can be efficiently calculated using an extension of the likelihood ratio policy gradient estimator, making the method suitable for model-free reinforcement learning. This method thus scales to large parameter and input spaces and nonlinear function approximators. We also apply LOLA to a grid-world task with an embedded social dilemma using deep recurrent policies and opponent modelling. Again, by explicitly considering the learning of the other agent, LOLA agents learn to cooperate out of self-interest.
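    The core of the LOLA rule can be illustrated on a toy two-player differentiable game rather than the paper's IPD experiments. In this minimal sketch (the quadratic payoffs V1, V2 and the step sizes are assumed for illustration, not taken from the paper), each agent's update adds a correction term that differentiates its own value through the opponent's anticipated naive gradient step:

    ```python
    # Toy LOLA sketch: V1 = -t1^2 + t1*t2, V2 = -t2^2 + t1*t2 (assumed payoffs).
    # Each agent i ascends Vi(t1, t2 + eta * dVj/dtj), i.e. its value after the
    # opponent's anticipated one-step naive gradient update.

    def lola_step(t1, t2, alpha=0.05, eta=0.1):
        g1_V1 = -2.0 * t1 + t2   # dV1/dt1 (naive gradient for agent 1)
        g2_V2 = -2.0 * t2 + t1   # dV2/dt2 (naive gradient for agent 2)
        g2_V1 = t1               # dV1/dt2
        g1_V2 = t2               # dV2/dt1
        # Cross second derivatives d2V2/(dt1 dt2) = d2V1/(dt1 dt2) = 1 here,
        # so the LOLA correction reduces to eta * (cross term) * (opponent grad).
        lola1 = g1_V1 + eta * 1.0 * g2_V1
        lola2 = g2_V2 + eta * 1.0 * g1_V2
        return t1 + alpha * lola1, t2 + alpha * lola2

    t1, t2 = 1.0, -1.0
    for _ in range(2000):
        t1, t2 = lola_step(t1, t2)
    # In this toy game both parameters converge to the stable fixed point (0, 0).
    ```

    In richer settings the exact cross-derivative is unavailable, which is where the paper's likelihood-ratio policy gradient extension comes in.
    
    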

    Parametric Trajectory Libraries for Online Motion Planning with Application to Soft Robots

    In this paper we propose a method for online motion planning of constrained nonlinear systems. The method consists of three steps: the offline generation of a library of parametric trajectories via direct trajectory optimization, the online search in the library for the best candidate solution to the optimal control problem we aim to solve, and the online refinement of this trajectory. The last phase takes advantage of a sensitivity-like analysis and guarantees compliance with the first-order approximation of the constraints even in the case of active-set changes. The efficiency of the trajectory generation process is discussed and a strategy to minimize online computations is proposed, together with an effective procedure for searching for the candidate trajectory. As a case study, we examine optimal control of a planar soft manipulator performing a pick-and-place task: through simulations and experiments, we show how crucial online computation times are to achieve considerable energy savings in the presence of variability of the task to perform.
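    The offline-library plus online-search structure described above can be sketched as follows. This is a hedged illustration with an assumed interface (the class name, the string placeholders for trajectories, and nearest-neighbour search in task-parameter space are all hypothetical simplifications of the paper's method):

    ```python
    import numpy as np

    class TrajectoryLibrary:
        """Offline: store trajectories optimized for sampled task parameters.
        Online: return the stored trajectory whose parameters best match the task."""

        def __init__(self):
            self.params = []        # task-parameter vectors p_i
            self.trajectories = []  # corresponding optimized trajectories

        def add(self, p, traj):
            self.params.append(np.asarray(p, dtype=float))
            self.trajectories.append(traj)

        def best_candidate(self, p_query):
            # Nearest neighbour in task-parameter space as the candidate
            # to hand to the online refinement step.
            dists = [np.linalg.norm(p - p_query) for p in self.params]
            i = int(np.argmin(dists))
            return i, self.trajectories[i]

    lib = TrajectoryLibrary()
    lib.add([0.0, 0.0], "traj_A")
    lib.add([1.0, 0.5], "traj_B")
    idx, traj = lib.best_candidate(np.array([0.9, 0.4]))
    ```

    The returned candidate would then be refined online via the sensitivity-like correction, which is the step that keeps the first-order constraint approximation satisfied.
    
    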

    Contact Planning for the ANYmal Quadruped Robot using an Acyclic Reachability-Based Planner

    Despite the great progress in quadrupedal robotics during the last decade, selecting good contacts (footholds) in highly uneven and cluttered environments still remains an open challenge. This paper builds upon a state-of-the-art approach, already successfully used for humanoid robots, and applies it to our robotic platform, the quadruped robot ANYmal. The proposed algorithm decouples the problem into two subproblems: first a guide trajectory for the robot is generated, then contacts are created along this trajectory. Both subproblems rely on approximations and heuristics that need to be tuned. The main contribution of this work is to explain how this algorithm has been retuned to work with ANYmal and to show the relevance of the approach with a variety of tests in realistic dynamic simulations.
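    The two-stage decoupling (guide trajectory first, contacts second) can be caricatured in a few lines. This is a deliberately simplified sketch with assumed names and a hypothetical terrain predicate; the actual planner uses reachability-based geometric tests, not this toy check:

    ```python
    def plan_contacts(guide_path, leg_offsets, terrain_ok):
        """Stage 2 of the decoupled planner: at each root pose along the
        stage-1 guide path, propose one nominal foothold per leg and keep
        only candidates the terrain predicate accepts."""
        contacts = []
        for root in guide_path:            # root poses from the guide trajectory
            for off in leg_offsets:        # nominal foothold offset per leg
                foot = (root[0] + off[0], root[1] + off[1])
                if terrain_ok(foot):       # stand-in for the reachability test
                    contacts.append(foot)
        return contacts

    # Hypothetical flat corridor: any foothold with |y| < 1.0 is acceptable.
    flat = lambda p: abs(p[1]) < 1.0
    path = [(0.0, 0.0), (0.5, 0.0), (1.0, 0.0)]
    offsets = [(0.2, 0.15), (0.2, -0.15)]
    contacts = plan_contacts(path, offsets, flat)
    ```

    The point of the decoupling is that each stage is cheap on its own; the cost is that the heuristics in both stages must be retuned per platform, which is exactly the paper's contribution for ANYmal.
    
    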

    Injury Assessment for Physics-Based Characters

    Determining injury levels for virtual characters is an important aspect of many games. For characters that are animated using simulated physics, it is possible to assess injury levels based on physical properties, such as accelerations and forces. We have constructed a model for injury assessment that relates results from research on human injury response to parameters in physics-based animation systems. We describe a set of different normalized injury measures for individual body parts, which can be combined into a single measure for total injury. Our research includes a user study in which human observers rate the injury levels of physics-based characters falling from varying heights at different orientations. Results show that the correlation between our model output and perceived injury is stronger than the correlation between perceived injury and fall height (0.603 versus 0.466, respectively, with N = 1020 and p
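    The combination of normalized per-body-part measures into a single total can be sketched as below. The weighted-average-with-saturation rule here is an assumption for illustration; the paper defines its own combination of measures:

    ```python
    def total_injury(measures, weights=None):
        """Combine normalized per-body-part injury measures (each in [0, 1])
        into a single total-injury score, also clamped to [0, 1].

        measures: dict mapping body part -> normalized injury value
        weights:  optional dict of per-part weights (defaults to uniform)
        """
        if weights is None:
            weights = {part: 1.0 for part in measures}
        weighted = sum(weights[part] * value for part, value in measures.items())
        return min(1.0, weighted / sum(weights.values()))

    score = total_injury({"head": 0.8, "torso": 0.2})
    ```

    A per-part weighting like this would let a game, for example, penalize head impacts more heavily than limb impacts while still reporting one normalized number.
    
    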