Coordinated Multi-Agent Imitation Learning

Author: Carr Peter
Le Hoang M.
Lucey Patrick
Yue Yisong
Publication date: 01/08/2017

We study the problem of imitation learning from demonstrations of multiple coordinating agents. One key challenge in this setting is that learning a good model of coordination can be difficult, since coordination is often implicit in the demonstrations and must be inferred as a latent variable. We propose a joint approach that simultaneously learns a latent coordination model along with the individual policies. In particular, our method integrates unsupervised structure learning with conventional imitation learning. We illustrate the power of our approach on a difficult problem of learning multiple policies for fine-grained behavior modeling in team sports, where different players occupy different roles in the coordinated team strategy. We show that having a coordination model to infer the roles of players yields substantially improved imitation loss compared to conventional baselines. Comment: International Conference on Machine Learning 201

arXiv.org e-Print Archive

Caltech Authors

Reinforcement Learning for Hybrid and Plug-In Hybrid Electric Vehicle Energy Management: Recent Advances and Prospects

Author: Barth Matthew
Hu Xiaosong
Liu Teng
Qi Xuewei
Publication venue: eScholarship, University of California
Publication date: 01/09/2019

eScholarship - University of California

Learning Local Components to Understand Large Bayesian Networks

Author: H. Jorge Cordero
Lin Yujian
Xiang Yanping
Zeng Yifeng
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009

Teesside University's Research Repository

VBN

Pooling or sampling: Collective dynamics for electrical flow estimation

Author: Becchetti Luca
Bonifaci Vincenzo
Natale Emanuele
Publication venue: International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
Publication date: 01/01/2018

The computation of electrical flows is a crucial primitive for many recently proposed optimization algorithms on weighted networks. While typically implemented as a centralized subroutine, the ability to perform this task in a fully decentralized way is implicit in a number of biological systems. Thus, a natural question is whether this task can provably be accomplished in an efficient way by a network of agents executing a simple protocol. We provide a positive answer, proposing two distributed approaches to electrical flow computation on a weighted network: a deterministic process mimicking Jacobi's iterative method for solving linear systems, and a randomized token diffusion process, based on revisiting a classical random walk process on a graph with an absorbing node. We show that both processes converge to a solution of Kirchhoff's node potential equations, derive bounds on their convergence rates in terms of the weights of the network, and analyze their time and message complexity.
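
As a rough illustration of the deterministic process mentioned in this abstract, the sketch below runs a synchronous Jacobi-style update in which every node repeatedly replaces its potential with a weighted average of its neighbours' potentials. It is not the authors' protocol. It assumes that edge weights are conductances, that one unit of current enters at a source node and leaves at a sink node, and that the sink is held at potential 0. The names jacobi_potentials, edge_flows and graph are hypothetical.

```python
def jacobi_potentials(weights, source, sink, iterations=200):
    """Approximate node potentials for a unit source-to-sink current.

    weights maps each node to a dict {neighbour: conductance} and is
    assumed symmetric. The sink is grounded at potential 0 (assumption).
    """
    potentials = {u: 0.0 for u in weights}
    for _ in range(iterations):
        updated = {}
        for u, nbrs in weights.items():
            if u == sink:
                updated[u] = 0.0  # grounded node keeps potential 0
                continue
            injected = 1.0 if u == source else 0.0
            # Kirchhoff at node u: sum_v w_uv * (p_u - p_v) = injected current
            weighted_sum = sum(w * potentials[v] for v, w in nbrs.items())
            updated[u] = (injected + weighted_sum) / sum(nbrs.values())
        potentials = updated  # synchronous update: all nodes switch together
    return potentials


def edge_flows(weights, potentials):
    """Flow on each edge (u, v) with u < v, by Ohm's law: w * (p_u - p_v)."""
    return {
        (u, v): w * (potentials[u] - potentials[v])
        for u, nbrs in weights.items()
        for v, w in nbrs.items()
        if u < v
    }


# Hypothetical 4-node network: two parallel two-edge paths from node 0 to node 3.
graph = {
    0: {1: 1.0, 2: 1.0},
    1: {0: 1.0, 3: 1.0},
    2: {0: 1.0, 3: 1.0},
    3: {1: 1.0, 2: 1.0},
}
potentials = jacobi_potentials(graph, source=0, sink=3)
flows = edge_flows(graph, potentials)
```

Each node only reads its neighbours' current values, which is why an update of this form can run as a local message-passing protocol. In this symmetric example the unit current splits evenly between the two paths.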

Archivio della ricerca- Università di Roma La Sapienza

The 1990 progress report and future plans

Author: Compton Michael
Friedland Peter
Zweben Monte

This document describes the progress and plans of the Artificial Intelligence Research Branch (RIA) at ARC in 1990. Activities span a range from basic scientific research to engineering development and to fielded NASA applications, particularly those applications that are enabled by basic research carried out at RIA. Work is conducted in-house and through collaborative partners in academia and industry. Our major focus is on a limited number of research themes with a dual commitment to technical excellence and proven applicability to NASA's short-, medium-, and long-term problems. RIA acts as the Agency's lead organization for research aspects of artificial intelligence, working closely with a second research laboratory at JPL and AI applications groups at all NASA centers.

NASA Technical Reports Server