Imitating Driver Behavior with Generative Adversarial Networks

Author: Kochenderfer Mykel
Kuefler Alex
Morton Jeremy
Wheeler Tim
Publication date: 23/01/2017

The ability to accurately predict and simulate human driving behavior is critical for the development of intelligent transportation systems. Traditional modeling methods have employed simple parametric models and behavioral cloning. This paper adopts a method for overcoming the problem of cascading errors inherent in prior approaches, resulting in realistic behavior that is robust to trajectory perturbations. We extend Generative Adversarial Imitation Learning to the training of recurrent policies, and we demonstrate that our model outperforms rule-based controllers and maximum likelihood models in realistic highway simulations. Our model reproduces emergent behavior of human drivers, such as lane change rate, while maintaining realistic control over long time horizons. Comment: 8 pages, 6 figures

arXiv.org e-Print Archive

Crossref

Measurement-based optimization of batch and repetitive processes using an integrated two-layer architecture

Author: Chachuat B
Fikar M
Podmajersky M
Publication venue: 'Elsevier BV'
Publication date: 14/05/2013

Spiral - Imperial College Digital Repository

Reaching the limit in autonomous racing: Optimal control versus reinforcement learning

Author: Koltun Vladlen
Müller Matthias
Romero Angel
Scaramuzza Davide
Song Yunlong
Publication venue: American Association for the Advancement of Science
Publication date: 13/09/2023

A central question in robotics is how to design a control system for an agile mobile robot. This paper studies this question systematically, focusing on a challenging setting: autonomous drone racing. We show that a neural network controller trained with reinforcement learning (RL) outperformed optimal control (OC) methods in this setting. We then investigated which fundamental factors have contributed to the success of RL or have limited OC. Our study indicates that the fundamental advantage of RL over OC is not that it optimizes its objective better but that it optimizes a better objective. OC decomposes the problem into planning and control with an explicit intermediate representation, such as a trajectory, that serves as an interface. This decomposition limits the range of behaviors that can be expressed by the controller, leading to inferior control performance when facing unmodeled effects. In contrast, RL can directly optimize a task-level objective and can leverage domain randomization to cope with model uncertainty, allowing the discovery of more robust control responses. Our findings allowed us to push an agile drone to its maximum performance, achieving a peak acceleration greater than 12 times the gravitational acceleration and a peak velocity of 108 kilometers per hour. Our policy achieved superhuman control within minutes of training on a standard workstation. This work presents a milestone in agile robotics and sheds light on the role of RL and OC in robot control.

ZORA

Photovoltaic and Behind-the-Meter Battery Storage: Advanced Smart Inverter Controls and Field Demonstration

Author: Gehbauer Christoph
Mueller Joscha
Swenson Tucker
Vrettos Evangelos
Publication venue: eScholarship, University of California
Publication date: 13/07/2021

eScholarship - University of California