Time-Delayed Data Informed Reinforcement Learning for Approximate Optimal Tracking Control

Author: Buss Martin
Li Cong
Liu Fangzhou
Sun Weichao
Wang Yongchao
Publication date: 01/08/2022

This paper proposes a time-delayed data informed reinforcement learning method, referred to as incremental adaptive dynamic programming, to learn approximate solutions to optimal tracking control problems (OTCPs) of high-dimensional nonlinear systems. Departing from available solutions to OTCPs, the developed tracking control scheme addresses the curse-of-complexity problem in value function approximation in a decoupled way, circumvents the learning inefficiency caused by varying desired trajectories by not introducing reference trajectory dynamics into the learning process, and, by using time-delayed signals, requires neither accurate nor identified dynamics. Specifically, the intractable OTCP of a high-dimensional uncertain system is first converted into multiple manageable sub-OTCPs of low-dimensional incremental subsystems constructed using time-delayed data. Then, the resulting sub-OTCPs are approximately solved by a parallel critic learning structure. The proposed tracking control scheme is developed with rigorous theoretical analysis of system stability and weight convergence, and validated experimentally on a 3-DoF robot manipulator.

arXiv.org e-Print Archive

Towards Informed Exploration for Deep Reinforcement Learning

Author: Tang Haoran
Publication venue: eScholarship, University of California
Publication date: 01/01/2019

In this thesis, we discuss various techniques for improving exploration for deep reinforcement learning. We begin with a brief review of reinforcement learning (RL) and the fundamental exploration vs. exploitation trade-off. Then we review how deep RL has improved upon classical RL and summarize six categories of the latest exploration methods for deep RL, in order of increasing usage of prior information. We then explore representative works in three categories and discuss their strengths and weaknesses. The first category, represented by Soft Q-learning, uses regularization to encourage exploration. The second category, represented by count-based exploration via hashing, maps states to hash codes for counting and assigns higher exploration bonuses to less-encountered states. The third category utilizes hierarchy and is represented by a modular architecture for RL agents to play StarCraft II. Finally, we conclude that exploration by prior knowledge is a promising research direction and suggest topics of potential impact.
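The count-based category described in the abstract above can be illustrated with a minimal sketch. It assumes a SimHash-style random projection and a bonus of the form beta / sqrt(count); the class name `HashCounter` and the parameters `n_bits` and `beta` are hypothetical choices for illustration and are not taken from the thesis.

```python
from collections import defaultdict

import numpy as np


class HashCounter:
    """Counts visits per hash code and returns a bonus that shrinks with the count."""

    def __init__(self, state_dim, n_bits=16, beta=0.1, seed=0):
        rng = np.random.default_rng(seed)
        # Fixed random projection: nearby states tend to share a sign pattern.
        self.projection = rng.standard_normal((n_bits, state_dim))
        self.counts = defaultdict(int)
        self.beta = beta  # assumed bonus scale

    def hash_code(self, state):
        # The sign pattern of the projected state serves as the discrete code.
        bits = (self.projection @ np.asarray(state, dtype=float)) >= 0
        return bits.tobytes()

    def bonus(self, state):
        # Increment the visit count, then return beta / sqrt(count).
        code = self.hash_code(state)
        self.counts[code] += 1
        return self.beta / np.sqrt(self.counts[code])


counter = HashCounter(state_dim=4)
state = [1.0, 0.5, -0.2, 0.3]
first_bonus = counter.bonus(state)
second_bonus = counter.bonus(state)  # smaller, because the code was seen before
```

In a training loop the bonus would typically be added to the environment reward, so that rarely visited codes look more attractive to the agent.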

eScholarship - University of California

Risk, Unexpected Uncertainty, and Estimation Uncertainty: Bayesian Learning in Unstable Settings

Author: Elise Payzan-LeNestour
Peter Bossaerts
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2011

Recently, evidence has emerged that humans approach learning using Bayesian updating rather than (model-free) reinforcement algorithms in a six-arm restless bandit problem. Here, we investigate what this implies for human appreciation of uncertainty. In our task, a Bayesian learner distinguishes three equally salient levels of uncertainty. First, the Bayesian perceives irreducible uncertainty or risk: even knowing the payoff probabilities of a given arm, the outcome remains uncertain. Second, there is (parameter) estimation uncertainty or ambiguity: payoff probabilities are unknown and need to be estimated. Third, the outcome probabilities of the arms change: the sudden jumps are referred to as unexpected uncertainty. We document how the three levels of uncertainty evolved during the course of our experiment and how they affected the learning rate. We then zoom in on estimation uncertainty, which has been suggested to be a driving force in exploration, in spite of evidence of widespread aversion to ambiguity. Our data corroborate the latter. We discuss neural evidence that foreshadowed the ability of humans to distinguish between the three levels of uncertainty. Finally, we investigate the boundaries of human capacity to implement Bayesian learning. We repeat the experiment with different instructions, reflecting varying levels of structural uncertainty. Under this fourth notion of uncertainty, choices were no better explained by Bayesian updating than by (model-free) reinforcement learning. Exit questionnaires revealed that participants remained unaware of the presence of unexpected uncertainty and failed to acquire the right model with which to implement Bayesian updating.
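The distinction the abstract above draws between risk and estimation uncertainty, and between Bayesian updating and a model-free rule, can be sketched for a single arm. This is a simplified illustration and not the authors' model: it assumes binary payoffs with a Beta prior, it ignores jumps (unexpected uncertainty), and the names `BetaArm` and `delta_rule` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class BetaArm:
    """Beta-Bernoulli belief about one arm's payoff probability (assumed binary payoffs)."""

    alpha: float = 1.0  # prior pseudo-count of payoffs
    beta: float = 1.0   # prior pseudo-count of non-payoffs

    def update(self, reward: int) -> None:
        # Bayesian updating: add the observed outcome to the pseudo-counts.
        self.alpha += reward
        self.beta += 1 - reward

    @property
    def mean(self) -> float:
        return self.alpha / (self.alpha + self.beta)

    @property
    def estimation_uncertainty(self) -> float:
        # Posterior variance of the payoff probability; shrinks as data accumulate.
        n = self.alpha + self.beta
        return self.alpha * self.beta / (n * n * (n + 1))

    @property
    def risk(self) -> float:
        # Outcome variance at the current estimate; remains even with perfect knowledge.
        p = self.mean
        return p * (1 - p)


def delta_rule(value: float, reward: float, learning_rate: float = 0.1) -> float:
    """Model-free update: move the value estimate toward the observed reward."""
    return value + learning_rate * (reward - value)


arm = BetaArm()
for outcome in [1, 1, 0, 1]:
    arm.update(outcome)
```

The Bayesian learner's effective learning rate falls as estimation uncertainty shrinks, whereas the delta rule keeps a fixed learning rate unless it is tuned separately.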

Infoscience - École polytechnique fédérale de Lausanne

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Caltech Authors

University of Melbourne Institutional Repository

Machine Learning for Fluid Mechanics

Author: Brunton Steven
Koumoutsakos Petros
Noack Bernd
Publication venue: 'Annual Reviews'
Publication date: 04/01/2020

The field of fluid mechanics is rapidly advancing, driven by unprecedented volumes of data from field measurements, experiments and large-scale simulations at multiple spatiotemporal scales. Machine learning offers a wealth of techniques to extract information from data that could be translated into knowledge about the underlying fluid mechanics. Moreover, machine learning algorithms can augment domain knowledge and automate tasks related to flow control and optimization. This article presents an overview of past history, current developments, and emerging opportunities of machine learning for fluid mechanics. It outlines fundamental machine learning methodologies and discusses their uses for understanding, modeling, optimizing, and controlling fluid flows. The strengths and limitations of these methods are addressed from the perspective of scientific inquiry that considers data as an inherent part of modeling, experimentation, and simulation. Machine learning provides a powerful information processing framework that can enrich, and possibly even transform, current lines of fluid mechanics research and industrial applications. Comment: To appear in the Annual Reviews of Fluid Mechanics, 202

arXiv.org e-Print Archive