Policy optimization for industrial benchmark using deep reinforcement learning

Abstract

2020 Summer. Includes bibliographical references.

Significant advancements have been made in the field of Reinforcement Learning (RL) in recent decades. Numerous novel RL environments have been studied, evaluated, and published, along with algorithms that master them. The most popular RL benchmark environments, produced by OpenAI Gym and DeepMind Lab, are modeled after single- or multi-player board games, video games, or single-purpose robots, and the RL algorithms that learn optimal policies for these games have outperformed humans in almost all of them. However, real-world applications of RL remain limited, as the academic community has little access to real industrial data and applications. The Industrial Benchmark (IB) is a novel RL benchmark motivated by industrial control problems, with properties such as continuous state and action spaces, high dimensionality, a partially observable state space, and delayed effects combined with complex heteroscedastic stochastic behavior. We have used Deep Reinforcement Learning (DRL) algorithms such as Deep Q-Networks (DQN) and Double DQN (DDQN) to study and model optimal policies on the IB. Our empirical results show several DRL models outperforming previously published models on the same benchmark.
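The Double DQN mentioned above differs from plain DQN only in how the bootstrap target is formed: the online network selects the greedy next action and the target network evaluates it, which reduces overestimation bias. A minimal sketch of that target computation (the function name and the toy numbers are illustrative, not taken from the thesis):

```python
import numpy as np

def ddqn_targets(rewards, next_q_online, next_q_target, dones, gamma=0.99):
    """Double-DQN bootstrap targets for a batch of transitions.

    rewards:       shape (B,)   immediate rewards
    next_q_online: shape (B, A) Q-values of the online network at s'
    next_q_target: shape (B, A) Q-values of the target network at s'
    dones:         shape (B,)   1.0 where the episode terminated
    """
    # Action selection with the online network ...
    best_actions = np.argmax(next_q_online, axis=1)
    # ... action evaluation with the target network.
    eval_q = next_q_target[np.arange(len(rewards)), best_actions]
    return rewards + gamma * (1.0 - dones) * eval_q

# Toy batch of two transitions (numbers purely illustrative).
rewards = np.array([1.0, 0.0])
next_q_online = np.array([[0.5, 2.0], [1.0, 0.2]])
next_q_target = np.array([[0.4, 1.5], [0.9, 0.1]])
dones = np.array([0.0, 1.0])
print(ddqn_targets(rewards, next_q_online, next_q_target, dones))
```

In plain DQN the target network would both select and evaluate the next action (a max over `next_q_target`); decoupling the two steps is the entire change.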
