We propose weakly coupled deep Q-networks (WCDQN), a novel deep reinforcement
learning algorithm that enhances performance in a class of structured problems
called weakly coupled Markov decision processes (WCMDP). WCMDPs consist of
multiple independent subproblems connected by an action space constraint, which
is a structural property that frequently emerges in practice. Despite this
appealing structure, WCMDPs quickly become intractable as the number of
subproblems grows. WCDQN employs a single network to train multiple DQN
"subagents", one for each subproblem, and then combines their solutions to
establish an upper bound on the optimal action value. This guides the main DQN
agent towards optimality. We show that the tabular version, weakly coupled
Q-learning (WCQL), converges almost surely to the optimal action value.
Numerical experiments show faster convergence compared to DQN and related
techniques in settings with as many as 10 subproblems, 3^10 total actions,
and a continuous state space.

Comment: To appear in proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023).
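The combination step described above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: it assumes each subagent i exposes action values Q_i(s_i, ·) for its own subproblem, that summing these values across subproblems yields an upper bound on the joint optimal action value (as in a Lagrangian-style decomposition), and that the main agent's estimate is projected below that bound. The function names `joint_upper_bound` and `clip_to_bound` are hypothetical.

```python
import numpy as np

def joint_upper_bound(subagent_q, joint_action):
    """Sum per-subproblem action values for one joint action.

    subagent_q:   list of 1-D arrays, subagent_q[i][a] = Q_i(s_i, a).
    joint_action: one action index per subproblem.
    Returns sum_i Q_i(s_i, a_i), which (under the assumed decomposition)
    upper-bounds the joint optimal action value Q(s, a).
    """
    return sum(q[a] for q, a in zip(subagent_q, joint_action))

def clip_to_bound(q_estimate, upper_bound):
    """Project the main agent's Q estimate below the decomposition bound."""
    return np.minimum(q_estimate, upper_bound)

# Illustrative use: two subproblems, two actions each.
subagent_q = [np.array([1.0, 2.0]), np.array([0.5, 3.0])]
ub = joint_upper_bound(subagent_q, (1, 0))   # Q_1(s_1, 1) + Q_2(s_2, 0)
clipped = clip_to_bound(5.0, ub)             # over-estimate pulled down to ub
```

In the full algorithm this bound would be recomputed per transition from the shared subagent network and folded into the main agent's loss; the sketch only shows the value combination itself.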