4,857 research outputs found
Deep Reinforcement Learning for Wireless Sensor Scheduling in Cyber-Physical Systems
In many Cyber-Physical Systems, we encounter the problem of remote state
estimation of geographically distributed and remote physical processes. This
paper studies the scheduling of sensor transmissions to estimate the states of
multiple remote, dynamic processes. Information from the different sensors have
to be transmitted to a central gateway over a wireless network for monitoring
purposes, where typically fewer wireless channels are available than there are
processes to be monitored. For effective estimation at the gateway, the sensors
need to be scheduled appropriately, i.e., at each time instant one needs to
decide which sensors have network access and which ones do not. To address this
scheduling problem, we formulate an associated Markov decision process (MDP).
This MDP is then solved using a Deep Q-Network, a recent deep reinforcement
learning algorithm that is at once scalable and model-free. We compare our
scheduling algorithm to popular scheduling algorithms such as round-robin and
reduced-waiting-time, among others. Our algorithm is shown to significantly
outperform these algorithms for many example scenarios
Multiple Loop Self-Triggered Model Predictive Control for Network Scheduling and Control
We present an algorithm for controlling and scheduling multiple linear
time-invariant processes on a shared bandwidth limited communication network
using adaptive sampling intervals. The controller is centralized and computes
at every sampling instant not only the new control command for a process, but
also decides the time interval to wait until taking the next sample. The
approach relies on model predictive control ideas, where the cost function
penalizes the state and control effort as well as the time interval until the
next sample is taken. The latter is introduced in order to generate an adaptive
sampling scheme for the overall system such that the sampling time increases as
the norm of the system state goes to zero. The paper presents a method for
synthesizing such a predictive controller and gives explicit sufficient
conditions for when it is stabilizing. Further explicit conditions are given
which guarantee conflict free transmissions on the network. It is shown that
the optimization problem may be solved off-line and that the controller can be
implemented as a lookup table of state feedback gains. Simulation studies which
compare the proposed algorithm to periodic sampling illustrate potential
performance gains.Comment: Accepted for publication in IEEE Transactions on Control Systems
Technolog
- …