14 research outputs found
Event-triggered Learning
The efficient exchange of information is an essential aspect of intelligent
collective behavior. Event-triggered control and estimation achieve some
efficiency by replacing continuous data exchange between agents with
intermittent, or event-triggered communication. Typically, model-based
predictions are used at times of no data transmission, and updates are sent
only when the prediction error grows too large. The effectiveness in reducing
communication thus strongly depends on the quality of the prediction model. In
this article, we propose event-triggered learning as a novel concept to reduce
communication even further and to also adapt to changing dynamics. By
monitoring the actual communication rate and comparing it to the one that is
induced by the model, we detect a mismatch between model and reality and
trigger model learning when needed. Specifically, for linear Gaussian dynamics,
we derive different classes of learning triggers solely based on a statistical
analysis of inter-communication times and formally prove their effectiveness
with the aid of concentration inequalities
Overcoming Bandwidth Limitations in Wireless Sensor Networks by Exploitation of Cyclic Signal Patterns: An Event-triggered Learning Approach
Wireless sensor networks are used in a wide range of applications, many of which require real-time transmission of the measurements. Bandwidth limitations result in limitations on the sampling frequency and number of sensors. This problem can be addressed by reducing the communication load via data compression and event-based communication approaches. The present paper focuses on the class of applications in which the signals exhibit unknown and potentially time-varying cyclic patterns. We review recently proposed event-triggered learning (ETL) methods that identify and exploit these cyclic patterns, we show how these methods can be applied to the nonlinear multivariable dynamics of three-dimensional orientation data, and we propose a novel approach that uses Gaussian process models. In contrast to other approaches, all three ETL methods work in real time and assure a small upper bound on the reconstruction error. The proposed methods are compared to several conventional approaches in experimental data from human subjects walking with a wearable inertial sensor network. They are found to reduce the communication load by 60–70%, which implies that two to three times more sensor nodes could be used at the same bandwidth
Event-triggered Pulse Control with Model Learning (if Necessary)
In networked control systems, communication is a shared and therefore scarce
resource. Event-triggered control (ETC) can achieve high performance control
with a significantly reduced amount of samples compared to classical, periodic
control schemes. However, ETC methods usually rely on the availability of an
accurate dynamics model, which is oftentimes not readily available. In this
paper, we propose a novel event-triggered pulse control strategy that learns
dynamics models if necessary. In addition to adapting to changing dynamics, the
method also represents a suitable replacement for the integral part typically
used in periodic control.Comment: Accepted final version to appear in: Proc. of the American Control
Conference, 201
Event-triggered Learning
Machine learning has seen many recent breakthroughs. Inspired by these, learningcontrol
systems emerged. In essence, the goal is to learn models and control policies
for dynamical systems. Dealing with learning-control systems is hard and there are
several key challenges that differ from classical machine learning tasks. Conceptually,
excitation and exploration play a major role in learning-control systems. On
the one hand, we usually aim for controllers that stabilize a system with the goal of
avoiding deviations from a setpoint or reference. However, we also need informative
data for learning, which is often not the case when controllers work well. Therefore,
there is a problem due to the opposing objectives of many control theoretical tasks
and the requirements for successful learning outcomes.
Additionally, change of dynamics or other conditions is often encountered for
control systems in practice. For example, new tasks, changing load conditions, or
different external conditions have a substantial influence on the underlying distribution.
Learning can provide the flexibility to adapt the behavior of learning-control
systems to these events.
Since learning has to be applied with sufficient excitation there are many practical
situations that hinge on the following problem:
"When to trigger learning updates in learning-control systems?"
This is the core question of this thesis and despite its relevance, there is no general
method that provides an answer. We propose and develop a new paradigm for
principled decision making on when to learn, which we call event-triggered learning
(ETL).
The first triggers that we discuss are designed for networked control systems. All
agents use model-based predictions to anticipate the other agents’ behavior which
makes communication only necessary when the predictions deviate too much. Essentially,
an accurate model can save communication, while a poor model leads to
poor predictions and thus frequent updates. The learning triggers are based on the
inter-communication times (the time between two communication instances). They
are independent and identically distributed random variables, which directly leads
to sound guarantees. The framework is validated in experiments and leads to 70%
communication savings for wireless sensor networks that monitor human walking.
In the second part, we consider optimal control algorithms and start with linear
quadratic regulators. A perfect model yields the best possible controller, while poor models result in poor controllers. Thus, by analyzing the control performance, we
can infer the model’s accuracy. From a technical point of view, we have to deal
with correlated data and work with more sophisticated tools to provide the desired
theoretical guarantees. While we obtain a powerful test that is tightly tailored to the
problem at hand, it does not generalize to different control architectures. Therefore,
we also consider a more general point of view, where we recast the learning of linear
systems as a filtering problem. We leverage Kalman filter-based techniques to derive
a sound test and utilize the point estimate of the parameters for targeted learning
experiments. The algorithm is independent of the underlying control architecture,
but demonstrated for model predictive control.
Most of the results in the first two parts critically depend on linearity assumptions
in the dynamics and further problem-specific properties. In the third part, we
take a step back and ask the fundamental question of how to compare (nonlinear)
dynamical systems directly from state data. We propose a kernel two-sample test
that compares stationary distributions of dynamical systems. Additionally, we introduce
a new type of mixing that can directly be estimated from data to deal with
the autocorrelations.
In summary, this thesis introduces a new paradigm for deciding when to trigger
updates in learning-control systems. Additionally, we develop three instantiations
of this paradigm for different learning-control problems. Further, we present applications
of the algorithms that yield substantial communication savings, effective
controller updates, and the detection of anomalies in human walking data
Event-Triggered Time-Varying Bayesian Optimization
We consider the problem of sequentially optimizing a time-varying objective
function using time-varying Bayesian optimization (TVBO). Here, the key
challenge is the exploration-exploitation trade-off under time variations.
Current approaches to TVBO require prior knowledge of a constant rate of
change. However, in practice, the rate of change is usually unknown. We propose
an event-triggered algorithm, ET-GP-UCB, that treats the optimization problem
as static until it detects changes in the objective function online and then
resets the dataset. This allows the algorithm to adapt to realized temporal
changes without the need for prior knowledge. The event-trigger is based on
probabilistic uniform error bounds used in Gaussian process regression. We
provide regret bounds for ET-GP-UCB and show in numerical experiments that it
outperforms state-of-the-art algorithms on synthetic and real-world data.
Furthermore, these results demonstrate that ET-GP-UCB is readily applicable to
various settings without tuning hyperparameters
Identifying Causal Structure in Dynamical Systems
We present a method for automatically identifying the causal structure of a
dynamical control system. Through a suitable experiment design and subsequent
causal analysis, the method reveals, which state and input variables of the
system have a causal influence on each other. The experiment design builds on
the concept of controllability, which provides a systematic way to compute
input trajectories that steer the system to specific regions in its state
space. For the causal analysis, we leverage powerful techniques from causal
inference and extend them to control systems. Further, we derive conditions
that guarantee discovery of the true causal structure of the system and show
that the obtained knowledge of the causal structure reduces the complexity of
model learning and yields improved generalization capabilities. Experiments on
a robot arm demonstrate reliable causal identification from real-world data and
extrapolation to regions outside the training domain
Hierarchical Event-triggered Learning for Cyclically Excited Systems with Application to Wireless Sensor Networks
Communication load is a limiting factor in many real-time systems.
Event-triggered state estimation and event-triggered learning methods reduce
network communication by sending information only when it cannot be adequately
predicted based on previously transmitted data. This paper proposes an
event-triggered learning approach for nonlinear discrete-time systems with
cyclic excitation. The method automatically recognizes cyclic patterns in data
- even when they change repeatedly - and reduces communication load whenever
the current data can be accurately predicted from previous cycles. Nonetheless,
a bounded error between original and received signal is guaranteed. The cyclic
excitation model, which is used for predictions, is updated hierarchically,
i.e., a full model update is only performed if updating a small number of model
parameters is not sufficient. A nonparametric statistical test enforces that
model updates happen only if the cyclic excitation changed with high
probability. The effectiveness of the proposed methods is demonstrated using
the application example of wireless real-time pitch angle measurements of a
human foot in a feedback-controlled neuroprosthesis. The experimental results
show that communication load can be reduced by 70 % while the root-mean-square
error between measured and received angle is less than 1{\deg}.Comment: 6 pages and 6 figures; to appear in IEEE Control Systems Letter