3,038 research outputs found
Error Bounds for Piecewise Smooth and Switching Regression
The paper deals with regression problems, in which the nonsmooth target is
assumed to switch between different operating modes. Specifically, piecewise
smooth (PWS) regression considers target functions switching deterministically
via a partition of the input space, while switching regression considers
arbitrary switching laws. The paper derives generalization error bounds in
these two settings by following the approach based on Rademacher complexities.
For PWS regression, our derivation involves a chaining argument and a
decomposition of the covering numbers of PWS classes in terms of the ones of
their component functions and the capacity of the classifier partitioning the
input space. This yields error bounds with a radical dependency on the number
of modes. For switching regression, the decomposition can be performed directly
at the level of the Rademacher complexities, which yields bounds with a linear
dependency on the number of modes. By using once more chaining and a
decomposition at the level of covering numbers, we show how to recover a
radical dependency. Examples of applications are given in particular for PWS
and swichting regression with linear and kernel-based component functions.Comment: This work has been submitted to the IEEE for possible publication.
Copyright may be transferred without notice,after which this version may no
longer be accessibl
Fast Automatic Verification of Large-Scale Systems with Lookup Tables
Modern safety-critical systems are difficult to formally verify, largely due to their large scale. In particular, the widespread use of lookup tables in embedded systems across diverse industries, such as aeronautics and automotive systems, create a critical obstacle to the scalability of formal verification. This paper presents a novel approach for the formal verification of large-scale systems with lookup tables. We use a learning-based technique to automatically learn abstractions of the lookup tables and use the abstractions to then prove the desired property. If the verification fails, we propose a falsification heuristic to search for a violation of the specification. In contrast with previous work on lookup table verification, our technique is completely automatic, making it ideal for deployment in a production environment. To our knowledge, our approach is the only technique that can automatically verify large-scale systems lookup with tables.
We illustrate the effectiveness of our technique on a benchmark which cannot be handled by the commercial tool SLDV, and we demonstrate the performance improvement provided by our technique
Risk Bounds for Learning Multiple Components with Permutation-Invariant Losses
This paper proposes a simple approach to derive efficient error bounds for
learning multiple components with sparsity-inducing regularization. We show
that for such regularization schemes, known decompositions of the Rademacher
complexity over the components can be used in a more efficient manner to result
in tighter bounds without too much effort. We give examples of application to
switching regression and center-based clustering/vector quantization. Then, the
complete workflow is illustrated on the problem of subspace clustering, for
which decomposition results were not previously available. For all these
problems, the proposed approach yields risk bounds with mild dependencies on
the number of components and completely removes this dependence for nonconvex
regularization schemes that could not be handled by previous methods
Short and long-term wind turbine power output prediction
In the wind energy industry, it is of great importance to develop models that
accurately forecast the power output of a wind turbine, as such predictions are
used for wind farm location assessment or power pricing and bidding,
monitoring, and preventive maintenance. As a first step, and following the
guidelines of the existing literature, we use the supervisory control and data
acquisition (SCADA) data to model the wind turbine power curve (WTPC). We
explore various parametric and non-parametric approaches for the modeling of
the WTPC, such as parametric logistic functions, and non-parametric piecewise
linear, polynomial, or cubic spline interpolation functions. We demonstrate
that all aforementioned classes of models are rich enough (with respect to
their relative complexity) to accurately model the WTPC, as their mean squared
error (MSE) is close to the MSE lower bound calculated from the historical
data. We further enhance the accuracy of our proposed model, by incorporating
additional environmental factors that affect the power output, such as the
ambient temperature, and the wind direction. However, all aforementioned
models, when it comes to forecasting, seem to have an intrinsic limitation, due
to their inability to capture the inherent auto-correlation of the data. To
avoid this conundrum, we show that adding a properly scaled ARMA modeling layer
increases short-term prediction performance, while keeping the long-term
prediction capability of the model
Fitting Jump Models
We describe a new framework for fitting jump models to a sequence of data.
The key idea is to alternate between minimizing a loss function to fit multiple
model parameters, and minimizing a discrete loss function to determine which
set of model parameters is active at each data point. The framework is quite
general and encompasses popular classes of models, such as hidden Markov models
and piecewise affine models. The shape of the chosen loss functions to minimize
determine the shape of the resulting jump model.Comment: Accepted for publication in Automatic
- …