490 research outputs found
Regularization and Bayesian Learning in Dynamical Systems: Past, Present and Future
Regularization and Bayesian methods for system identification have been
repopularized in the recent years, and proved to be competitive w.r.t.
classical parametric approaches. In this paper we shall make an attempt to
illustrate how the use of regularization in system identification has evolved
over the years, starting from the early contributions both in the Automatic
Control as well as Econometrics and Statistics literature. In particular we
shall discuss some fundamental issues such as compound estimation problems and
exchangeability which play and important role in regularization and Bayesian
approaches, as also illustrated in early publications in Statistics. The
historical and foundational issues will be given more emphasis (and space), at
the expense of the more recent developments which are only briefly discussed.
The main reason for such a choice is that, while the recent literature is
readily available, and surveys have already been published on the subject, in
the author's opinion a clear link with past work had not been completely
clarified.Comment: Plenary Presentation at the IFAC SYSID 2015. Submitted to Annual
Reviews in Contro
A Bayesian Approach to Sparse plus Low rank Network Identification
We consider the problem of modeling multivariate time series with
parsimonious dynamical models which can be represented as sparse dynamic
Bayesian networks with few latent nodes. This structure translates into a
sparse plus low rank model. In this paper, we propose a Gaussian regression
approach to identify such a model
The Harmonic Analysis of Kernel Functions
Kernel-based methods have been recently introduced for linear system
identification as an alternative to parametric prediction error methods.
Adopting the Bayesian perspective, the impulse response is modeled as a
non-stationary Gaussian process with zero mean and with a certain kernel (i.e.
covariance) function. Choosing the kernel is one of the most challenging and
important issues. In the present paper we introduce the harmonic analysis of
this non-stationary process, and argue that this is an important tool which
helps in designing such kernel. Furthermore, this analysis suggests also an
effective way to approximate the kernel, which allows to reduce the
computational burden of the identification procedure
Visual Representations: Defining Properties and Deep Approximations
Visual representations are defined in terms of minimal sufficient statistics
of visual data, for a class of tasks, that are also invariant to nuisance
variability. Minimal sufficiency guarantees that we can store a representation
in lieu of raw data with smallest complexity and no performance loss on the
task at hand. Invariance guarantees that the statistic is constant with respect
to uninformative transformations of the data. We derive analytical expressions
for such representations and show they are related to feature descriptors
commonly used in computer vision, as well as to convolutional neural networks.
This link highlights the assumptions and approximations tacitly assumed by
these methods and explains empirical practices such as clamping, pooling and
joint normalization.Comment: UCLA CSD TR140023, Nov. 12, 2014, revised April 13, 2015, November
13, 2015, February 28, 201
Maximum Entropy Vector Kernels for MIMO system identification
Recent contributions have framed linear system identification as a
nonparametric regularized inverse problem. Relying on -type
regularization which accounts for the stability and smoothness of the impulse
response to be estimated, these approaches have been shown to be competitive
w.r.t classical parametric methods. In this paper, adopting Maximum Entropy
arguments, we derive a new penalty deriving from a vector-valued
kernel; to do so we exploit the structure of the Hankel matrix, thus
controlling at the same time complexity, measured by the McMillan degree,
stability and smoothness of the identified models. As a special case we recover
the nuclear norm penalty on the squared block Hankel matrix. In contrast with
previous literature on reweighted nuclear norm penalties, our kernel is
described by a small number of hyper-parameters, which are iteratively updated
through marginal likelihood maximization; constraining the structure of the
kernel acts as a (hyper)regularizer which helps controlling the effective
degrees of freedom of our estimator. To optimize the marginal likelihood we
adapt a Scaled Gradient Projection (SGP) algorithm which is proved to be
significantly computationally cheaper than other first and second order
off-the-shelf optimization methods. The paper also contains an extensive
comparison with many state-of-the-art methods on several Monte-Carlo studies,
which confirms the effectiveness of our procedure
Bayesian and regularization approaches to multivariable linear system identification: the role of rank penalties
Recent developments in linear system identification have proposed the use of
non-parameteric methods, relying on regularization strategies, to handle the
so-called bias/variance trade-off. This paper introduces an impulse response
estimator which relies on an -type regularization including a
rank-penalty derived using the log-det heuristic as a smooth approximation to
the rank function. This allows to account for different properties of the
estimated impulse response (e.g. smoothness and stability) while also
penalizing high-complexity models. This also allows to account and enforce
coupling between different input-output channels in MIMO systems. According to
the Bayesian paradigm, the parameters defining the relative weight of the two
regularization terms as well as the structure of the rank penalty are estimated
optimizing the marginal likelihood. Once these hyperameters have been
estimated, the impulse response estimate is available in closed form.
Experiments show that the proposed method is superior to the estimator relying
on the "classic" -regularization alone as well as those based in atomic
and nuclear norm.Comment: to appear in IEEE Conference on Decision and Control, 201
Identification of stable models via nonparametric prediction error methods
A new Bayesian approach to linear system identification has been proposed in
a series of recent papers. The main idea is to frame linear system
identification as predictor estimation in an infinite dimensional space, with
the aid of regularization/Bayesian techniques. This approach guarantees the
identification of stable predictors based on the prediction error minimization.
Unluckily, the stability of the predictors does not guarantee the stability of
the impulse response of the system. In this paper we propose and compare
various techniques to address this issue. Simulations results comparing these
techniques will be provided.Comment: number of pages = 6, number of figures =
Recommended from our members
Closed-loop Identification of an Industrial Extrusion Process
This paper deals with the challenging problem of closed-loop identification for multivariable chemical processes and particularly the estimation of an open-loop plant model for a lab-scale industrial twin-screw extruder used in a powder coatings manufacturing line. The aim is to produce a low order efficient model in order to assist the scaling-up and the model-based control design of the manufacturing process. To achieve this goal, a two-stage indirect approach has been deployed which relies on the a-priori knowledge of the controller parameters in order to extract good estimates of the open-loop dynamics of the underlying process. As input excitation signals we have used multiple single variable step tests at various operating conditions (current industrial practice) carried out manually in order to generate the data-set which captures the dynamics of the extrusion process. In order to increase the efforts for obtaining a suitable plant model, we have employed various identification techniques, such as Prediction Error Methods (PEM) and Subspace Identification Methods (SIM) in order to generate candidate closed-loop models that fit to the original input-output process data. Then, a comparison of the estimated models was performed by means of the mean square error and data fitting criteria in order to select the model that best describes the dynamic behaviour of the extrusion process. Model validation based on closed-loop step responses also used as verification of the results
Classical vs. Bayesian methods for linear system identification: point estimators and confidence sets
This paper compares classical parametric methods with recently developed
Bayesian methods for system identification. A Full Bayes solution is considered
together with one of the standard approximations based on the Empirical Bayes
paradigm. Results regarding point estimators for the impulse response as well
as for confidence regions are reported.Comment: number of pages = 8, number of figures =
Estimating effective connectivity in linear brain network models
Contemporary neuroscience has embraced network science to study the complex
and self-organized structure of the human brain; one of the main outstanding
issues is that of inferring from measure data, chiefly functional Magnetic
Resonance Imaging (fMRI), the so-called effective connectivity in brain
networks, that is the existing interactions among neuronal populations. This
inverse problem is complicated by the fact that the BOLD (Blood Oxygenation
Level Dependent) signal measured by fMRI represent a dynamic and nonlinear
transformation (the hemodynamic response) of neuronal activity. In this paper,
we consider resting state (rs) fMRI data; building upon a linear population
model of the BOLD signal and a stochastic linear DCM model, the model
parameters are estimated through an EM-type iterative procedure, which
alternately estimates the neuronal activity by means of the Rauch-Tung-Striebel
(RTS) smoother, updates the connections among neuronal states and refines the
parameters of the hemodynamic model; sparsity in the interconnection structure
is favoured using an iteratively reweighting scheme. Experimental results using
rs-fMRI data are shown demonstrating the effectiveness of our approach and
comparison with state of the art routines (SPM12 toolbox) is provided
- …