Search CORE

8,061 research outputs found

Safe Q-learning for continuous-time linear systems

Author: Bandyopadhyay Soutrik
Bhasin Shubhendu
Publication venue
Publication date: 26/04/2023
Field of study

Q-learning is a promising method for solving optimal control problems for uncertain systems without the explicit need for system identification. However, approaches for continuous-time Q-learning have limited provable safety guarantees, which restrict their applicability to real-time safety-critical systems. This paper proposes a safe Q-learning algorithm for partially unknown linear time-invariant systems to solve the linear quadratic regulator problem with user-defined state constraints. We frame the safe Q-learning problem as a constrained optimal control problem using reciprocal control barrier functions and show that such an extension provides a safety-assured control policy. To the best of our knowledge, Q-learning for continuous-time systems with state constraints has not yet been reported in the literature

arXiv.org e-Print Archive

Min-max model predictive control as a quadratic program

Author: Alamo Teodoro
Camacho Eduardo F.
Muñoz de la Peña Sequedo David
Rodríguez Ramírez Daniel
Publication venue: 'Elsevier BV'
Publication date: 01/01/2005
Field of study

This paper deals with the implementation of min-max model predictive control for constrained linear systems with bounded additive uncertainties and quadratic cost functions. This type of controller has been shown to be a continuous piecewise affine function of the state vector by geometrical methods. However, no algorithm for computing the explicit solution has been given. In this paper, we show that the min-max optimization problem can be expressed as a multi-parametric quadratic program, and so, the explicit form of the controller may be determined by standard multi-parametric techniques

idUS. Depósito de Investigación Universidad de Sevilla

Store-and-forward based methods for the signal control problem in large-scale congested urban road networks

Author: Abu-Lebdeh
Cremer
Daganzo
Diakaki
Diakaki
E. Kosmatopoulos
Gartner
Helgason
Hunt
K. Aboudolas
Kosmatopoulos
Lo
Lo
M. Papageorgiou
Mirchandani
Papageorgiou
Papageorgiou
Rosen
Schutter
Singh
Publication venue: 'Elsevier BV'
Publication date: 01/04/2009
Field of study

The problem of designing network-wide traffic signal control strategies for large-scale congested urban road networks is considered. One known and two novel methodologies, all based on the store-and-forward modeling paradigm, are presented and compared. The known methodology is a linear multivariable feedback regulator derived through the formulation of a linear-quadratic optimal control problem. An alternative, novel methodology consists of an open-loop constrained quadratic optimal control problem, whose numerical solution is achieved via quadratic programming. Yet a different formulation leads to an open-loop constrained nonlinear optimal control problem, whose numerical solution is achieved by use of a feasible-direction algorithm. A preliminary simulation-based investigation of the signal control problem for a large-scale urban road network using these methodologies demonstrates the comparative efficiency and real-time feasibility of the developed signal control methods

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Enlighten

Structured Linearization of Discrete Mechanical Systems for Analysis and Optimal Control

Author: Johnson Elliot
Murphey Todd
Schultz Jarvis
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 31/08/2017
Field of study

Variational integrators are well-suited for simulation of mechanical systems because they preserve mechanical quantities about a system such as momentum, or its change if external forcing is involved, and holonomic constraints. While they are not energy-preserving they do exhibit long-time stable energy behavior. However, variational integrators often simulate mechanical system dynamics by solving an implicit difference equation at each time step, one that is moreover expressed purely in terms of configurations at different time steps. This paper formulates the first- and second-order linearizations of a variational integrator in a manner that is amenable to control analysis and synthesis, creating a bridge between existing analysis and optimal control tools for discrete dynamic systems and variational integrators for mechanical systems in generalized coordinates with forcing and holonomic constraints. The forced pendulum is used to illustrate the technique. A second example solves the discrete LQR problem to find a locally stabilizing controller for a 40 DOF system with 6 constraints.Comment: 13 page

arXiv.org e-Print Archive

CiteSeerX

Structured Linearization of Discrete Mechanical Systems for Analysis and Optimal Control

Author: Amarilla José Manuel
Morales Enrique
Tartaj Pedro
Vazquez-Santos Maria B.
Publication venue
Publication date: 01/01/2014
Field of study

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Digital.CSIC